Personality prediction using machine learning classifiers

machine learning personality prediction Big Five Personality regression

Authors

  • Xin Yee Chin School of Computing Asia Pacific University of Technology and Innovation (APU) Kuala Lumpur, Malaysia
  • Han Yang Lau School of Computing Asia Pacific University of Technology and Innovation (APU) Kuala Lumpur, Malaysia
  • Zhi Xin Chong School of Computing Asia Pacific University of Technology and Innovation (APU) Kuala Lumpur, Malaysia
  • Man Pan Chow School of Computing Asia Pacific University of Technology and Innovation (APU) Kuala Lumpur, Malaysia
  • Zailan Arabee Abdul Salam
    zailan@apu.edu.my
    School of Computing Asia Pacific University of Technology and Innovation (APU) Kuala Lumpur, Malaysia https://orcid.org/0009-0003-4288-0843
Vol. 5 No. 1 (2021)
Original Research
January 20, 2026

Downloads

Personality is a fundamental basis of human behaviour. At most basic, personality including patterns of thought, feeling, behaviours that make an individual unique. Personality will directly or indirectly influence the interaction or preferences of a person. This research using different learning algorithms and concepts of data mining to mine on the data features and learn from the pattern. The aim of this experiment is to explore different options of the algorithm on modifying the personality prediction source code by using logistic regression algorithm, and to find whether the accuracy of the classification can be improved. There are five characteristics of different people that are known as the Big Five characteristic, which is openness, neuroticism, conscientiousness, agreeableness and extraversion that have been stored in the dataset used for training. Then, an overview and comparison will be provided on the different measures taken to reduce the issues faced by researchers in this field. Classification methods implemented are Support Vector Machine, Ridge Algorithm, Naive Bayes, Logistic Regression and Voting Classifier. Testing results showed that the Logistic Regression still outperformed the other methods.