Literature DB >> 24969690

A multi-label, semi-supervised classification approach applied to personality prediction in social media.

Ana Carolina E S Lima1, Leandro Nunes de Castro2.   

Abstract

Social media allow web users to create and share content pertaining to different subjects, exposing their activities, opinions, feelings and thoughts. In this context, online social media has attracted the interest of data scientists seeking to understand behaviours and trends, whilst collecting statistics for social sites. One potential application for these data is personality prediction, which aims to understand a user's behaviour within social media. Traditional personality prediction relies on users' profiles, their status updates, the messages they post, etc. Here, a personality prediction system for social media data is introduced that differs from most approaches in the literature, in that it works with groups of texts, instead of single texts, and does not take users' profiles into account. Also, the proposed approach extracts meta-attributes from texts and does not work directly with the content of the messages. The set of possible personality traits is taken from the Big Five model and allows the problem to be characterised as a multi-label classification task. The problem is then transformed into a set of five binary classification problems and solved by means of a semi-supervised learning approach, due to the difficulty in annotating the massive amounts of data generated in social media. In our implementation, the proposed system was trained with three well-known machine-learning algorithms, namely a Naïve Bayes classifier, a Support Vector Machine, and a Multilayer Perceptron neural network. The system was applied to predict the personality of Tweets taken from three datasets available in the literature, and resulted in an approximately 83% accurate prediction, with some of the personality traits presenting better individual classification rates than others.
Copyright © 2014 Elsevier Ltd. All rights reserved.

Entities:  

Keywords:  Big Five; Multi-label classification; Personality; Semi-supervised learning; Social media; Twitter

Mesh:

Year:  2014        PMID: 24969690     DOI: 10.1016/j.neunet.2014.05.020

Source DB:  PubMed          Journal:  Neural Netw        ISSN: 0893-6080


  4 in total

1.  Public Perception Analysis of Tweets During the 2015 Measles Outbreak: Comparative Study Using Convolutional Neural Network Models.

Authors:  Jingcheng Du; Lu Tang; Yang Xiang; Degui Zhi; Jun Xu; Hsing-Yi Song; Cui Tao
Journal:  J Med Internet Res       Date:  2018-07-09       Impact factor: 5.428

2.  Examining the Impact of COVID-19 Lockdown in Wuhan and Lombardy: A Psycholinguistic Analysis on Weibo and Twitter.

Authors:  Yue Su; Jia Xue; Xiaoqian Liu; Peijing Wu; Junxiang Chen; Chen Chen; Tianli Liu; Weigang Gong; Tingshao Zhu
Journal:  Int J Environ Res Public Health       Date:  2020-06-24       Impact factor: 3.390

3.  Examining the Psychological State Analysis Relationship Between Bitcoin Prices and COVID-19.

Authors:  JianPing Hou; Jingyi Liu; YingJiang Jie
Journal:  Front Psychol       Date:  2021-03-22

4.  Predicting Personality and Psychological Distress Using Natural Language Processing: A Study Protocol.

Authors:  Jihee Jang; Seowon Yoon; Gaeun Son; Minjung Kang; Joon Yeon Choeh; Kee-Hong Choi
Journal:  Front Psychol       Date:  2022-04-07
  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.