| Literature DB >> 35992414 |
Yufei Huang1,2, Jianqiu Zhang3.
Abstract
An accurate personality model is crucial to many research fields. Most personality models have been constructed using linear factor analysis (LFA). In this paper, we investigate if an effective deep learning tool for factor extraction, the Variational Autoencoder (VAE), can be applied to explore the factor structure of a set of personality variables. To compare VAE with LFA, we applied VAE to an International Personality Item Pool (IPIP) Big 5 dataset and an IPIP HEXACO (Humility-Honesty, Emotionality, Extroversion, Agreeableness, Conscientiousness, Openness) dataset. We found that LFA tends to break factors into ever smaller, yet still significant fractions, when the number of assumed latent factors increases, leading to the need to organize personality variables at the factor level and then the facet level. On the other hand, the factor structure returned by VAE is very stable and VAE only adds noise-like factors after significant factors are found as the number of assumed latent factors increases. VAE reported more stable factors by elevating some facets in the HEXACO scale to the factor level. Since this is a data-driven process that exhausts all stable and significant factors that can be found, it is not necessary to further conduct facet level analysis and it is anticipated that VAE will have broad applications in exploratory factor analysis in personality research.Entities:
Keywords: Big 5 personality factors; HEXACO model of personality; artificial intelligence; deep learning; non-linear factor analysis; personality trait; variational auto encoder (VAE)
Year: 2022 PMID: 35992414 PMCID: PMC9388855 DOI: 10.3389/fpsyg.2022.863926
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
Figure 1An example of a VAE with 100 hidden middle layer nodes and 8 bottleneck layer nodes.
Figure 2Exploratory factor analysis of the IPIP Big 5 dataset.
Figure 3Input-reconstruction correlation reduction when assuming 10 Latent Factors in LFA.
The Mean (Std) of variable-wise input-reconstruction correlations.
|
|
| |||
|---|---|---|---|---|
|
|
|
|
|
|
| 50 | 0.694 (0.066) | 0.714 (0.062) | 0.670 (0.076) | 0.671 (0.075) |
| 100 | 0.728 (0.061) | 0.748 (0.057) | 0.715 (0.058) | 0.716 (0.059) |
| 200 | 0.731 (0.058) | 0.752 (0.050) | 0.730 (0.057) | 0.750 (0.048) |
N = 3,944 samples from the testing dataset. When two middle layers are used, the number of nodes used in the second mid-layer is half of that in the first middle layer.
Figure 4Mean of input-reconstruction correlations in VAE and LFA.
Figure 5Mean of R2 statistics in VAE and communality in LFA.
Figure 6Clustering of factors from 10 VAE runs with 10 bottleneck layer nodes and 100 middle layer nodes in the Big 5 analysis.
Figure 7Input-reconstruction correlation reduction in a Big5 VAE run (E, Extraversion; N, Neuroticism; A, Agreeableness; C, Conscientiousness; O, Openness. See Table 2 for variable definitions).
Factor-variable association by inspecting input-reconstruction correlation reduction with 10 bottleneck layer nodes.
|
|
| |
|---|---|---|
| Factor | Neuroticism | |
| N6 | 0.54880679 | I get upset easily. |
| N9 | 0.52292319 | I get irritated easily. |
| N1 | 0.49218794 | I get stressed out easily. |
| N8 | 0.46884062 | I have frequent mood swings. |
| N7 | 0.43760175 | I change my mood a lot. |
| N3 | 0.39057886 | I worry about things. |
| N10 | 0.34668783 | I often feel blue. |
| N5 | 0.3111992 | I am easily disturbed. |
| N2 | 0.29470729 | I am relaxed most of the time. |
| N4 | 0.17285815 | I seldom feel blue. |
| Factor | Agreeableness | |
| A4 | 0.67151324 | I sympathize with others' feelings. |
| A9 | 0.51527633 | I feel others' emotions. |
| A8 | 0.43419151 | I take time out for others. |
| A5 | 0.43105851 | I am not interested in other people's problems. |
| A6 | 0.41301325 | I have a soft heart. |
| A7 | 0.31421132 | I am not really interested in others. |
| A2 | 0.24600361 | I am interested in people. |
| A1 | 0.18277531 | I feel little concern for others. |
| A3 | 0.1748963 | I insult people. |
| A10 | 0.15831119 | I make people feel at ease. |
| Factor | Conscientiousness | |
| C5 | 0.4724694 | I get chores done right away. |
| C9 | 0.47059017 | I follow a schedule. |
| C1 | 0.39228489 | I am always prepared. |
| C6 | 0.37797469 | I often forget to put things back in their proper place. |
| C7 | 0.361025 | I like order. |
| C2 | 0.32991215 | I leave my belongings around. |
| C4 | 0.28578476 | I make a mess of things. |
| C8 | 0.23683019 | I shirk my duties. |
| C10 | 0.22386922 | I am exacting in my work. |
| C3 | 0.16418281 | I pay attention to details. |
| Factor | X1 | |
| A1 | 0.16598865 | I feel little concern for others. |
| Factor | Not Stable | |
| O4 | 0.17283142 | I am not interested in abstract ideas. |
| N4 | 0.13201049 | I seldom feel blue. |
| O2 | 0.10712201 | I have difficulty understanding abstract ideas. |
| Factor | Not Stable | |
| O9 | 0.13302234 | I spend time reflecting on things. |
| A3 | 0.11271361 | I insult people. |
| Factor | Linguistic Intellect | |
| O1 | 0.53025199 | I have a rich vocabulary. |
| O8 | 0.47849211 | I use difficult words. |
| O7 | 0.32149661 | I am quick to understand things. |
| O2 | 0.28731204 | I have difficulty understanding abstract ideas. |
| O5 | 0.2294474 | I have excellent ideas. |
| O10 | 0.21341601 | I am full of ideas. |
| O4 | 0.17363996 | I am not interested in abstract ideas. |
| O9 | 0.10652723 | I spend time reflecting on things. |
| Factor | Imagination | |
| O6 | 0.40875143 | I do not have a good imagination. |
| O3 | 0.33323853 | I have a vivid imagination. |
| O10 | 0.1164824 | I am full of ideas. |
| Factor | Extraversion | |
| E4 | 0.54024501 | I keep in the background. |
| E2 | 0.52025504 | I don't talk a lot. |
| E7 | 0.51419152 | I talk to a lot of different people at parties. |
| E5 | 0.50472147 | I start conversations. |
| E10 | 0.50394706 | I am quiet around strangers. |
| E1 | 0.4833213 | I am the life of the party. |
| E9 | 0.42410076 | I don't mind being the center of attention. |
| E8 | 0.38028153 | I don't like to draw attention to myself. |
| E6 | 0.34816174 | I have little to say. |
| E3 | 0.33659072 | I feel comfortable around people. |
| Factor | Not stable | |
| C3 | 0.13559234 | I pay attention to details. |
N = 3,944 testing samples; Only variables with correlation reduction higher than 0.1 are kept for each factor.
Correlations between the 7 stable factors in the VAE IPIP Big 5 dataset analysis.
|
|
|
|
|
|
| |
|---|---|---|---|---|---|---|
| (1) Conscientiousness | - | |||||
| (2) Linguistic Intellect | −0.03 | - | ||||
| (3) Extraversion | −0.05 | −0.02 | - | |||
| (4) Agreeableness |
| −0.01 |
| - | ||
| (5) X1 | 0.01 | −0.05 | 0.02 | 0 | - | |
| (6) Neuroticism | 0 | 0.03 | −0.05 |
| −0.03 | - |
| (7) Imagination | 0.02 |
| −0.02 | −0.03 |
| −0.02 |
N = 3,944.
P < 0.05; Bold values and
are indicate that P-value < 0.001.
Figure 8Variable-wise and sample-wise correlation statistics in VAE and LFA.
Figure 9Scree plot of the eigenvalues in the HEXACO dataset.
Figure 10HEXACO factor- variable association when 6 latent factors are assumed in LFA (see section Methods for the list of acronyms).
Figure 11HEXACO factor-variable association when 9 latent factors are assumed in LFA.
Figure 12HEXACO factor-personality variable association when 14 latent factors are assumed in LFA.
Figure 13VAE analysis of the IPIP HEXACO dataset using 12 bottleneck layer nodes.
Figure 14HEXACO factor-personality variable association when 9 latent factors are assumed in VAE.
Figure 15HEXACO factor-personality variable association when 14 latent factors are assumed in VAE.
Correlations between rotated factors in the IPIP HEXACO VAE analysis.
|
|
|
|
|
|
|
|
| |
|---|---|---|---|---|---|---|---|---|
| (1) Inquisitiveness | 1 | |||||||
| (2) Agreeableness | 0.03 | 1 | ||||||
| (3) Extraversion | 0 | 0.05 | 1 | |||||
| (4) Conscientiousness | – | 0.05 |
| 1 | ||||
| (5) Humility | 0.05 | −0.01 | 0.05 | – | 1 | |||
| (6) Machiavellianism | 0.02 | – | −0.05 | 0.02 | 0.04 | 1 | ||
| (7) Thrill-seeking | – | 0.04 | 0.04 | 0.02 | −0.04 |
| 1 | |
| (8) Emotionality | 0 | 0.05 | 0.05 | −0.03 | 0.04 | −0.04 |
| 1 |
| (9) Creativity | – | −0.03 |
| 0.1 | −0.02 | 0.02 | −0.04 | 0.03 |
N = 3,756 testing samples;
P < 0.05; Bold values and
are indicate that P-value < 0.001.
Correlations between factors trained on North American samples and tested on West Europe samples.
|
|
|
|
|
|
|
|
| |
|---|---|---|---|---|---|---|---|---|
| (1) Machiavellianism | 1 | |||||||
| (2) Emotionality |
| 1 | ||||||
| (3) Thrill-seeking | 0.05 | 0.01 | 1 | |||||
| (4) Conscientiousness | 0.2 |
| −0.05 | 1 | ||||
| (5) Inquisitiveness |
|
| −0.03 |
| 1 | |||
| (6) Creativity | 0.1 | 0.01 | 0 |
| −0.02 | 1 | ||
| (7) Extraversion |
| 0.06 | −0.04 | −0.04 |
|
| 1 | |
| (8) Agreeableness | 0.05 |
| 0.03 |
| −0.06 | 0.04 |
| 1 |
| (9) Humility |
| 0.03 |
|
| −0.04 |
| −0.04 |
|
N = 2,607;
P < 0.05; Bold values and
are indicate that P-value < 0.001.
Correlations between factors trained on North American samples and tested on Asian Samples.
|
|
|
|
|
|
|
|
| |
|---|---|---|---|---|---|---|---|---|
| (1) Machiavellianism | 1 | |||||||
| (2) Emotionality |
| 1 | ||||||
| (3) Thrill-seeking | −0.07 | 0 | 1 | |||||
| (4) Conscientiousness |
|
| −0.01 | 1 | ||||
| (5) Inquisitiveness | −0.09 |
| −0.02 | −0.05 | 1 | |||
| (6) Creativity | 0.06 | 0.02 | 0.09 | −0.04 | −0.11 | 1 | ||
| (7) Extraversion |
| 0.02 | 0.06 | 0 |
| −0.09 | 1 | |
| (8) Agreeableness | −0.11 | 0.08 | 0.06 |
| −0.02 | 0.03 |
| 1 |
| (9) Humility |
| 0.11 | 0.11 | −0.04 | −0.02 | 0.11 | 0.01 | −0.07 |
N = 850;
P < 0.05; Bold values and
are indicate that P-value < 0.001.