Literature DB >> 30283034

Uncovering the complex genetics of human character.

Igor Zwir1,2, Javier Arnedo2, Coral Del-Val2, Laura Pulkki-Råback3, Bettina Konte4, Sarah S Yang5, Rocio Romero-Zaliz2, Mirka Hintsanen6, Kevin M Cloninger7, Danilo Garcia8,9, Dragan M Svrakic1, Sandor Rozsa1, Maribel Martinez1, Leo-Pekka Lyytikäinen10, Ina Giegling4,11, Mika Kähönen12, Helena Hernandez-Cuervo13, Ilkka Seppälä10, Emma Raitoharju10, Gabriel A de Erausquin14, Olli Raitakari15, Dan Rujescu4, Teodor T Postolache16,17, Joohon Sung5, Liisa Keltikangas-Järvinen3, Terho Lehtimäki10, C Robert Cloninger18,19.   

Abstract

Human personality is 30-60% heritable according to twin and adoption studies. Hundreds of genetic variants are expected to influence its complex development, but few have been identified. We used a machine learning method for genome-wide association studies (GWAS) to uncover complex genotypic-phenotypic networks and environmental interactions. The Temperament and Character Inventory (TCI) measured the self-regulatory components of personality critical for health (i.e., the character traits of self-directedness, cooperativeness, and self-transcendence). In a discovery sample of 2149 healthy Finns, we identified sets of single-nucleotide polymorphisms (SNPs) that cluster within particular individuals (i.e., SNP sets) regardless of phenotype. Second, we identified five clusters of people with distinct profiles of character traits regardless of genotype. Third, we found 42 SNP sets that identified 727 gene loci and were significantly associated with one or more of the character profiles. Each character profile was related to different SNP sets with distinct molecular processes and neuronal functions. Environmental influences measured in childhood and adulthood had small but significant effects. We confirmed the replicability of 95% of the 42 SNP sets in healthy Korean and German samples, as well as their associations with character. The identified SNPs explained nearly all the heritability expected for character in each sample (50 to 58%). We conclude that self-regulatory personality traits are strongly influenced by organized interactions among more than 700 genes despite variable cultures and environments. These gene sets modulate specific molecular processes in brain for intentional goal-setting, self-reflection, empathy, and episodic learning and memory.

Entities:  

Mesh:

Year:  2018        PMID: 30283034      PMCID: PMC7515844          DOI: 10.1038/s41380-018-0263-6

Source DB:  PubMed          Journal:  Mol Psychiatry        ISSN: 1359-4184            Impact factor:   13.437


Introduction

Strong evidence for substantial heritability of human personality comes from family, twin, and adoption studies [1]. However, the genetic and phenotypic architecture of human personality is complex and has remained uncertain despite recent advances in genomics and phenomics [2-4]. In general, geneticists must expect the likelihood that many genes affect each trait and each gene affects many traits [5]. When the architecture is complex, the same genetic networks may lead to different phenotypic outcomes (a phenomenon called multifinality in development or pleiotropy in genetics) [6-8]. Likewise, different genetic networks in complex systems may lead to the same outcome (equifinality, which is also described as heterogeneity) [8, 9]. Human personality is a striking example of the challenges involved in identifying the specific genes and molecular processes that influence complex traits. Twin studies indicate that between 30% and 60% of the phenotypic variance in personality, as assessed by a variety of instruments, is genetic in origin [10-14]. However, adoption studies and studies that include other family members along with twins show that most of the heritability of personality, as assessed by a variety of instruments, is likely to depend on complex interactions among multiple gene loci (i.e., epistasis) or multiple alleles at a locus (i.e., dominance), rather than the average effects of individual genes [11, 13–17]. Put another way, many genes are likely to operate in concert, not separately, to influence the heritability and development of personality. Nevertheless, despite extensive past effort, genome-wide association studies (GWAS) of personality have found few significant associations using a variety of personality instruments [18-20]. The frequent failure to account for most of the heritability of complex traits has been called the “missing” [21] or “hidden” [22] heritability problem. The Temperament and Character Inventory (TCI) measures two domains of personality hypothesized to be related to different genetic and neuronal networks [23]. Imaging studies show that TCI character traits are associated with brain networks for intentional and meta-cognitive processes, such as self-reflection, goal-setting, empathy, and episodic learning, whereas temperament traits are related to generating and conditioning automatic behaviors, such as stress reactions [24-28]. In this article, we focus on TCI character traits of self-directedness (i.e., purposeful, responsible vs. aimless, blaming), Cooperativeness (i.e., helpful, empathic vs. hostile, self-centered), and self-transcendence (i.e., altruistic, spiritual vs. individualistic, skeptical). These are the self-regulatory components of personality that determine the degree to which a person's adaptive functioning is healthy or unhealthy [29]. In related articles, we examine temperament traits and their relations with character in the same samples. We have chosen to apply strictly data-driven machine learning methods in a person-centered approach to GWAS to uncover the complex genotypic and phenotypic architecture of personality [6, 30, 31] (Supplementary Figure S1). We postulate that personality heritability is not missing, but is distributed in multiple networks of interacting genetic and environmental variables that influence different people [6, 31–33].

Subjects and methods

Description of the samples

Our discovery sample was the Young Finns Study, an epidemiological study of 2149 healthy Finnish children followed regularly from 1980 (ages, 3–18 years) to 2012 (ages, 35–50 years) [34]. Childhood environments were directly assessed with the rearing parents in 1980 and 1983 [35-39]. Adult environments and life events were assessed with subjects in 2001 [40, 41]. All subjects (56% women) had thorough standardized genotypic and phenotypic assessments, including administration of the TCI in 1997, 2001, 2007, and 2012 [34, 42]. We replicated the results in two independent samples of healthy adults from Germany [43, 44] and Korea [45, 46], in which comparable genotypic and phenotypic features were available (see Supplement). The Korean study involved 1052 unrelated individuals extracted from a national register (aged 28–81, 57% women). The German study involved 902 subjects (aged 20–74, 49% women) randomly selected from Munich city registry and screened to exclude anyone with a history of psychiatric illness in themselves or their first-degree relatives.

Personality assessment

All subjects completed the TCI to assess seven heritable dimensions of personality [23, 47]. The TCI measures four dimensions of temperament and three dimensions of character (self-directedness, cooperativeness, and self-transcendence) with strong reliability, as described in Supplementary Section 1 and Supplementary Table S1 [23, 47]. The 13 subscales of character from the TCI were used as the primary data about character in all three samples (Supplementary Section 2). Character profiles for each person were based on median splits of each subscale to distinguish high and low scorers [48].

Personality health indices

People at risk of unhealthy personality were identified as the bottom decile of the sum of TCI self-directedness and cooperativeness [48]. Prior work shows this criterion indicates ill-being or personality disorder (i.e., poor physical, mental, and social functioning) [49, 50]. In contrast, people with healthy personalities were identified as the top decile of the product of all three TCI character traits. Prior work shows that this criterion indicates well-being or flourishing (i.e., superior physical, mental, and social functioning) [29, 48, 51]. These indices provided consistent measures of the health status of subjects in all three samples. The health value of a set (i.e., group of people) is the average value of its members. We also identified an empirical index of character functioning by clustering the 13 character subscales of the TCI (Supplementary Section 3 and Table S2). The empirical index of character provided a single comprehensive measure of character functioning that could be associated a posteriori with each SNP set based on semi-supervised learning [52] and used in SNP-set Kernel Association Test (SKAT) [32, 33] and heritability analyses. It was highly correlated with the other health indicators (p < E-20, RMSE 0.03).

Genotyping

The Finnish sample was genotyped by using Illumina Human670-Quad Custom, (i.e., Illumina 670k custom) arrays [53]. The Korean sample used Affymetrix Genome-Wide Human SNP Array 6.0 and Illumina HumanCore [45]. The German sample used Affymetrix Genome-Wide Human SNP Array 6.0, Illumina OMNI Express and the 300 Array, prephased and imputed with SHAPEIT2 and IMPUTE2. Some German individuals had also been genotyped on Illumina Omni1-Quad. Quality control was performed for all samples as in prior work [6] (Supplementary Section 3). After quality checks, a subset of SNPs were preselected with the PLINK software suite [54] to reduce the large search space using a generously inclusive threshold (p-value <0.01 without Bonferroni correction) for possible association with character, taking gender and ethnicity into account as covariates of the individual SNPs. Preselecting SNPs identified SNPs that have weak associations with character that are not individually significant genome wide after Bonferroni correction, but provided presumptive candidates for epistatic interactions in a SNP set. The preselection also identified SNPs with a strong additive effect individually, thereby providing a manageably sized initial pool of SNPs as candidates for both the additive and non-additive components of the genetic architecture of character. We accounted for ethnicity in each sample by using the first three principal components for ancestral stratification of SNP genotypes (Supplementary Section 3) [55].

Computational procedures

The cluster analyses used the validated Generalized Factorization Method, which utilizes deep non-negative matrix factorization (NMF) to uncover naturally occurring (i.e., unsupervised) associations between patterns across different types of data, including genetics [56-59] and neuroimages [30, 60]. The clustering was entirely data driven without restrictive assumptions about the number or content of the clusters [31]. For example, clusters may have different features, and one subject can belong to more than one cluster [6, 30, 31, 56, 61]. The recurrent application of the clustering process is summarized and schematically related to unsupervised deep NMF learning in Supplementary Figure S1 [62]. The advantages of this clustering approach over alternative analyses of single or multiple markers are described in Supplementary Section 4. Our web server application for phenotype–genotype many-to-many relations analysis (PGMRA) in GWAS is published [31] and online at http://phop.ugr.es/fenogeno. The PGMRA method and algorithm are also summarized in Supplementary Sections 5 and 6, which includes a semi-supervised classifier of phenotypes from genotypes. PGMRA properly accounts for linkage disequilibrium (LD) efficiently (i.e., without loss of information about complex genotypic–phenotypic relations) (Supplementary Section 4). Statistical analysis correcting for multiple comparisons, as well as gender and ethnicity as covariates of the SNP sets, was performed by the SKAT [32, 33], also accessible via PGMRA. Heritability was estimated from a trimmed regression of SNPs on the empirical index of character controlling for outliers and environmental variables [63, 64] (also see Supplementary Section 7). Replicability of results was evaluated in the three independent samples for SNP sets, phenotypic sets, and genotypic–phenotypic relations using multi-objective optimization techniques [6], as detailed in Supplementary Section 8. We also evaluated how well the individual genotypic sets were able to predict the classification of the phenotypes in each sample using the PGMRA classifier (Supplementary Section 9). Further details are available in Supplementary Information and elsewhere [56-59].

Results

Identifying SNP sets as candidates for causal variability

We exhaustively identified 902 non-identical but possibly overlapping SNP sets in the Finnish sample using PGMRA without knowledge of the phenotype. The SNP sets were comprised of different numbers of SNPs and/or subjects, regardless of their phenotypic status. The SNPs were mapped to diverse functional classes of genetic variants that may be located on different chromosomes, frequently even within a single SNP set (Figs. 1a, 2a–d). SNP sets are organized as networks of multilocus genotypes (Fig. 1a, b; Supplementary Figure S2, Supplementary Table S3). They were labeled by a genotypic identification ‘G’, followed by two numbers: the first indicates the maximum number of clusters and the second indicates the order of selection by the algorithm. SNP sets were associated with different health risks (Table 1, Supplementary Table S2).
Fig. 1

a Two examples of SNP sets are represented as heatmap submatrices or biclusters. SNP sets were identified by distinct patterns of molecular features of SNPs in subgroups of subjects. Allele values are indicated as BB (dark blue), AB (intermediate blue), AA (light blue), and missing (black). SNP sets were labeled for specificity by a pair of numbers representing the maximum number of clusters from which the bicluster was selected (e.g., 33 clusters may produce more specific than 21) and the order in which they were selected by the method (e.g., 4th bicluster or factor selected by FNMF when the maximum number of clusters was 21) and usually have a prefix G for genotype or P for phenotype. Only a subset of optimal and cohesive sets are selected across all number of clusters (See Supplementary Methods). The SNPs within each SNP set can map to different chromosomes (e.g., 6 and 8) and exhibit distinct molecular consequences (see Supplementary Table S3). The pie chart shows the percentage of SNPs within a SNP set that belong to each type of consequence. b Dissection of a GWAS in a Finnish population to identify the genotypic and phenotypic architecture of personality measured by the TCI. The genotypic network is depicted as nodes (SNP sets) linked by shared SNPs (blue lines) and/or subjects (red lines) (see also Supplementary Figure S3A for additional subnetworks). Each SNP set maps to one or more genes (see Supplementary Table S6 for full list of genes associated with each SNP set). SNP sets associated with each of the five general character profiles are distinguished by color-coding as shown in the legend (see Table 3). c, d Comparison of level of ill-being (c where high values indicate ill-being) and for level of well-being (d where high values indicate well-being) in groups of subjects with each of the five character profiles specified by both phenotypic and genotypic information (evaluated by ANOVA). (Compare with either genetic or phenotypic assessment alone in Supplementary Figure S6). e Variation in health status of SNP sets: well (blue, see d), ill (orange, see c), intermediate (gray). f 12 genotypic-phenotypic pipelines connect different sets of genes to the same character dimension (see also Supplementary Tables S9–S12). Red lines indicate direct connections, whereas blue lines and “&” indicate composite connections. g Surface showing the pattern of health status of the subjects in this study based on SNP set information only (i.e., interpolation from Table 1). The probability of well-being in the z-axis varies from high (red for high well-being) to low (green). The order of the SNP sets is based on shared subjects (x-axis) and on shared SNPs (y-axis) measured by hypergeometric statistics, so SNP sets sharing more SNPs and/or subjects are nearby (see ill health surface in Supplementary Figure S4). h Surface showing the pattern of health status of subjects based on both genotypic information (SNP sets) and phenotypic information (character sets) (as in Table 3). The probability of well-being in the z-axis varies from high (red, high well-being) to low (green). The sharing of subjects is shown for both SNP sets (x-axis) and character sets (y-axis) (see ill health surface in Supplementary Figure S5)

Fig. 2

a, b Types of genetic variants mapped by SNP sets associated with character: a Specific molecular consequences (Supplementary Table S5) and b their subtypes. Genes related only to character sets (red) were less often protein coding and more often RNA genes than those also associated with temperament sets (blue color). c Cell displaying the molecular pathways containing genes associated only with the organized profile. The uncovered genes influence the phosphatidyl inositol/calcium second-messenger signaling system that regulates the seeking of food and other goals in response to external environmental signals (see also Supplementary Tables S4, S7). d Multiple SNPs within a SNP set can affect a single or multiple genes in many ways (Supplementary Table S3). Within the MTA3 gene, SNPs in the SNP set G_12_1 may affect both coding and regulatory regions (thereby inhibiting transcription), whereas SNPs from SNP set 40_26 are mostly located in intronic regions (thereby blocking or decreasing protein production). The SNP sets are associated with profiles exhibiting distinct character features (creative vs. apathetic)

Table 1

Description of 42 SNP sets associated with character sets (p < 1E-05)

Finnish sampleProbability of healthaGenes
SNP setsSNP-set name% CodingSKAT p-valueBest SNPAverage SNPsaSubjectsaSNPsWell-beingIll-being
G_3_1Inositol-calcium signaling*622.88E-1023.01E-052.64E-0131121630.060.1>300
G_8_8Inositol/chemokine pathways672.21E-558.55E-051.99E-012246110.080.07291
G_7_2GPCR dysregulation622.07E-319.00E-052.44E-012113030.090.23142
G_7_3Neurogenesis631.67E-201.07E-041.85E-011333640.170.36136
G_11_4Inositol signaling551.22E-199.00E-053.29E-021411720.070.0451
G_12_8Neuroprotection621.49E-162.53E-043.39E-011732850.090.03111
G_7_7Olfaction561.32E-113.27E-052.25E-011451930.030.155
G_36_29Electron transport505.37E-094.27E-043.37E-01251850.080.4876
G_31_8Neurotrophin551.15E-082.90E-053.13E-01541830.090.5464
G_28_15Histone methylation442.77E-083.76E-052.23E-011011230.080.3834
G_9_8Neuroregulation573.82E-081.11E-043.48E-012092300.170.1277
G_24_6GFI1-neurite outgrowth365.22E-082.65E-047.47E-0272630.10.0814
G_19_5DARPP320-neuroplasticity308.31E-081.23E-042.13E-0186590.160.2210
G_33_15ERK-neurodevelopment531.60E-074.47E-048.47E-0226670.270.2319
G_23_2Biogenic amine synthesis502.24E-071.39E-049.46E-0242560.050.298
G_3_2PAK-neuroprotection633.08E-071.70E-051.80E-011331970.180.2435
G_22_6Blood–brain barrier593.37E-072.53E-042.35E-0137930.080.1632
G_34_13CREB-episodic learning543.88E-072.38E-041.28E-0141490.050.2913
G_40_26Dopamine-feedback656.08E-073.88E-042.61E-0139980.080.3617
G_21_3Cellular senescence621.12E-061.85E-043.55E-01601170.10.2334
G_20_2Enhanced memory791.59E-062.78E-042.34E-0125800.240.1219
G_28_11Sensory transduction442.07E-068.29E-042.71E-0132810.220.069
G_12_1Episodic learning615.06E-069.00E-053.41E-011461890.20.0666
G_41_33GPCR neuroplasticity405.31E-064.91E-042.71E-0156760.110.2115
G_9_3Pyrimidine metabolism506.03E-064.54E-052.55E-02164350.120.046
G_26_14Glucose transport596.98E-061.08E-042.20E-0146750.090.2427
G_20_3Fatty acid oxidation488.03E-062.07E-043.04E-0136820.030.3321
G_23_19Org2-RNA01.17E-053.76E-059.37E-0487320.080.034
G_17_14Dep-RNA01.17E-053.76E-059.37E-0447320.090.154
G_27_25Org3-RNA01.17E-053.76E-059.37E-0454320.110.074
G_41_40Apath-RNA01.17E-053.76E-059.37E-0434320.090.034
G_38_10Org5-RNA01.17E-053.76E-059.37E-0427320.150.194
G_33_19Res-RNA01.17E-053.76E-059.37E-0443320.090.124
G_21_4Org1-RNA01.28E-053.76E-059.41E-0268430.070.074
G_10_1Learning/memory471.29E-053.27E-057.44E-02131480.070.0615
G_35_12Org4-RNA01.33E-053.76E-059.32E-0445310.160.093
G_35_4O-linked glycosylation571.39E-053.74E-042.83E-0142370.050.247
G_5_1CDK neuroplasticity1001.78E-051.70E-055.60E-02100910.170.41
G_19_9Aurora-B381.91E-052.65E-041.25E-0120460.10.48
G_36_27Aging regulation251.96E-058.29E-041.60E-0127570.330.224
G_16_9Olfactory signaling582.00E-052.31E-041.79E-0164360.080.1712
G_33_11Self-control503.67E-052.65E-049.39E-0243320.260.0715

The SNP sets are named based on molecular pathways and neuronal functions of the genes that distinguish the sets from one another (see Supplementary Table S4). Percentage coding indicates the percentage of protein coding genes. Strengths of association are compared for the SNP set, the best SNP, and average SNP based on SKAT p-values. The number of subjects and SNPs comprising each SNP set is specified. The probabilities of the well-being and ill-being are given for subjects in each SNP set (see also Supplementary Table S2)

aGenes indicates the genes mapped by the SNP set (Figure S6), where genes can be mapped by more than one SNP set

a Two examples of SNP sets are represented as heatmap submatrices or biclusters. SNP sets were identified by distinct patterns of molecular features of SNPs in subgroups of subjects. Allele values are indicated as BB (dark blue), AB (intermediate blue), AA (light blue), and missing (black). SNP sets were labeled for specificity by a pair of numbers representing the maximum number of clusters from which the bicluster was selected (e.g., 33 clusters may produce more specific than 21) and the order in which they were selected by the method (e.g., 4th bicluster or factor selected by FNMF when the maximum number of clusters was 21) and usually have a prefix G for genotype or P for phenotype. Only a subset of optimal and cohesive sets are selected across all number of clusters (See Supplementary Methods). The SNPs within each SNP set can map to different chromosomes (e.g., 6 and 8) and exhibit distinct molecular consequences (see Supplementary Table S3). The pie chart shows the percentage of SNPs within a SNP set that belong to each type of consequence. b Dissection of a GWAS in a Finnish population to identify the genotypic and phenotypic architecture of personality measured by the TCI. The genotypic network is depicted as nodes (SNP sets) linked by shared SNPs (blue lines) and/or subjects (red lines) (see also Supplementary Figure S3A for additional subnetworks). Each SNP set maps to one or more genes (see Supplementary Table S6 for full list of genes associated with each SNP set). SNP sets associated with each of the five general character profiles are distinguished by color-coding as shown in the legend (see Table 3). c, d Comparison of level of ill-being (c where high values indicate ill-being) and for level of well-being (d where high values indicate well-being) in groups of subjects with each of the five character profiles specified by both phenotypic and genotypic information (evaluated by ANOVA). (Compare with either genetic or phenotypic assessment alone in Supplementary Figure S6). e Variation in health status of SNP sets: well (blue, see d), ill (orange, see c), intermediate (gray). f 12 genotypic-phenotypic pipelines connect different sets of genes to the same character dimension (see also Supplementary Tables S9–S12). Red lines indicate direct connections, whereas blue lines and “&” indicate composite connections. g Surface showing the pattern of health status of the subjects in this study based on SNP set information only (i.e., interpolation from Table 1). The probability of well-being in the z-axis varies from high (red for high well-being) to low (green). The order of the SNP sets is based on shared subjects (x-axis) and on shared SNPs (y-axis) measured by hypergeometric statistics, so SNP sets sharing more SNPs and/or subjects are nearby (see ill health surface in Supplementary Figure S4). h Surface showing the pattern of health status of subjects based on both genotypic information (SNP sets) and phenotypic information (character sets) (as in Table 3). The probability of well-being in the z-axis varies from high (red, high well-being) to low (green). The sharing of subjects is shown for both SNP sets (x-axis) and character sets (y-axis) (see ill health surface in Supplementary Figure S5)
Table 3

The strength of the genotypic–phenotypic relationships among SNP and character sets and their corresponding health measurements

Character setsCharacter consensus setsSNP setsSNP-set namesHypergeo-metric C-GHealth measurements of subjects
Char setsSNP setsBoth Sets Jointly
Well-beingIll-beingWell-beingIll-beingWell-beingIll-being
C_14_8ResourcefulG_12_8Neuroprotection1.29E-030.010.010.090.030.090.03
C_10_7ResourcefulG_12_8Neuroprotection2.79E-030.4100.090.030.410.03
C_10_7ResourcefulG_33_11Self-control3.68E-030.4100.260.070.410.07
C_10_6iResourcefulG_33_19Res-RNA4.33E-030.030.380.090.120.090.38
C_14_8ResourcefulG_11_4Inositol signaling4.86E-030.010.010.070.040.070.04
C_4_4OrganizedG_11_4Inositol signaling1.26E-110.0100.070.040.070.04
C_3_1OrganizedG_11_4Inositol signaling7.18E-09000.070.040.070.04
C_5_1OrganizedG_11_4Inositol signaling2.19E-060.0500.070.040.070.04
C_4_4OrganizedG_12_8Neuroprotection9.79E-060.0100.090.030.090.03
C_3_1OrganizedG_12_8Neuroprotection1.38E-05000.090.030.090.03
C_8_7OrganizedG_11_4Inositol signaling2.78E-050.0400.070.040.070.04
C_4_4OrganizedG_10_1Learning/memory4.79E-050.0100.070.060.070.06
C_3_1OrganizedG_10_1Learning/memory9.45E-05000.070.060.070.06
C_3_1OrganizedG_21_4Org1-RNA1.70E-04000.070.070.070.07
C_4_4OrganizedG_8_8Global inositol/chemokine pathways1.86E-040.0100.080.070.080.07
C_14_13OrganizedG_24_6Neurogenesis1.86E-040.0900.10.080.10.08
C_4_4OrganizedG_3_1Inositol calcium signaling3.38E-040.0100.060.10.060.1
C_9_8OrganizedG_12_8Neuroprotection4.62E-040.1200.090.030.120.03
C_3_1OrganizedG_3_1Inositol calcium signaling7.05E-04000.060.10.060.1
C_7_7OrganizedG_11_4Inositol signaling7.17E-040.0700.070.040.070.04
C_5_1OrganizedG_12_8Neuroprotection7.58E-040.0500.090.030.090.03
C_14_13OrganizedG_12_8Neuroprotection8.50E-040.0900.090.030.090.03
C_14_13OrganizedG_23_19Org2-RNA1.00E-030.0900.080.030.090.03
C_5_1OrganizedG_10_1Learning/memory1.09E-030.0500.070.060.070.06
C_3_1OrganizedG_8_8Global inositol/chemokine pathways1.12E-03000.080.070.080.07
C_7_7OrganizedG_17_14Dep-RNA1.32E-030.0700.090.150.090.15
C_15_1iOrganizedG_36_29Electron transport1.66E-0300.790.080.480.080.79
C_5_1OrganizedG_8_8Global inositol/chemokine pathways2.07E-030.0500.080.070.080.07
C_14_13OrganizedG_11_4Inositol signaling2.33E-030.0900.070.040.090.04
C_9_1iOrganizedG_38_10Org5-RNA2.34E-0300.020.150.190.150.19
C_7_5iOrganizedG_35_12Org4-RNA2.43E-030.50.060.160.090.50.09
C_7_7OrganizedG_12_8Neuroprotection2.50E-030.0700.090.030.090.03
C_5_1OrganizedG_17_14Dep-RNA3.00E-030.0500.090.150.090.15
C_6_5OrganizedG_11_4Inositol signaling3.04E-030.0700.070.040.070.04
C_12_9OrganizedG_7_3Neurogenesis3.13E-030.5400.170.360.540.36
C_8_7OrganizedG_10_1Learning/memory3.54E-030.0400.070.060.070.06
C_4_4OrganizedG_24_6Neurogenesis3.63E-030.0100.10.080.10.08
C_14_9OrganizedG_31_8Neurotrophin3.65E-030.020.290.090.540.090.54
C_14_9OrganizedG_7_3Neurogenesis3.66E-030.020.290.170.360.170.36
C_9_6OrganizedG_27_25Org3-RNA3.84E-030.110.020.110.070.110.07
C_8_7OrganizedG_8_8Global inositol/chemokine pathways3.99E-030.0400.080.070.080.07
C_12_9OrganizedG_9_8Neuroregulation4.96E-030.5400.170.120.540.12
C_3_3CreativeG_7_3Neurogenesis1.22E-060.9200.170.360.920.36
C_3_3CreativeG_5_1CDK neuroplasticity2.00E-060.9200.170.40.920.4
C_15_5CreativeG_3_2PAK-neuroprotection6.63E-060.720.030.180.240.720.24
C_4_3CreativeG_20_2Enhanced memory1.13E-050.9700.240.120.970.12
C_14_1CreativeG_7_3Neurogenesis2.80E-050.250.110.170.360.250.36
C_11_3CreativeG_33_15ERK-neurodevelopment3.99E-050.970.030.270.230.970.23
C_8_8CreativeG_20_2Enhanced memory6.49E-05100.240.1210.12
C_3_3CreativeG_20_2Enhanced memory9.26E-050.9200.240.120.920.12
C_5_5CreativeG_20_2Enhanced memory1.41E-04100.240.1210.12
C_4_3CreativeG_33_15ERK-neurodevelopment1.65E-040.9700.270.230.970.23
C_5_5CreativeG_33_15ERK-neurodevelopment1.78E-04100.270.2310.23
C_12_7CreativeG_33_15ERK-neurodevelopment3.21E-040.90.040.270.230.90.23
C_4_3CreativeG_7_3Neurogenesis3.65E-040.9700.170.360.970.36
C_5_5CreativeG_7_3Neurogenesis4.21E-04100.170.3610.36
C_6_1CreativeG_33_15ERK-neurodevelopment5.02E-04100.270.2310.23
C_5_5CreativeG_19_5DARPP320-Neuroplasticity5.30E-04100.160.2210.22
C_3_3CreativeG_19_5DARPP320-neuroplasticity7.66E-040.9200.160.220.920.22
C_3_3CreativeG_33_15ERK-neurodevelopment8.46E-040.9200.270.230.920.23
C_4_3CreativeG_12_1Episodic learning9.12E-040.9700.20.060.970.06
C_7_2CreativeG_33_15ERK-Neurodevelopment9.82E-040.9800.270.230.980.23
C_7_2CreativeG_36_27Aging regulation1.18E-030.9800.330.220.980.22
C_13_1CreativeG_20_2Enhanced memory1.23E-030.90.020.240.120.90.12
C_13_1CreativeG_33_15ERK-neurodevelopment1.44E-030.90.020.270.230.90.23
C_3_3CreativeG_12_1Episodic learning1.59E-030.9200.20.060.920.06
C_12_7CreativeG_12_1Episodic learning2.21E-030.90.040.20.060.90.06
C_3_3CreativeG_9_8Neuroregulation2.27E-030.9200.170.120.920.12
C_3_3CreativeG_28_11Sensory transduction3.14E-030.9200.220.060.920.06
C_13_1CreativeG_28_11Sensory transduction3.17E-030.90.020.220.060.90.06
C_4_3CreativeG_9_8Neuroregulation3.32E-030.9700.170.120.970.12
C_5_5CreativeG_12_1Episodic learning3.32E-03100.20.0610.06
C_9_2iCreativeG_34_13CREB-Episodic learning4.05E-030.050.20.050.290.050.29
C_12_7CreativeG_7_3Neurogenesis4.10E-030.90.040.170.360.90.36
C_12_7CreativeG_19_5DARPP320-Neuroplasticity4.22E-030.90.040.160.220.90.22
C_11_3CreativeG_9_8Neuroregulation4.24E-030.970.030.170.120.970.12
C_15_7DependentG_11_4Inositol signaling1.80E-060.030.020.070.040.070.04
C_12_6DependentG_31_8Neurotrophin8.93E-060.090.410.090.540.090.54
C_12_6DependentG_41_33GPCR neuroplasticity1.18E-050.090.410.110.210.110.41
C_15_13DependentG_31_8Neurotrophin6.14E-0500.790.090.540.090.79
C_4_2DependentG_28_15Histone methylation6.90E-0500.450.080.380.080.45
C_9_5DependentG_7_2GPCR dysregulation1.01E-0400.260.090.230.090.26
C_15_7DependentG_8_8Global inositol/chemokine pathways2.03E-040.030.020.080.070.080.07
C_15_13DependentG_28_15Histone methylation2.92E-0400.790.080.380.080.79
C_4_2DependentG_5_1CDK neuroplasticity3.98E-0400.450.170.40.170.45
C_14_5DependentG_7_7Olfaction8.92E-0400.290.030.10.030.29
C_15_7DependentG_17_14Dep-RNA1.01E-030.030.020.090.150.090.15
C_6_3DependentG_17_14Dep-RNA1.09E-0300.320.090.150.090.32
C_12_6DependentG_7_3Neurogenesis1.21E-030.090.410.170.360.170.41
C_5_3DependentG_5_1CDK neuroplasticity1.43E-030.020.290.170.40.170.4
C_15_13DependentG_5_1CDK neuroplasticity1.83E-0300.790.170.40.170.79
C_5_3DependentG_7_3Neurogenesis2.32E-030.020.290.170.360.170.36
C_9_5DependentG_5_1CDK neuroplasticity2.63E-0300.260.170.40.170.4
C_4_2DependentG_7_3Neurogenesis2.66E-0300.450.170.360.170.45
C_6_3DependentG_7_7Olfaction2.73E-0300.320.030.10.030.32
C_4_2DependentG_31_8Neurotrophin2.93E-0300.450.090.540.090.54
C_6_3DependentG_22_6Blood-brain barrier3.40E-0300.320.080.160.080.32
C_4_2DependentG_41_33GPCR neuroplasticity3.44E-0300.450.110.210.110.45
C_7_4DependentG_5_1CDK neuroplasticity3.56E-0300.740.170.40.170.74
C_14_5DependentG_28_15Histone methylation3.73E-0300.290.080.380.080.38
C_3_2ApatheticG_5_1CDK neuroplasticity6.13E-10010.170.40.171
C_3_2ApatheticG_7_3Neurogenesis4.61E-08010.170.360.171
C_3_2ApatheticG_28_15Histone methylation2.32E-05010.080.380.081
C_9_3iApatheticG_26_14Glucose transport3.12E-04010.090.240.091
C_3_2ApatheticG_7_2GPCR dysregulation4.18E-04010.090.230.091
C_10_2ApatheticG_11_4Inositol signaling4.25E-040.0100.070.040.070.04
C_12_4iApatheticG_35_4O-linked glycosylation4.30E-040.030.530.050.240.050.53
C_11_4ApatheticG_16_9Olfactory signaling5.78E-0400.280.080.170.080.28
C_7_3iApatheticG_36_29Electron transport5.96E-04010.080.480.081
C_10_8ApatheticG_5_1CDK neuroplasticity6.43E-0400.790.170.40.170.79
C_14_7iApatheticG_36_29Electron transport7.85E-04010.080.480.081
C_11_10iApatheticG_36_29Electron transport8.39E-040.030.50.080.480.080.5
C_11_6iApatheticG_20_3Fatty acid oxidation1.06E-0300.710.030.330.030.71
C_10_8ApatheticG_31_8Neurotrophin1.21E-0300.790.090.540.090.79
C_15_15iApatheticG_41_40Apha-RNA1.61E-0300.30.090.030.090.3
C_14_11ApatheticG_28_15Histone methylation2.13E-0300.470.080.380.080.47
C_3_2ApatheticG_40_26Dopamine-feedback2.41E-03010.080.360.081
C_10_2ApatheticG_9_3Pyrimidine metabolism2.60E-030.0100.120.040.120.04
C_13_3iApatheticG_26_14Glucose transport2.67E-0300.250.090.240.090.25
C_8_6iApatheticG_36_29Electron transport2.69E-03010.080.480.081
C_10_8ApatheticG_19_9Aurora-B3.25E-0300.790.10.40.10.79
C_10_8ApatheticG_7_3Neurogenesis3.47E-0300.790.170.360.170.79
C_10_8ApatheticG_23_2Biogenic synthesis3.56E-0300.790.050.290.050.79
C_11_6iApatheticG_36_29Electron transport3.88E-0300.710.080.480.080.71
C_12_5ApatheticG_7_2GPCR dysregulation3.99E-0300.090.090.230.090.23
C_10_2ApatheticG_10_1Learning/memory4.03E-030.0100.070.060.070.06
C_8_3ApatheticG_21_3Cellular Senescence4.17E-0300.640.10.230.10.64
C_14_3iApatheticG_26_14Glucose transport4.43E-030.060.380.090.240.090.38

Association is measured by Fisher's exact test (hypergeometric). Probabilities of well-being and ill-being are given for subjects in the character sets, the SNP sets, and subjects identified in both jointly. iIndicates character sets that are more specific than their parental sets, which are also selected

a, b Types of genetic variants mapped by SNP sets associated with character: a Specific molecular consequences (Supplementary Table S5) and b their subtypes. Genes related only to character sets (red) were less often protein coding and more often RNA genes than those also associated with temperament sets (blue color). c Cell displaying the molecular pathways containing genes associated only with the organized profile. The uncovered genes influence the phosphatidyl inositol/calcium second-messenger signaling system that regulates the seeking of food and other goals in response to external environmental signals (see also Supplementary Tables S4, S7). d Multiple SNPs within a SNP set can affect a single or multiple genes in many ways (Supplementary Table S3). Within the MTA3 gene, SNPs in the SNP set G_12_1 may affect both coding and regulatory regions (thereby inhibiting transcription), whereas SNPs from SNP set 40_26 are mostly located in intronic regions (thereby blocking or decreasing protein production). The SNP sets are associated with profiles exhibiting distinct character features (creative vs. apathetic) Description of 42 SNP sets associated with character sets (p < 1E-05) The SNP sets are named based on molecular pathways and neuronal functions of the genes that distinguish the sets from one another (see Supplementary Table S4). Percentage coding indicates the percentage of protein coding genes. Strengths of association are compared for the SNP set, the best SNP, and average SNP based on SKAT p-values. The number of subjects and SNPs comprising each SNP set is specified. The probabilities of the well-being and ill-being are given for subjects in each SNP set (see also Supplementary Table S2) aGenes indicates the genes mapped by the SNP set (Figure S6), where genes can be mapped by more than one SNP set

Identifying clusters of subjects with distinct character profiles

We identified 342 non-identical but possibly overlapping character sets using the 13 character subscales without knowledge of the genotype. Character sets were labeled by a phenotypic identification “C” to distinguish them from the SNP sets. These fine-grained character sets were nested within five character supersets that were identified by recurrently applying PGMRA to minimize the cophenetic correlation coefficient (Table 2) [62]. In other words, five groups of people had highly distinct character profiles.
Table 2

Description of the five character profiles (supersets) and composite character sets identified by PGMRA from profiles of TCI subscales (Y = yes)

Char setsSupersetsNamesd1sd2sd3sd4sd5co1co2co3co4co5st1st2st3Lsd1Lsd2Lsd3Lsd4Lsd5Lco1Lco2Lco3Lco4Lco5Lst1Lst2Lst3#SWell BeingIll Being
C_14_81ResourcefulYYY790.010.01
C_10_71YYYY1020.410
C_10_61YYYY330.030.38
C_14_132OrganizedYYYYYYY920.090
C_14_92YYYYYYYY420.020.29
C_9_82YYYYYYYYYYYY720.120
C_12_92YYYYYYYY410.540
C_6_52YYYYYYYYYYYY560.070
C_8_72YYYYYYYYYYYYY1610.040
C_5_12YYYYYYYYYYYYY1690.050
C_3_12YYYYYYYYYYYYY29300
C_4_42YYYYYYYYYYYYY1900.010
C_7_72YYYYYYYYYYYY1010.070
C_9_62YYYYYYYYYYYYY610.110.02
C_7_52YYYYY340.50.06
C_9_12YYYYYYYYY4600.02
C_15_53CreativeYY1000.720.03
C_12_73YY520.90.04
C_11_33YYY340.970.03
C_13_13YYY420.90.02
C_14_13YYYYYYYY280.250.11
C_7_23YYYYYYYYY660.980
C_8_83YYYYYYYYYYY3910
C_4_33YYYYYYYYYYY720.970
C_5_53YYYYYYYYYYY7310
C_6_13YYYYYYYYYYY3210
C_3_33YYYYYYYYYYYYY1350.920
C_9_23YYYYYY40.050.2
C_15_74DependentYY1210.030.02
C_14_54YYY5500.29
C_15_134YYY2900.79
C_12_64YYYYYY440.090.41
C_4_24YYYYYYY4000.45
C_7_44YYYYYYY2300.74
C_5_34YYYYYYYYY480.020.29
C_6_34YYYYYYYYY3700.32
C_9_54YYYYYYYYYYY3100.26
C_10_25ApatheticYYY1160.010
C_11_45YYY5000.28
C_8_35YYYY3900.64
C_14_115YYYYYY6200.47
C_10_85YYYYYYYYY3300.79
C_3_25YYYYYYYYYY5301
C_12_55YYYYYYY2200.09
C_14_75YYYYY401
C_13_35YYYYY400.25
C_14_35Y320.060.38
C_11_105YYYYYYYY380.030.5
C_15_15YYY210.140.52
C_7_35YY1501
C_8_65YYY701
C_9_35YYYYYYYYYYY701
C_15_155YYY3300.3
C_11_65YY2800.71
C_12_45YYYYYY340.030.53
Consensus setssd1sd2sd3sd4sd5co1co2co3co4co5st1st2st3Lsd1Lsd2Lsd3Lsd4Lsd5Lco1Lco2Lco3Lco4Lco5Lst1Lst2Lst3
ResourcefulYYY
OrganizedYYYYYYYYYYYYY
CreativeYYYYYYYYYYYYY
DependentYYYYYYYYY
ApatheticYYYYYYYYYYYYY

TCI subscales are indicated self-directedness (sd1–sd5), cooperativeness (co1–co5), and self-transcendence (st1–st3). Subscale values were divided by median split into high and low scores (distinguished by L before the low scores). The number of subjects in each character set is specified (#S). The probabilities of well-being and ill-being are shown for subjects in each character set (see also Supplementary Table S2)

Description of the five character profiles (supersets) and composite character sets identified by PGMRA from profiles of TCI subscales (Y = yes) TCI subscales are indicated self-directedness (sd1–sd5), cooperativeness (co1–co5), and self-transcendence (st1–st3). Subscale values were divided by median split into high and low scores (distinguished by L before the low scores). The number of subjects in each character set is specified (#S). The probabilities of well-being and ill-being are shown for subjects in each character set (see also Supplementary Table S2) The people in three of the five character profiles had healthy personalities, which we named resourceful, organized, and creative to be consistent with traditional labels for TCI profiles (Table 2). For example, people with the "organized" character profile were high in most subscales of self-directedness and cooperativeness, but were low in all subscales of self-transcendence (i.e., they were controlling, individualistic, and skeptical). People with the "creative" profile were high in all aspects of character, whereas the "resourceful" were only self-directed. In addition, there were two profiles of people with unhealthy personalities. The people with a "dependent" character profile were highly forgiving when abused (CO4), conscientiously considerate of others (CO5), self-deprecating (SD4), and otherwise low in self-directedness and self-transcendence. The people with an "apathetic" character were low in all aspects of character development (Table 2).

Association of SNP sets with character

We tested the association of SNP sets with character. The empirical index of character, a single quantitative measure of character functioning, was more strongly associated with SNP sets than with the average effects of their constituent SNPs according to SKAT (Table 1). Forty-two SNP sets had significant associations with character (p < 1E-05). For example, the SNP set G_11_4 has a p-value of 1.22 E-19, whereas the best and average SNPs within this set have 9.00 E-05 and 3.29 E-02 p-values, respectively (Table 1). SKAT [32] and PLINK [54] methods estimated similar p-values for the individual SNPs (R2 = 0.99, F statistics, p < 3.8 E-46), showing that SKAT did not inflate results. Forty-two SNP sets significantly associated with character are described in Table 1. We assigned names to the SNP sets based on prominent molecular processes and pathways that distinguished them (Supplementary Table S4). The character-related SNP sets were comprised of networks of SNPs that mapped 727 genes, nearly all of which are known to influence individual differences in brain functions, particularly regulation of neurodevelopment, neuroplasticity, neuroprotection, connectivity, energy metabolism, stress reactivity, resilience, longevity, learning, and memory (Supplementary Tables S5, S6).

Complex genotypic–phenotypic relationships in personality profiles

We found that 55 of the 342 character sets were significantly associated with particular SNP sets (hypergeometric statistics, 1E-11 < p < 1E-03, Table 3). The genotypic–phenotypic relations were complex, demonstrating pleiotropy and heterogeneity. For example, G_5_1 involved neuroplasticity and was frequently associated with dependent character sets, but sometimes with apathetic or creative profiles (Table 3). The 55 character sets were associated with the 42 SNP sets in 128 relationships that were significant by a permutation test (Table 3, empirical p < 4.7 E-03). The strength of the genotypic–phenotypic relationships among SNP and character sets and their corresponding health measurements Association is measured by Fisher's exact test (hypergeometric). Probabilities of well-being and ill-being are given for subjects in the character sets, the SNP sets, and subjects identified in both jointly. iIndicates character sets that are more specific than their parental sets, which are also selected SNP sets (Fig. 1b, Supplementary Figure 2A) often had similar character profiles associated with particular molecular processes (Table 3, Supplementary Tables S4, S7). For example, the organized profile was strongly associated with many SNP sets involving the regulation of inositol–calcium signaling for obtaining food and other goals (e.g., G_8_8, G_11_4) and for neuroprotection against injury (G_12_8). SNP sets regulating episodic learning and hippocampal neurogenesis (e.g., G_7_3, G_12_1) were associated with a creative profile.

Relations among SNP sets to one another and to molecular processes

We found 12 single and disjoint nodes, and at least three subnetworks composed of highly connected nodes, shown in Fig. 1b and Supplementary Figure S3A. These networks were relatively disjoint (i.e., sharing few SNPs and subjects; see Supplementary Information 9. Identification of Sub-networks), suggesting that these are distinct antecedents of personality. These nearly disjoint networks vary in size and complexity: one subnetwork connected eight SNP sets (Supplementary Figure S3A), whereas others had only a single SNP set. One network contained SNP sets primarily connected by shared SNPs, but not subjects (e.g., G_10_1 learning/memory and G_7_7 olfaction, Fig. 1b), as expected when the same SNPs had different allele values. This network was associated with dependent and organized personality profiles (Fig. 1b). Both shared subjects and SNPs connected the other two networks (Fig. 1b), as occurs when one network is a subset of another. The first network was primarily composed of organized (e.g., components of inositol signaling by G_11_4, G_8_8, G_3_1) and apathetic (e.g., G_21_3 cellular senescence, G_7_2 GPCR dysregulation) profiles. The second network displayed creative (e.g., G_3_2, G_7_3, G_9_8) and dependent (e.g., G_38_8, G_5_1) profiles. Finally, some SNP sets within a network do not share SNPs, but independently specify almost the same individuals (e.g., G_8_8 inositol/chemokine signaling, G_7_2 GPCR dysregulation, Fig. 1b), as expected when distinct subsets of genotypic features influence a common pathway or consequence.

Heterogenic pathways influence the same character trait

The genes associated with each of the five character profiles are largely different. In all, 68% of the 727 genes associated with character were unique to a single character profile: 208 with organized, 89 with creative, 70 with dependent, and 130 with apathetic (Supplementary Table S8). Consequently, there were multiple groups of genes that lead to each individual character trait, as depicted in Fig. 1f. For example, high self-directedness occurs in individuals with the resourceful, organized, and creative profiles, even though these profiles have different genetic backgrounds. Put another way, individual character traits were genetically more heterogeneous than the multidimensional character profiles. We refer to the multiple genotypic–phenotypic networks that contribute to individual traits as a pipeline, as outlined in Fig. 1f. Detailed descriptions of the specific genes and molecular processes we found in the pipelines for each of the three character traits are presented in Supplementary Tables S9–S12.

Complex genotypic–phenotypic relationships influence health status

The combination of genotypic and phenotypic information provided more information than either alone for both well-being (Fig. 1g vs. Fig. 1h) and ill-being (Supplementary Figures S4 vs. S5). When health status was based on the joint relationship of SNP sets and character sets, all five character profiles were well distinguished in terms of the probabilities of ill-being (p < 3.89E-26, ANOVA statistics, Fig. 1c) and well-being (p < 3.68E-65, ANOVA, Fig. 1d). In contrast, when health status was based on character scores only, the probability of ill-being was greater in only two profiles and that of well-being was greater in only one profile (Supplementary Figure S6). We identified candidate regulatory genes that we called switch genes because of their relationship to changes in health status among people with the same character profile (Fig. 1e). For example, all apathetic SNP sets were associated with ill-being except G_9_3, which was associated with well-being. In contrast, the creative SNP sets were associated with well-being except for G_7_7, which was associated with ill-being. The 150 switch genes included 50% protein coding genes, 18% RNA genes, 15% pseudogenes, 3% transcription factors, and 4% others (Supplementary Table S13). Overall about 67% of the 727 genes associated with character sets may be involved in regulatory processes: these included transcriptional regulators (10%), lncRNAs (24%), other RNA genes (6%), and targets of microRNAs (27%), as identified in the TRANSFAC® release 2017.1 database (Supplementary Table S14). We identified two microRNAs (MIR431, MIR1762) in association with character, and they target 74 and 119 of the 727 genes we found associated with character in TRANSFAC, respectively. In particular, lnc RNAs were more commonly associated with character only then with temperament and character, whereas protein-coding genes were more commonly associated with both temperament and character, as shown in Fig. 2a, b.

Replication of results in two independent samples

We tested the replicability of our findings in the Finnish study by carrying out the same analyses in the German and Korean samples. In all, 95% of the 42 SNP sets associated with character sets in the Finnish sample were identified in one or both of the replication samples: 36 were identified in both the Korean and German samples, three in the Korean sample only, and one in the German sample only (Supplementary Table S15). In addition, 96% of the 55 character sets associated with SNP sets in the Finnish sample were replicated in one or both of the replication samples: 46 in both, six in Korean sample only, and one in the German sample only (Table S16). The genotypic–phenotypic relations between SNP and character sets identified in the Finnish sample closely matched those observed in the Korean study (94%) and in the German (84%) study (Table S17). The replication of the 25 character sets associated with ill-being in the Finnish sample was reduced in the German sample (72%) compared with the Korean sample (84%)(ANOVA, p = 0.01), as expected because the Germans had been screened to exclude psychopathology, including personality disorders, in themselves or their first-degree relatives (Supplementary Figure S7). The strength of the identity of replicated sets was calculated using hypergeometric statistics and multi-objective optimization techniques (see Pareto values in Supplementary Tables S18, S19). We also surveyed prior literature reporting associations with TCI character-related keywords systematically from PubMed, and identified genes that had been reported to be associated with one or more of the TCI character traits in one or more investigations (Supplementary Tables S20, S21). We found that 116 of our detected genes were related to genes, family of proteins, or pathways of genes previously associated with TCI traits (Supplementary Table S20). Among the genes in character-related SNP sets, we also detected 74% of the 111 genes that had been previously associated with TCI traits, and 75% of the 63 genes that had previously been reported in association with TCI character traits (Supplementary Table S21). Considering all genes previously related to the TCI (Supplementary Table S21), we recovered seven genes with the same exact name, another 34 variants from the same family of proteins, and another 41 genes in the same KEGG pathway previously reported.

Estimation of heritability and environmental influences

The heritability of character controlling for outliers was estimated as 57% in the Finns, 58% in the Germans, and 50% in the Koreans (Supplementary Table S22). In addition, 95% of the SNP sets were strongly associated with the empirical character index (5E-11 > p-value > 5E-77). In other words, the SNPs that comprise different SNP sets strongly distinguished the character values of the subjects in each set, indicating that each individual SNP set contributed significantly to explain the total distributed heritability (Supplementary Section 9). Consequently, when the genotypic sets were used to classify the well-being and ill-being of the subjects as measured by their character values, the predicted values were highly accurate (average areas under curve of the classifications were 0.928 and 0.932, respectively) (Supplementary Figure S9). We also considered environmental influences in the Finnish sample. There were direct associations of sets of environmental influences in childhood and adulthood with character sets (Supplementary Table S23A) and with SNP sets (Supplementary Table S23B). The impact of these correlations was small, so the heritability estimate was still 56% in the Finnish sample when adjusted for gene-environment correlation (Supplementary Table S23C). In addition, five novel associations between SNP sets and character sets emerged when environmental influences were used as mediators: years of education in childhood and stressful life events in adulthood had significant effects on organized and dependent character profiles (Supplementary Table S23D, p < 2.9 to 8.4 E-03).

Discussion

This is the first data-driven study to examine the genotypic–phenotypic architecture of human character traits, which are the self-regulatory components of personality that modulate physical, mental, and social well-being [48, 65]. As such, it represents a pioneering effort to describe the psychobiology of character as a complex network of genotypes with specific molecular processes and neuronal functions that regulate personality development. We explained 50–58% of the heritability of human character and replicated our results in independent samples, thereby accounting for nearly all the heritability expected from twin studies.

Complexity of genotypic–phenotypic pipelines

We observed that 68% of the 727 genes for character were unique to a single character profile and were regulated by distinct molecular processes and neuronal functions. Such minimal overlap in genes and molecular mechanisms between personality profiles is very surprising from a trait perspective. For example, both the organized and creative character profiles are high in self-directedness and cooperativeness, and differ only in self-transcendence. The resourceful profile differs from the apathetic profile only in being high in self-directedness. Thus, we hypothesize that people can become highly self-directed by multiple mechanisms: a creative or intuitive route involving enhancing self-awareness in episodic memory, an organized or analytical route involving executive control of what is known from past experience, and/or taking initiative by learned resourcefulness. Likewise, there are three or more routes via distinct genetic pipelines to cooperativeness and/or self-transcendence. Consequently, individual personality traits are genetically heterogeneous and their development depends on multiple mechanisms that can only be distinguished by consideration of the whole person. Individual traits may still be important for study of development or treatment, but they do not appear to be the fundamental building blocks of personality.

Regulatory processes and functions associated with character

We observed that 67% of the 727 character genes were involved in regulatory systems. In particular, lncRNAs were more common in association with character only than with both temperament and character (Fig. 2a, b). The identified genes are reported to influence neuroplasticity, energy metabolism, and the regulation of adaptations to a wide variety of biological, psychological, and social stressors through processes for intentional goal-seeking, self-control, empathy, and episodic memory (Table 1). These genetic findings are supported by independent neuroimaging findings that TCI character traits are associated with brain networks for these same intentional and meta-cognitive functions [24-27]. An interesting sign of the high predictability of variability in health status was our finding that a few genes could dramatically alter the health status of people with each specific SNP set, including 150 putative switch genes across all 42 SNP sets. The dramatic effect that a few switch genes can have on overall health status is further evidence of the importance of epistasis for understanding personality and its development.

Strengths and limitations

Our unbiased analytical PGMRA method used deep cluster analysis to identify association between possibly interactive sets of features instead of between individual SNPs or character traits. The results were strongly replicated in independent samples, demonstrating remarkable robustness. Furthermore, the neuronal functions of the identified genes are supported by independent research about brain networks related to TCI character. Our initial pool of SNPs was preselected to be the best candidates to have additive and/or non-additive effects on character. The threshold for possible association (p-value of 0.01 without Bonferroni correction) in our initial pool of SNPs was more than six orders of magnitude below what is required for genome-wide significance. We sought to evaluate the cooperative effects of groups of SNPs with possible non-additive gene–gene interactions and those with strong additive effects individually (i.e., very low p-values). Therefore, we included SNPs that were either weakly or strongly associated with character singly, and then compared their significance as a group vs. that of the best SNP within the group. Consequently, these candidate SNPs may have no main (additive) effect on the phenotype at all, but when organized as SNP sets, they presented consistent evidence of epistasis (i.e., each SNP set had stronger associations with character than their best single constituents). In addition, the SNPs we identified were sufficient to account for nearly all the heritability expected from twin studies (about 50%), which includes both additive and non-additive effects. Our findings are based on associations only, which precludes definite conclusions about causation. Nevertheless, the circumstantial evidence for our causal hypotheses is strong and merits further testing.

Conclusions and recommendations for future research

We were able to characterize and replicate the complexity of the genotypic–phenotypic risk architecture of self-regulatory character traits in three large samples. Our findings demonstrate that data-driven analysis of the architecture of genotypic–phenotypic relationships enables investigators to overcome the hidden heritability problem (i.e., the consistent inability to account for most of the heritability of complex traits when only the average effects of genes are considered). We conclude that self-regulatory personality traits are strongly influenced by organized interactions among more than 700 genes, despite variable cultures and environments. We recommend studies that dissect detailed phenomic and genomic data, including brain images and physiological measurements, and integrate these in a multi-faceted view of each person. We also recommend an extended replicability analysis, in which a marker can be replicated at different multi-omic levels, such as genes, family of proteins, or pathways. The precision of our person-centered approach now allows such in-depth analysis and replication, even for complex traits in moderate-sized samples.
  56 in total

Review 1.  Genes, evolution, and personality.

Authors:  T J Bouchard; J C Loehlin
Journal:  Behav Genet       Date:  2001-05       Impact factor: 2.805

Review 2.  Genetic and environmental influences on human psychological differences.

Authors:  Thomas J Bouchard; Matt McGue
Journal:  J Neurobiol       Date:  2003-01

3.  On the pleiotropic structure of the genotype-phenotype map and the evolvability of complex organisms.

Authors:  William G Hill; Xu-Sheng Zhang
Journal:  Genetics       Date:  2012-01-03       Impact factor: 4.562

Review 4.  Personality architecture: within-person structures and processes.

Authors:  Daniel Cervone
Journal:  Annu Rev Psychol       Date:  2005       Impact factor: 24.137

5.  Is the genetic structure of human personality universal? A cross-cultural twin study from North America, Europe, and Asia.

Authors:  Shinji Yamagata; Atsunobu Suzuki; Juko Ando; Yutaka Ono; Nobuhiko Kijima; Kimio Yoshimura; Fritz Ostendorf; Alois Angleitner; Rainer Riemann; Frank M Spinath; W John Livesley; Kerry L Jang
Journal:  J Pers Soc Psychol       Date:  2006-06

6.  Genetics and neuropsychiatric disorders: genome-wide, yet narrow.

Authors:  Petrus J de Vries
Journal:  Nat Med       Date:  2009-08       Impact factor: 53.440

Review 7.  Dissecting the genetic architecture of human personality.

Authors:  Marcus R Munafò; Jonathan Flint
Journal:  Trends Cogn Sci       Date:  2011-08-09       Impact factor: 20.229

8.  Sex differences and nonadditivity in heritability of the Multidimensional Personality Questionnaire Scales.

Authors:  D Finkel; M McGue
Journal:  J Pers Soc Psychol       Date:  1997-04

9.  The genetic and environmental relationship between Cloninger's dimensions of temperament and character.

Authors:  Nathan A Gillespie; C Robert Cloninger; Andrew C Heath; Nicholas G Martin
Journal:  Pers Individ Dif       Date:  2003-12-01

10.  Uncovering the hidden risk architecture of the schizophrenias: confirmation in three independent genome-wide association studies.

Authors:  Javier Arnedo; Dragan M Svrakic; Coral Del Val; Rocío Romero-Zaliz; Helena Hernández-Cuervo; Ayman H Fanous; Michele T Pato; Carlos N Pato; Gabriel A de Erausquin; C Robert Cloninger; Igor Zwir
Journal:  Am J Psychiatry       Date:  2014-10-31       Impact factor: 18.112

View more
  11 in total

1.  The Yin-Yang personality from biopsychological perspective using revised Sasang Personality Questionnaire.

Authors:  Han Chae; Young Il Cho; Soo Jin Lee
Journal:  Integr Med Res       Date:  2020-06-22

2.  Validation of a general subjective well-being factor using Classical Test Theory.

Authors:  Ali Al Nima; Kevin M Cloninger; Franco Lucchese; Sverker Sikström; Danilo Garcia
Journal:  PeerJ       Date:  2020-06-09       Impact factor: 2.984

3.  Phenotypic and genetic analysis of a wellbeing factor score in the UK Biobank and the impact of childhood maltreatment and psychiatric illness.

Authors:  Justine M Gatt; Janice M Fullerton; Javad Jamshidi; Peter R Schofield
Journal:  Transl Psychiatry       Date:  2022-03-19       Impact factor: 7.989

4.  How "dirty" is the Dark Triad? Dark character profiles, swearing, and sociosexuality.

Authors:  Danilo Garcia
Journal:  PeerJ       Date:  2020-07-27       Impact factor: 2.984

5.  Validation of Two Short Personality Inventories Using Self-Descriptions in Natural Language and Quantitative Semantics Test Theory.

Authors:  Danilo Garcia; Patricia Rosenberg; Ali Al Nima; Alexandre Granjard; Kevin M Cloninger; Sverker Sikström
Journal:  Front Psychol       Date:  2020-02-19

Review 6.  The complex genetics and biology of human temperament: a review of traditional concepts in relation to new molecular findings.

Authors:  C Robert Cloninger; Kevin M Cloninger; Igor Zwir; Liisa Keltikangas-Järvinen
Journal:  Transl Psychiatry       Date:  2019-11-11       Impact factor: 6.222

7.  Toxoplasma gondii Serointensity and Seropositivity: Heritability and Household-Related Associations in the Old Order Amish.

Authors:  Allyson R Duffy; Jeffrey R O'Connell; Mary Pavlovich; Kathleen A Ryan; Christopher A Lowry; Melanie Daue; Uttam K Raheja; Lisa A Brenner; André O Markon; Cecile M Punzalan; Aline Dagdag; Dolores E Hill; Toni I Pollin; Andreas Seyfang; Maureen W Groer; Braxton D Mitchell; Teodor T Postolache
Journal:  Int J Environ Res Public Health       Date:  2019-10-03       Impact factor: 3.390

8.  Genetic Dissection of Temperament Personality Traits in Italian Isolates.

Authors:  Maria Pina Concas; Alessandra Minelli; Susanna Aere; Anna Morgan; Paola Tesolin; Paolo Gasparini; Massimo Gennarelli; Giorgia Girotto
Journal:  Genes (Basel)       Date:  2021-12-21       Impact factor: 4.096

Review 9.  Evolution of genetic networks for human creativity.

Authors:  I Zwir; C Del-Val; M Hintsanen; K M Cloninger; R Romero-Zaliz; A Mesa; J Arnedo; R Salas; G F Poblete; E Raitoharju; O Raitakari; L Keltikangas-Järvinen; G A de Erausquin; I Tattersall; T Lehtimäki; C R Cloninger
Journal:  Mol Psychiatry       Date:  2021-04-21       Impact factor: 15.992

10.  A Psychological Profile of Elite Polish Short Track Athletes: An Analysis of Temperamental Traits and Impulsiveness.

Authors:  Katarzyna Gabrys; Antoni Wontorczyk
Journal:  Int J Environ Res Public Health       Date:  2022-03-15       Impact factor: 3.390

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.