| Literature DB >> 33526822 |
Emma Sprooten1, Barbara Franke2, Corina U Greven3,4,5.
Abstract
Different psychiatric disorders and symptoms are highly correlated in the general population. A general psychopathology factor (or "P-factor") has been proposed to efficiently describe this covariance of psychopathology. Recently, genetic and neuroimaging studies also derived general dimensions that reflect densely correlated genomic and neural effects on behaviour and psychopathology. While these three types of general dimensions show striking parallels, it is unknown how they are conceptually related. Here, we provide an overview of these three general dimensions, and suggest a unified interpretation of their nature and underlying mechanisms. We propose that the general dimensions reflect, in part, a combination of heritable 'environmental' factors, driven by a dense web of gene-environment correlations. This perspective calls for an update of the traditional endophenotype framework, and encourages methodological innovations to improve models of gene-brain-environment relationships in all their complexity. We propose concrete approaches, which by taking advantage of the richness of current large databases will help to better disentangle the complex nature of causal factors underlying psychopathology.Entities:
Mesh:
Year: 2021 PMID: 33526822 PMCID: PMC8960404 DOI: 10.1038/s41380-021-01031-2
Source DB: PubMed Journal: Mol Psychiatry ISSN: 1359-4184 Impact factor: 15.992
Glossary.
| Genetics | |
| Heritability | The proportion of variance of a phenotype that is attributable to genetic factors. |
| Genetic correlation | The degree to which two phenotypes are influenced by the same genetic variation. |
| GWAS | Genome-wide association study: mass-univariate analysis to relate common variation over the entire length of the DNA to a phenotype of interest. |
| SNP | Single nucleotide polymorphism: a (common) genetic variation in the DNA sequence where different alleles (nucleotides) can exist in the population. |
| Polygenic | Influenced by many genetic variants (i.e., hundreds, or thousands of genes), as opposed to monogenic (influenced by a single gene, or single genetic variant). |
| Mendelian Randomisation | Hypothesis-driven method aimed at inferring causality from (cross-sectional) associations between a genetic variant and two or more phenotypes. E.g. to test whether a modifiable behavioural or neural trait potentially mediates the effect of a genetic variant on a disease [ |
| LD-score regression | Linkage-disequilibrium score regression: method to calculate genetic correlations on the basis of GWAS output (i.e., “summary statistics”), given the relationship of the statistics to each variant’s linkage disequilibrium pattern [ |
| Neuroimaging | |
| MRI | Magnetic Resonance Imaging |
| Functional MRI | MRI acquisition method to estimate regional brain activation based on local blood-oxygen level dependent (BOLD) signal. |
| Diffusion MRI | MRI acquisition method to measure microstructural tissue properties based on direction and amount of diffusion of water molecules. Most often used for investigating white matter fibres. |
| Functional connectivity | The degree to which two or more brain regions show similar activation patterns over time, based on the correlation or mutual dependence of their BOLD time-series. |
| Multivariate methods | |
| PCA | Principal Component Analysis: data-driven data reduction method to extract maximally uncorrelated components (i.e., “factors”) from many variables. |
| ICA | Independent Component Analysis: data-driven data reduction method and source identification method, which extracts maximally independent components (i.e., “factors”) from many variables. |
| CCA | Canonical Correlation Analysis: method to extract modes (here:“factors”) across two or more sets of variables (e.g., MRI and behavioural variables), such that the variables within a mode are maximally correlated. |
| SEM | Structural Equation Modelling: data reduction method to fit a priori factor structures to data and extract these factors. Can be confirmatory (1 model is tested) or exploratory (multiple a priori models are tested and compared). |
*Note: For consistency and clarity, throughout the paper the term “factors” is used to describe all kinds of factors, components, dimensions, sources, or modes, even if the term “factors” is unusual for the particular method that was used. For the purpose of the present paper, the interpretation is the same across these terms.
Summary of the 3 P-factors and the methods by which they were derived.
| Statistical methods with relevant information | Design | Psychopathology Assessments | Key references [ | |
|---|---|---|---|---|
| Phenotypic P-factor | Most common methods: SEM, factor analysis These are variance decomposition methods and path diagrams within the phenotypic domain. SEM and factor analysis are generally exploratory of multiple a priori models and/or a test of predefined factor structure | • Generally population based • High-risk cohort [ | • Generally continuous symptom ratings • Binary diagnosis [ • Probability of diagnosis [ | [ |
| Genomic P-factor | Most common methods: PCA, Genomic SEM These are variance decomposition methods on the genetic correlations across disorders, which are derived using LDSR of GWAS stats [ PCA is fully data-driven, while Genomic SEM is exploratory or confirmatory of a hypothesized factor structure. | Case-control GWAS designs | Clinical diagnosis | [ |
| Neural P-factor | Most common methods: CCA, ICA CCA can be used to extract “modes” across behavioural/environmental and MRI measures simultaneously. A mode is driven by a combination of the correlations within and across the variable classes. ICA tends to be run within the MRI domain. The resultant “components” can then be correlated with the behavioural and environmental variables. Both ICA and CCA are fully data-driven and are useful to extract a large number of “factors” from high-dimensional data. | • Population based, adults (Human Connectome Project) • Children and adolescent population, enriched for mental distress (ABCD Cohort) | • Behavioural, demographic & environmental measures • Self-report of (family) diagnosis and substance use • Cognitive task performance | [ |
Note: For consistency and clarity, throughout the paper the term “factors” is used to describe all kinds of factors, components, dimensions, sources, or modes, even if the term “factors” is unusual for the particular method that was used. For the purpose of the present paper, the interpretation is the same across these terms.
1this references list is not exhaustive.
Fig. 1The brain is mediator in theory, but not necessarily in practice.
Intuitive theories and concepts, like endophenotypes, logically assume that genetic effects on behaviour (and on environment) pass through the brain. However, the heritable environment induces rGEs, which propagate over time, through brain, environment and behaviour. Hence, the model is updated with a causal brain-behaviour-environment loop. This loop permits a multitude of causal mechanisms between the genome and behaviour, inside and outside the brain, and we propose that this is reflected in the cross-trait covariance captured by the P-factor and its genetic and neural equivalents. Note that the exact mechanisms by which the environment influences the brain and behaviour are many, and may potentially involve tissues outside the brain. For the sake of clarity, we here do not include routes from genome to behaviour that are outside of the brain.