| Literature DB >> 35076988 |
João P O F T Guimarães1,2,3, E Sprooten1,2,3, C F Beckmann1,2,4, B Franke2,3,5, J Bralten2,3.
Abstract
The amplitude of activation in brain resting state networks (RSNs), measured with resting-state functional magnetic resonance imaging, is heritable and genetically correlated across RSNs, indicating pleiotropy. Recent univariate genome-wide association studies (GWASs) explored the genetic underpinnings of individual variation in RSN activity. Yet univariate genomic analyses do not describe the pleiotropic nature of RSNs. In this study, we used a novel multivariate method called genomic structural equation modeling to model latent factors that capture the shared genomic influence on RSNs and to identify single nucleotide polymorphisms (SNPs) and genes driving this pleiotropy. Using summary statistics from GWAS of 21 RSNs reported in UK Biobank (N = 31,688), the genomic latent factor analysis was first conducted in a discovery sample (N = 21,081), and then tested in an independent sample from the same cohort (N = 10,607). In the discovery sample, we show that the genetic organization of RSNs can be best explained by two distinct but correlated genetic factors that divide multimodal association networks and sensory networks. Eleven of the 17 factor loadings were replicated in the independent sample. With the multivariate GWAS, we found and replicated nine independent SNPs associated with the joint architecture of RSNs. Further, by combining the discovery and replication samples, we discovered additional SNP and gene associations with the two factors of RSN amplitude. We conclude that modeling the genetic effects on brain function in a multivariate way is a powerful approach to learn more about the biological mechanisms involved in brain function.Entities:
Keywords: genetic correlation analysis; genomic SEM; multivariate GWAS; pleiotropy; resting-state networks
Mesh:
Year: 2022 PMID: 35076988 PMCID: PMC8933256 DOI: 10.1002/hbm.25712
Source DB: PubMed Journal: Hum Brain Mapp ISSN: 1065-9471 Impact factor: 5.399
FIGURE 1SNP‐based heritability results obtained for multimodal association and sensory networks. Cortical surface maps displayed show the multimodal association and sensory networks measured in the BIG40 sample. Multimodal association networks are displayed at the top and the sensory networks at the bottom. Both the medial and lateral views of RSNs in the left and right hemispheres are displayed from left to right. RSNs are color‐coded according to their SNP‐based heritability—proportion of variance in the trait explained by SNP effects—whose scales are displayed on the right. RSN, resting state network; SNP, single nucleotide polymorphism
FIGURE 2Genetic correlation matrix of the heritable RSN amplitudes. Genetic correlations results are reported for the BIG40 samples. Multimodal association (MA) and sensory (SN) network amplitudes are represented according to the color bar displayed below. Genetic correlations scoring nominal and Bonferroni‐corrected significance are respectively labeled with * and **. RSN, resting state network
FIGURE 3Summary of exploratory factor analysis. Plot displaying the percentage of cumulative explained variance (r 2) from up to six‐factor models tested using exploratory factor analysis (EFA) on the discovery (left) and BIG40 samples (right) (a); Cumulative explained variance by the one‐, two‐, and three‐factor models tested using EFA on the discovery (left) and BIG40 samples (right) (b); the added explained variance corresponding to an additional factor in the model is shown in parenthesis
Summary of the two‐factor confirmatory factor analysis in the discovery and replication samples
| Sample | Model | Number of factors | Included RSNs | Chi‐square statistic | Degrees of freedom | AIC | CFI | SRMR |
|---|---|---|---|---|---|---|---|---|
| Discovery sample | EFA‐based model | 2 | 18 | 897 | 133 | 973 | 0.81 | 0.12 |
| Model for multivariate GWAS | 2 | 17 | 575 | 118 | 645 | 0.84 | 0.12 | |
| Replication sample | Model for multivariate GWAS of the discovery sample | 2 | 17 | 295 | 118 | 365 | 0.76 | 0.19 |
Note: For each sample listed in the first column, the second column distinguishes the stages composing the CFA approach. For the discovery sample, CFA consisted of two stages: the first stage, that is, EFA‐based model, tested the model design for the two‐factor model indicated by our EFA approach; the second stage, that is, modeling for multivariate GWAS, only kept RSNs that showed Bonferroni‐corrected factor loadings. The CFA conducted for the replication sample consisted of a single stage, which used the model employed in the multivariate GWAS of the discovery sample. In each stage, a model with a given number of factors was tested (third column), with a given number of RSNs (fourth column). For each model, we display the chi‐square statistic, degrees of freedom, AIC, CFI, and SRMR, from the fifth to the ninth columns.
Abbreviations: AIC, Akaike information criterion; CFA, confirmatory factor analysis; CFI, comparative fit index; EFA, exploratory factor analysis; GWAS, genome‐wide association study; RSN, resting state network; SRMR, standardized root mean square residual.
Summary of the two‐factor confirmatory factor analysis in the BIG40 sample
| Model | Number of factors | Included RSNs | Chi‐square statistic | Degrees of freedom | AIC | CFI | SRMR |
|---|---|---|---|---|---|---|---|
| EFA‐based model | 2 | 20 | 1.40E+7 | 167 | 1.40E+7 | 0.74 | 0.12 |
| Model for multivariate GWAS | 2 | 18 | 1,412 | 134 | 1,486 | 0.89 | 0.12 |
Note: For the BIG40 sample, the first column distinguishes the two stages composing the CFA approach, respectively: the first stage, that is, EFA‐based model, tested the model design for the two‐factor model indicated by our EFA approach; the second stage, that is, modeling for multivariate GWAS, only kept RSNs that showed Bonferroni‐corrected factor loadings. In each stage, a model with a given number of factors was tested (second column), with a given number of RSNs (third column). For each model, we display the chi‐square statistic, degrees of freedom, AIC, CFI, and SRMR, from the fourth to the eighth columns.
Abbreviations: AIC, Akaike information criterion; CFA, confirmatory factor analysis; CFI, comparative fit index; EFA, exploratory factor analysis; GWAS, genome‐wide association study; RSN, resting state network; SRMR, standardized root mean square residual.
FIGURE 4Path diagram of two‐factor models reported for the BIG40 sample. Orange circles represent the two latent genetic factors of the two‐factor confirmatory factor analysis (CFA). Factor 1 (F1) and 2 (F2) are connected by a double‐headed arrow, which represents the correlation between the two factors. F1 and F2 are associated with RSNs represented by blue rectangles, with loadings represented by a blue arrow. Factor correlation and loadings reaching nominal and Bonferroni‐corrected significance (p[Bonferroni] ≤ .05/22 = .0022) are indicated respectively by * and **. RSN, resting state network
LD‐independent significant SNPs for latent factor F1 in the discovery and replication samples
| Genomic locus | rsID | Chromosome | Position |
|
| Variant type | Nearest gene |
|---|---|---|---|---|---|---|---|
| 1 | rs7069316 | 10 | 96,000,282 | 9.83E−09 | .0013 | Intronic | PLCE1 |
| rs10786156 | 10 | 96,014,622 | 1.93E−11 | 1.75E−05 | Intronic | PLCE1 | |
| rs11187842 | 10 | 96,052,511 | 2.46E−08 | .0015 | Intronic | PLCE1 | |
| 2 | rs11596664 | 10 | 134,280,157 | 3.89E−09 | .00012 | Intergenic | C10orf91 |
| rs7907962 | 10 | 134,287,486 | 1.92E−09 | .00067 | Intergenic | C10orf91 | |
| rs4880380 | 10 | 134,288,177 | 3.04E−09 | .0016 | Intergenic | C10orf91 | |
| rs7080018 | 10 | 134,301,505 | 8.69E−11 | .00039 | Intergenic | RP11‐432J24.5 | |
| rs34102287 | 10 | 134,331,173 | 2.21E−10 | .00010 | Intronic | RP11‐432J24.5 | |
| 3 | rs1969161 | 17 | 19,194,812 | 8.84E−09 | .00042 | Intronic | EPN2 |
Note: For each genomic locus, the respective independent genome‐wide significant SNPs are displayed. The table also provides information regarding the chromosome, position, genome‐wide p‐values reported with the discovery and the replication samples, variant type, and nearest gene to each SNP.
Abbreviations: LD, linkage disequilibrium; SNP, single nucleotide polymorphism.
LD‐independent significant SNPs for latent factor F1 in the BIG40 sample
| Genomic locus | rsID | Chromosome | Position |
|
| Variant type | Nearest gene |
|---|---|---|---|---|---|---|---|
| 1 | rs2472884 | 6 | 96,863,965 | 8.46E−10 | .543 | Intronic | UFL1‐AS1 |
| 2 | rs17109869 | 10 | 96,000,162 | 2.33E−09 | 7.55E−11 | Intronic | PLCE1 |
| rs7069316 | 10 | 96,000,282 | 5.23E−12 | 1.26E−12 | Intronic | PLCE1 | |
| rs3891783 | 10 | 96,015,793 | 8.52E−18 | 6.08E−06 | Intronic | PLCE1 | |
| rs17109875 | 10 | 96,026,575 | 9.88E−12 | .31 | Intronic | PLCE1 | |
| rs11187844 | 10 | 96,056,629 | 1.04E−09 | 1.14E−02 | Intronic | PLCE1 | |
| rs2077218 | 10 | 96,071,561 | 3.50E−08 | .72 | Intronic | PLCE1 | |
| 3 | rs10747058 | 10 | 134,276,427 | 1.07E−11 | .18 | Intergenic | C10orf91 |
| rs10781575 | 10 | 134,280,542 | 1.36E−10 | .69 | Intergenic | C10orf91 | |
| rs10747059 | 10 | 134,283,787 | 4.19E−08 | 1 | Intergenic | C10orf91 | |
| rs7083220 | 10 | 134,284,485 | 3.57E−14 | 4.50E−04 | Intergenic | C10orf91 | |
| rs7907962 | 10 | 134,287,486 | 2.15E−13 | .38 | Intergenic | C10orf91 | |
| rs4880380 | 10 | 134,288,177 | 1.13E−12 | 4.04E−06 | Intergenic | C10orf91 | |
| rs12360525 | 10 | 134,289,214 | 1.72E−11 | 6.78E−02 | Intergenic | C10orf91 | |
| rs4880389 | 10 | 134,296,598 | 6.94E−09 | 1 | Intergenic | RP11‐432J24.5 | |
| rs71503745 | 10 | 134,304,757 | 3.42E−15 | 1 | Intergenic | RP11‐432J24.5 | |
| rs7076422 | 10 | 134,309,970 | 6.45E−10 | 7.69E−02 | Intergenic | RP11‐432J24.5 | |
| rs35263482 | 10 | 134,313,046 | 3.84E−15 | 9.53E−02 | Intergenic | RP11‐432J24.5 | |
| rs201873436 | 10 | 134,319,073 | 6.96E−10 | 1.87E−04 | Intergenic | RP11‐432J24.5 | |
| rs75348655 | 10 | 134,319,093 | 5.96E−09 | .91 | Intergenic | RP11‐432J24.5 | |
| rs375824681 | 10 | 134,319,172 | 3.99E−08 | 1 | Intergenic | RP11‐432J24.5 | |
| rs12357355 | 10 | 134,335,986 | 3.10E−09 | 1 | Upstream | LINC01165 | |
| 4 | rs34094842 | 11 | 69,983,047 | 1.44E−08 | 9.67E−06 | Intronic | ANO1RP11‐805J14.3 |
| 5 | rs10895201 | 11 | 101,688,535 | 2.95E−10 | 5.71E−10 | — | TRPC6 |
| rs12222606 | 11 | 101,708,903 | 2.06E−09 | 4.09E−09 | — | TRPC6 | |
| 6 | rs1624825 | 17 | 18,923,818 | 2.12E−09 | 6.86E−10 | Untranslated region | SLC5A10 |
| rs199660087 | 17 | 19,015,683 | 2.67E−08 | 1.44E−06 | Exonic | SNORD3D | |
| rs1969161 | 17 | 19,194,812 | 1.37E−12 | 3.83E−11 | Intronic | EPN2 | |
| 7 | rs429358 | 19 | 45,411,941 | 3.70E−08 | 2.56E−07 | Exonic | APOE |
Note: For each genomic locus, the respective independent genome‐wide significant SNPs are displayed. The table also provides information regarding the chromosome, position, genome‐wide p values and the QSNP p values reported with the BIG40 sample, variant type, and nearest gene to each SNP.
Abbreviations: LD, linkage disequilibrium; SNP, single nucleotide polymorphism.
FIGURE 5Manhattan plot of MAGMA gene analysis findings of latent factors F1 and F2 reported with the BIG40 sample. Gene‐wide p values of associations in F1 (top), which comprises genetic effects shared among all 10 multimodal association networks (MA1–10) and two sensory networks (SN4 and SN10); and F2 (bottom), consisted of six sensory networks (SN2 and SN5–9). In each plot, genes located across the 22 autosomes labeled along the x‐axis are represented by blue dots, whose position along the y‐axis represents the log p value scored by their gene‐wide association with each latent factor. The red‐dashed horizontal line marks the Bonferroni‐corrected significance for the number of genes being tested (p[Bonferroni] ≤ 2.64E−6)
FIGURE 6Genetic correlation matrix comparing the two factors of general brain function with neuropsychiatric and physical health traits. Genetic correlations of two genetic factors (F1 and F2), estimated using the BIG40 sample, with 10 neuropsychiatric and physical traits: attention deficit/hyperactivity disorder (ADHD), autistic spectrum disorder (ASD), bipolar disorder (BIP), major depressive disorder (MDD), schizophrenia, Alzheimer's disease (AD), height, body‐mass index (BMI), diastolic blood pressure (DBP), and bone density. Genetic correlations at nominal and false discovey rate (FDR)‐corrected significance are respectively labeled with * and **