| Literature DB >> 32665545 |
Dennis van der Meer1,2, Oleksandr Frei3,4, Tobias Kaufmann3, Alexey A Shadrin3, Anna Devor3,5,6, Olav B Smeland3, Wesley K Thompson3,7, Chun Chieh Fan8, Dominic Holland5,8, Lars T Westlye3,9, Ole A Andreassen3, Anders M Dale10,11.
Abstract
Regional brain morphology has a complex genetic architecture, consisting of many common polymorphisms with small individual effects. This has proven challenging for genome-wide association studies (GWAS). Due to the distributed nature of genetic signal across brain regions, multivariate analysis of regional measures may enhance discovery of genetic variants. Current multivariate approaches to GWAS are ill-suited for complex, large-scale data of this kind. Here, we introduce the Multivariate Omnibus Statistical Test (MOSTest), with an efficient computational design enabling rapid and reliable inference, and apply it to 171 regional brain morphology measures from 26,502 UK Biobank participants. At the conventional genome-wide significance threshold of α = 5 × 10-8, MOSTest identifies 347 genomic loci associated with regional brain morphology, more than any previous study, improving upon the discovery of established GWAS approaches more than threefold. Our findings implicate more than 5% of all protein-coding genes and provide evidence for gene sets involved in neuron development and differentiation.Entities:
Mesh:
Year: 2020 PMID: 32665545 PMCID: PMC7360598 DOI: 10.1038/s41467-020-17368-1
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Fig. 1MOSTest improves locus discovery.
a Number of independent genome-wide significant loci identified (on the y axis and in the bubbles) for each set of features (on the x axis), by MOSTest (in darker colored circles) and by min-P (in lighter colored squares). b–e Miami plots, contrasting the observed −log10(p-values), shown on the y axis, of each SNP for MOSTest (top half) with min-P (bottom half), for each of the feature sets. The x axis shows the relative genomic location, grouped by chromosome, and the red dashed lines indicate the genome-wide significance threshold of 5 × 10−8. Note, y axis is clipped at −log10(p-value) = 100. f Venn diagram depicting the number of loci, identified with MOSTest, overlapping between the three feature subsets.
Equal replication rates of genome-wide significant loci by MOSTest and min-P.
| Test | Feature set | # loci discovered | # loci replicated | Fraction replicated |
|---|---|---|---|---|
| MOSTest | All | 347 | 122 | 0.35 |
| MOSTest | Subcortical | 177 | 68 | 0.38 |
| MOSTest | Surface area | 139 | 58 | 0.42 |
| MOSTest | Cortical thickness | 71 | 24 | 0.34 |
| min-P | All | 115 | 48 | 0.42 |
| min-P | Subcortical | 90 | 40 | 0.44 |
| min-P | Surface area | 53 | 24 | 0.45 |
| min-P | Cortical thickness | 24 | 9 | 0.38 |
Results from the replication of the genome-wide significant loci are identified through MOSTest and min-P in an additional sample of 4884 individuals. The column “# loci replicated” indicates the amount of discovered loci in the main analysis that surpass a nominal significance (p = 0.05) in the replication sample. The “fraction replicated” column divides the number of loci discovered in the main analysis with the number of loci that replicated.
Fig. 2MOSTest increases effective sample size.
Estimated percent of genetic variance explained by SNPs surpassing the genome-wide significance threshold, on the y axis, as a function of sample size, depicted on the x axis on a log10 scale, for each of the feature sets and for both MOSTest and min-P methods. Percentages of genetic variance explained by discovered SNPs with current sample size (N = 26,502) are shown in parentheses.
Fig. 3The genetic variants identified with MOSTest have distributed effects across the cortex.
Z-values from the univariate GWAS for each cortical region for the two most significant lead SNPs from MOSTest applied to all features combined (left two columns for rs1080066 on chromosome 15, and right two columns for rs13107325 on chromosome 4). The top two rows show the effects of the SNPs on regional surface area, and the bottom two on cortical thickness. Positive effects of carrying the minor allele are shown in red, and negative in blue. Note: the absolute Z-value scaling is clipped at 8 (p = 1.2 × 10−15); an absolute Z-value of 5.45 corresponds to two-tailed genome-wide significance (p = 5 × 10−8). RH = right hemisphere, LH = left hemisphere.
Fig. 4Functional mapping and annotation indicates high neurobiological relevance.
a Number of genome-wide significant genes identified (on the y axis and in the bubbles) for each set of features (on the x axis), by MOSTest (in darker colored circles) and by min-P (in lighter colored squares). b Results from the gene-set analyses following the application of MOSTest and min-P on all brain features. The 15 most significant Gene Ontology sets for MOSTest are listed on the y axis and −log10(p-values) are shown on the x axis. Multiple-comparisons corrected MOSTest gene-set p-values are indicated by black circles, min-P in gray squares.