| Literature DB >> 33236465 |
Ting Li1, Jianchang Hu1, Shiying Wang1, Heping Zhang1.
Abstract
Identifying genetic biomarkers for brain connectivity helps us understand genetic effects on brain function. The unique and important challenge in detecting associations between brain connectivity and genetic variants is that the phenotype is a matrix rather than a scalar. We study a new concept of super-variant for genetic association detection. Similar to but different from the classic concept of gene, a super-variant is a combination of alleles in multiple loci but contributing loci can be anywhere in the genome. We hypothesize that the super-variants are easier to detect and more reliable to reproduce in their associations with brain connectivity. By applying a novel ranking and aggregation method to the UK Biobank databases, we discovered and verified several replicable super-variants. Specifically, we investigate a discovery set with 16,421 subjects and a verification set with 2,882 subjects, where they are formed according to release date, and the verification set is used to validate the genetic associations from the discovery phase. We identified 12 replicable super-variants on Chromosomes 1, 3, 7, 8, 9, 10, 12, 15, 16, 18, and 19. These verified super-variants contain single nucleotide polymorphisms that locate in 14 genes which have been reported to have association with brain structure and function, and/or neurodevelopmental and neurodegenerative disorders in the literature. We also identified novel loci in genes RSPO2 and TMEM74 which may be upregulated in brain issues. These findings demonstrate the validity of the super-variants and its capability of unifying existing results as well as discovering novel and replicable results.Entities:
Keywords: GWAS; UK Biobank; brian connectivity
Mesh:
Year: 2020 PMID: 33236465 PMCID: PMC7927294 DOI: 10.1002/hbm.25294
Source DB: PubMed Journal: Hum Brain Mapp ISSN: 1065-9471 Impact factor: 5.038
FIGURE 1Results on synthetic data. We present the result on synthetic data and the true images of used in simulations. We report the average true nonzero coverage proportion as k increases from 1:100. The result is averaged by repeating 100 times. It reveals that MLRA (light blue diamond) outperforms all other four methods. Moreover, the performance of MLRA is stable under different settings of coefficient matrix, while rank‐one screening (red triangle) performs poorly in Simulation 2
Top SNPs corresponding to 12 verified super‐variants for connectivity matrix
| Super‐variant | SNP name | Position | Major Allele | Minor Allele | Gene |
|---|---|---|---|---|---|
| chr1_119 | rs150587011 | 118,394,978 | CT | C | |
| rs140523673 | 118,104,740 | A | G | ||
| rs187120237 | 118,521,524 | T | C | SPAG17 | |
| chr3_151 | rs754473336 | 150,225,522 | G | T | |
| 3:150010532 | 150,010,532 | G | GTTAC | ||
| rs56911072 | 150,800,599 | G | C | ||
| chr7_139 | rs74888723 | 138,056,520 | A | G | |
| rs76356478 | 138,682,454 | T | C | ||
| rs28644472 | 138,367,105 | C | T | SVOPL | |
| chr8_110 | rs537656432 | 109,182,059 | C | T | |
| rs73316135 | 109,590,330 | T | C | ||
| rs577665281 | 109,352,658 | C | A | ||
| chr9_26 | rs1332432 | 25,214,299 | G | A | |
| rs73471738 | 25,204,838 | G | T | ||
| rs7043237 | 25,201,349 | G | A | ||
| chr9_120 | 9:119683843 | 119,683,843 | G | GGCGACCGAGC | ASTN2 |
| rs564940053 | 119,683,855 | T | A | ASTN2 | |
| rs12002288 | 119,687,073 | T | C | ASTN2 | |
| chr10_30 | rs112305584 | 29,281,784 | A | T | |
| rs58515486 | 29,828,831 | T | C | SVIL | |
| rs73611821 | 29,284,416 | C | T | ||
| chr12_34 | rs7296825 | 33,059,771 | G | C | |
| rs73303683 | 33,050,638 | A | G | PKP2 | |
| rs60059851 | 33,045,241 | G | A | PKP2 | |
| chr15_65 | rs1037847 | 64,279,555 | T | C | DAPK2 |
| rs7168753 | 64,283,625 | T | C | DAPK2 | |
| rs8041460 | 64,279,864 | T | C | DAPK2 | |
| chr16_61 | 16:60145501 | 60,145,501 | T | TG | |
| rs531574432 | 60,145,505 | C | G | ||
| rs144650764 | 60,133,310 | TTA | T | ||
| chr18_71 | rs79191515 | 70,351,615 | C | T | |
| rs17086080 | 70,133,421 | G | A | ||
| rs10514046 | 70,133,865 | C | G | ||
| chr19_55 | rs11881664 | 54,892,237 | G | A | |
| rs113393416 | 54,516,210 | G | A | CACNG6 | |
| rs113772732 | 54,895,558 | A | G |
The concordance with previous results
| Super‐variant | Gene | Papers | Results |
|---|---|---|---|
| Chr3‐151 |
| Boukhzar et al., | Gene |
|
|
Risheg et al., Isidor et al., Nizon et al., | Gene | |
| Chr7‐139 |
|
Jones et al., Lin et al., |
|
| Chr9‐120 |
| Fagerberg et al., |
Gene |
| Wilson, Fryer, Fang, & Hatten, | Gene | ||
|
Glessner et al., Vrijenhoek et al., Lesch et al., | Variations in gene | ||
| Chr12‐34 |
| Moghadam & Jackson, | Gene |
|
Mittelsteadt et al., | Gene | ||
| Woitecki et al., | Gene | ||
| Chr15‐65 |
| Martin et al., | Gene |
| Chr18‐71 |
| Fagerberg et al., | Gene |
| Rong et al., | Genetic elimination of | ||
|
|
Fagerberg et al., | Gene | |
| Ng et al., | Gene | ||
| Chr19‐55 |
| Fagerberg et al., | Gene |
|
Chen et al., Shirafuji et al., | Mutations in this gene results in neurodegenerative disorder spinocerebellar ataxia‐14 (SCA14). | ||
|
| Fagerberg et al., | These genes encode a type II transmembrane AMPA receptor regulatory protein. Genes | |
| Guan et al., | Genes | ||
|
| Martin et al., | Variations in gene | |
|
| Fagerberg et al., | Gene | |
| Halleran et al., | Gene |
FIGURE 2The influence of the super‐variant on Chromosome 9 block 120 on brain connectivity. We standardize the elements of the connectivity matrices to mean 0 and variance 1. Individuals in the discovery set are separated into two groups according to the minor and major variants of the super‐variant on Chromosome 9 block 120. Here, the variant with a lower frequency is referred to as the minor variant. The difference matrix is calculated by subtracting the average connectivity matrix of the group with the major variant from the average connectivity matrix of the group with the minor variant. For visualization, only differences with absolute values in top 5% are plotted in the chord diagram. Red (green) bands indicate the negative (positive) differences and the widths of the bands indicate the magnitudes of the differences. The numbers in the outer circle indicate specific regions in the brain