| Literature DB >> 27747820 |
Xiaohui Yao1,2, Jingwen Yan1, Sungeun Kim1, Kwangsik Nho1, Shannon L Risacher1, Mark Inlow1, Jason H Moore3, Andrew J Saykin1, Li Shen4.
Abstract
Enrichment analysis has been widely applied in the genome-wide association studies, where gene sets corresponding to biological pathways are examined for significant associations with a phenotype to help increase statistical power and improve biological interpretation. In this work, we expand the scope of enrichment analysis into brain imaging genetics, an emerging field that studies how genetic variation influences brain structure and function measured by neuroimaging quantitative traits (QT). Given the high dimensionality of both imaging and genetic data, we propose to study Imaging Genetic Enrichment Analysis (IGEA), a new enrichment analysis paradigm that jointly considers meaningful gene sets (GS) and brain circuits (BC) and examines whether any given GS-BC pair is enriched in a list of gene-QT findings. Using gene expression data from Allen Human Brain Atlas and imaging genetics data from Alzheimer's Disease Neuroimaging Initiative as test beds, we present an IGEA framework and conduct a proof-of-concept study. This empirical study identifies 25 significant high-level two-dimensional imaging genetics modules. Many of these modules are relevant to a variety of neurobiological pathways or neurodegenerative diseases, showing the promise of the proposal framework for providing insight into the mechanism of complex diseases.Entities:
Keywords: Enrichment analysis; Genome-wide association study; Imaging genetics; Quantitative trait
Year: 2016 PMID: 27747820 PMCID: PMC5118198 DOI: 10.1007/s40708-016-0052-4
Source DB: PubMed Journal: Brain Inform ISSN: 2198-4026
Fig. 1Overview of the proposed Imaging Genetic Enrichment Analysis (IGEA) framework. A Perform SNP-level GWAS of brain wide imaging measures. B Map SNP-level GWAS findings to gene-based. C Construct gene-ROI expression matrix from AHBA data. D Construct GS–BC modules by performing 2D hierarchical clustering, and then filter out 2D clusters with an average correlation below a user-given threshold. E Perform IGEA by mapping gene-based GWAS findings to the identified GS–BC modules. F For each enriched GS–BC module, examine the GS using GO terms, KEGG pathways, and OMIM disease databases, and map the BC to the brain
Fig. 3Eight unique brain circuits (BCs) identified from IGEA. ROIs belonging to each BC are colored in red
Fig. 2Manhattan plot of imaging quantitative genome-wide association for Alzheimer’s Disease individuals based on Precuneus (right) measurement from amyloid imaging data. The x axis represents the chromosomes and the y axis represents , where P is the gene-based significance
Twenty-five significantly enriched GS–BC modules from IGEA
| Module ID | Top 20 % Co-expresseda | BC ID | # of ROIs | GS ID | # of genes | Corrected | Corrected | Corrected |
|---|---|---|---|---|---|---|---|---|
| 01 | Rc | BC07 | 8 | GS01 | 81 | – | 2.61E | – |
| 02 | G, R, G&Rd | BC02 | 4 | GS02 | 168 | 9.06E | 9.06E | 9.06E |
| 03 | Gb | BC03 | 11 | GS02 | 168 | 2.54E | – | – |
| 04 | G, R, G&R | BC04 | 5 | GS02 | 168 | 1.44E | 1.44E | 1.44E |
| 05 | G | BC05 | 14 | GS02 | 168 | 6.42E | – | – |
| 06 | R | BC06 | 13 | GS02 | 168 | – | 5.91E | – |
| 07 | R | BC08 | 23 | GS02 | 168 | – | 5.65E | – |
| 08 | G, R, G&R | BC01 | 4 | GS03 | 55 | 1.38E | 1.38E | 1.38E |
| 09 | G | BC02 | 4 | GS03 | 55 | 4.39E | – | – |
| 10 | R | BC04 | 5 | GS03 | 55 | – | 1.41E | – |
| 11 | G | BC05 | 14 | GS03 | 55 | 1.01E | – | – |
| 12 | R | BC06 | 13 | GS03 | 55 | – | 1.72E | – |
| 13 | R | BC07 | 8 | GS03 | 55 | – | 2.40E | – |
| 14 | R | BC07 | 8 | GS04 | 66 | – | 4.00E | – |
| 15 | G, R, G&R | BC01 | 4 | GS05 | 19 | 3.83E | 3.83E | 3.83E |
| 16 | G, R, G&R | BC02 | 4 | GS05 | 19 | 6.88E | 6.88E | 6.88E |
| 17 | G, R, G&R | BC04 | 5 | GS05 | 19 | 2.64E | 2.64E | 2.64E |
| 18 | R | BC06 | 13 | GS05 | 19 | – | 2.26E | – |
| 19 | G, R, G&R | BC07 | 8 | GS05 | 19 | 1.54E | 1.54E | 1.54E |
| 20 | G, R, G&R | BC02 | 4 | GS06 | 28 | 4.87E | 4.87E | 4.87E |
| 21 | G | BC02 | 4 | GS07 | 24 | 7.69E | – | – |
| 22 | G&R | BC01 | 4 | GS08 | 33 | – | – | 1.97E |
| 23 | G | BC02 | 4 | GS08 | 33 | 1.11E | – | – |
| 24 | R | BC04 | 5 | GS08 | 33 | – | 7.39E | – |
| 25 | G | BC02 | 4 | GS09 | 111 | 4.07E | – | – |
See also Sect. 3.2 and Fig. 3 for details about relevant GSs and BCs, respectively
aTo indicate whether the top 20 % modules are selected based on the gene-based, ROI-based, or gene&ROI-based strategy
bG: Gene-based
cR: ROI-based
dG&R: Gene&ROI-based
Fig. 4Brain maps of four brain circuits (BCs) identified from IGEA
Fig. 5Results of KEGG pathway enrichment for identified GSs. The x axis represents unique GS ID, and y axis represents −log p value of enrichment significance of KEGG pathways. Marked cell represents significant enrichment (p value )
Top enriched OMIM diseases of identified GSs
| GS ID | # of gene | OMIM Disease |
|
|---|---|---|---|
| GS01 | 81 | Encephalopathy | 4.2E |
| Dementia | 3.6E | ||
| GS02 | 168 | Encephalopathy | 5.0E |
| Breast cancer | 9.5E | ||
| GS03 | 55 | Leukemia | 2.7E |
| Alzheimer’s disease | 8.9E | ||
| GS04 | 66 | Hypertension | 5.0E |
| GS05 | 19 | Anomalies | 2.4E |
| Alzheimer’s disease | 4.5E | ||
| GS06 | 28 | Ectodermal dysplasia | 2.0E |
| GS07 | 24 | Hypertension | 3.4E |
| Spinocerebellar ataxia | 4.3E | ||
| GS08 | 33 | Glycogen storage disease | 1.6E |
| GS09 | 111 | Immunodeficiency | 1.4E |
* Significantly enriched
Top enriched GO terms of GSs from identified GS–BC modules
| Group | GS ID | # of genes | GO Category | Corrected |
|---|---|---|---|---|
| Behavior | GS03 | 55 | Behavior | 2.2E |
| Learning or memory | 4.4E | |||
| Cell communication | GS01 | 81 | Regulation of synaptic transmission | 2.7E |
| Neuron-neuron Synaptic transmission | 2.9E | |||
| GS03 | 55 | Synaptic transmission | 1.7E | |
| Metabolic process | GS05 | 19 | Fat-soluble vitamin metabolic process | 4.3E |
| Organic hydroxy compound biosynthetic process | 4.8E | |||
| GS06 | 28 | Regulation of translational termination | 2.8E | |
| Mitochondrion | GS02 | 168 | Mitochondrial membrane part | 2.5E |
| Mitochondrial respiratory chain complex I | 4.9E | |||
| Neurological system process | GS03 | 55 | Associative learning | 1.1E |
| Learning | 4.5E | |||
| GS09 | 111 | Detection of chemical stimulus involved in sensory perception | 1.1E | |
| Olfactory receptor activity | 1.9E | |||
| Response to stimulus | GS03 | 55 | Response to amphetamine | 2.0E |
| Visual behavior | 4.5E | |||
| GS05 | 19 | Response to cholesterol | 3.6E | |
| Response to sterol | 3.7E | |||
| GS09 | 111 | Detection of chemical stimulus | 1.6E | |
| Signal transduction | GS01 | 81 | Glutamate receptor signaling pathway | 7.3E |
| GS03 | 55 | Adenylate cyclase-activating dopamine receptor signaling pathway | 3.1E | |
| Dopamine receptor signaling pathway | 1.4E | |||
| GS05 | 19 | Transmembrane receptor protein kinase activity | 4.4E | |
| GS09 | 111 | Olfactory receptor activity | 1.9E |