Literature DB >> 27747820

Two-dimensional enrichment analysis for mining high-level imaging genetic associations.

Xiaohui Yao1,2, Jingwen Yan1, Sungeun Kim1, Kwangsik Nho1, Shannon L Risacher1, Mark Inlow1, Jason H Moore3, Andrew J Saykin1, Li Shen4.   

Abstract

Enrichment analysis has been widely applied in the genome-wide association studies, where gene sets corresponding to biological pathways are examined for significant associations with a phenotype to help increase statistical power and improve biological interpretation. In this work, we expand the scope of enrichment analysis into brain imaging genetics, an emerging field that studies how genetic variation influences brain structure and function measured by neuroimaging quantitative traits (QT). Given the high dimensionality of both imaging and genetic data, we propose to study Imaging Genetic Enrichment Analysis (IGEA), a new enrichment analysis paradigm that jointly considers meaningful gene sets (GS) and brain circuits (BC) and examines whether any given GS-BC pair is enriched in a list of gene-QT findings. Using gene expression data from Allen Human Brain Atlas and imaging genetics data from Alzheimer's Disease Neuroimaging Initiative as test beds, we present an IGEA framework and conduct a proof-of-concept study. This empirical study identifies 25 significant high-level two-dimensional imaging genetics modules. Many of these modules are relevant to a variety of neurobiological pathways or neurodegenerative diseases, showing the promise of the proposal framework for providing insight into the mechanism of complex diseases.

Entities:  

Keywords:  Enrichment analysis; Genome-wide association study; Imaging genetics; Quantitative trait

Year:  2016        PMID: 27747820      PMCID: PMC5118198          DOI: 10.1007/s40708-016-0052-4

Source DB:  PubMed          Journal:  Brain Inform        ISSN: 2198-4026


Introduction

Brain imaging genetics is an emerging field that studies how genetic variation influences brain structure and function. Genome-wide association studies (GWAS) have been performed to identify genetic markers such as single nucleotide polymorphisms (SNPs) that are associated with brain imaging quantitative traits (QTs) [20, 21]. Using biological pathways and networks as prior knowledge, enrichment analysis has also been performed to discover pathways or network modules enriched by GWAS findings to enhance statistical power and help biological interpretation [6]. For example, numerous studies on complex diseases have demonstrated that genes functioning in the same pathway can influence imaging QTs collectively even when constituent SNPs do not show significant association individually [18]. Enrichment analysis can also help identify relevant pathways and improve mechanistic understanding of underlying neurobiology [7, 11, 15, 19]. In the genetic domain, enrichment analysis has been widely studied in gene expression data analysis and has recently been modified to analyze GWAS data. GWAS-based enrichment analysis first maps SNP-level scores to gene-based scores, and then tests whether a pre-defined gene set S (e.g., a pathway) is enriched in a set of significant genes L (e.g., GWAS findings). Two strategies are often used to compute enrichment significance: threshold-based [4, 5, 9, 24] and rank-based [23]. Threshold-based approaches aim to solve an independence test problem (e.g., chi-square test, hypergeometric test, or binomial z-test) by treating genes as significant if their scores exceed a threshold. Rank-based methods take into account the score of each gene to determine if the members of S are randomly distributed throughout L. In brain imaging genetics, the above enrichment analysis methods are applicable only to genetic findings associated with each single imaging QT. Our ultimate goal is to discover high-level associations between meaningful gene sets (GS) and brain circuits (BC), which typically include multiple genes and multiple QTs. To achieve this goal, we propose to study Imaging Genetic Enrichment Analysis (IGEA), a new enrichment analysis paradigm that jointly considers sets of interest (i.e., GS and BC) in both genetic and imaging domains and examines whether any given GS–BC pair is enriched in a list of gene–QT findings. Using whole brain whole genome gene expression data from Allen Human Brain Atlas (AHBA) and imaging genetics data from Alzheimer’s Disease Neuroimaging Initiative (ADNI) as test beds, we present a novel IGEA framework and conduct a proof-of-concept study to explore high-level imaging genetic associations based on brain wide genome-wide association study (BWGWAS) results. For consistency purpose, in this paper, we use GS to indicate a set of genes and BC to indicate a set of regions of interest (ROIs) in the brain. The proposed framework consists of the following steps (see also Fig. 1): (1) conduct BWGWAS on ADNI amyloid imaging genetics data to identify SNP-QT and gene–QT associations, (2) use AHBA to identify meaningful GS–BC modules, (3) perform IGEA to identify GS–BC modules significantly enriched by gene–QT associations using a threshold-based strategy, and (4) visualize and interpret the identified GS–BC modules.
Fig. 1

Overview of the proposed Imaging Genetic Enrichment Analysis (IGEA) framework. A Perform SNP-level GWAS of brain wide imaging measures. B Map SNP-level GWAS findings to gene-based. C Construct gene-ROI expression matrix from AHBA data. D Construct GS–BC modules by performing 2D hierarchical clustering, and then filter out 2D clusters with an average correlation below a user-given threshold. E Perform IGEA by mapping gene-based GWAS findings to the identified GS–BC modules. F For each enriched GS–BC module, examine the GS using GO terms, KEGG pathways, and OMIM disease databases, and map the BC to the brain

Overview of the proposed Imaging Genetic Enrichment Analysis (IGEA) framework. A Perform SNP-level GWAS of brain wide imaging measures. B Map SNP-level GWAS findings to gene-based. C Construct gene-ROI expression matrix from AHBA data. D Construct GS–BC modules by performing 2D hierarchical clustering, and then filter out 2D clusters with an average correlation below a user-given threshold. E Perform IGEA by mapping gene-based GWAS findings to the identified GS–BC modules. F For each enriched GS–BC module, examine the GS using GO terms, KEGG pathways, and OMIM disease databases, and map the BC to the brain

Methods and materials

We write matrices and vectors as bold uppercase and lowercase letters, respectively. Given a matrix , we denote its ith row as and jth column as . Given two column vectors and , we use to denote their Pearson’s correlation coefficient.

Brain Wide Genome-Wide Association Study (BWGWAS)

The imaging and genotyping data used for BWGWAS were obtained from the Alzheimer’s Disease Neuroimaging Initiative (ADNI) database (adni.loni.usc.edu). The ADNI was launched in 2003 as a public–private partnership, led by Principal Investigator Michael W. Weiner, MD. The primary goal of ADNI has been to test whether serial magnetic resonance imaging (MRI), positron emission tomography (PET), other biological markers, and clinical and neuropsychological assessment can be combined to measure the progression of mild cognitive impairment (MCI) and early Alzheimer’s disease (AD). For up-to-date information, see http://www.adni-info.org. Preprocessed [18F]Florbetapir PET scans (i.e., amyloid imaging data) were downloaded from adni.loni.usc.edu, then aligned to the corresponding MRI scans and normalized to the Montreal Neurological Institute (MNI) space as mm voxels. ROI level amyloid measurements were further extracted based on the MarsBaR AAL atlas. Genotype data of both ADNI-1 and ADNI-GO/2 phases were also downloaded, and then quality controlled, imputed, and combined as described in [10]. A total of 980 non-Hispanic Caucasian participants with both complete amyloid measurements and genome-wide data were studied. Associations between 105 (out of a total 116) baseline amyloid measures and 5,574,300 SNPs were examined by performing SNP-based GWAS using PLINK [17] with sex, age, and education as covariates. To facilitate the subsequent enrichment analysis, a gene-based p value was determined as the smallest p value of all SNPs located in K bp of the gene [14]. Manhattan plot of imaging quantitative genome-wide association for Alzheimer’s Disease individuals based on Precuneus (right) measurement from amyloid imaging data. The x axis represents the chromosomes and the y axis represents , where P is the gene-based significance

Constructing GS–BC modules using AHBA

There are many types of prior knowledge that can be used to define meaningful GS and BC entities. In the genomic domain, the prior knowledge could be based on Gene Ontology or functional annotation databases; in the imaging domain, the prior knowledge could be neuroanatomic ontology or brain databases. In this work, to demonstrate the proposed IGEA framework, we use gene expression data from the Allen Human Brain Atlas (AHBA, Allen Institute for Brain Science, Seattle, WA, USA; available from http://www.brain-map.org/) to extract GS and BC modules such that genes within a GS share similar expression profiles and so do ROIs within a BC. We hypothesize that, given these similar co-expression patterns across genes and ROIs, each GS–BC pair forms an interesting high-level imaging genetic entity that may be related to certain biological function and can serve as a valuable candidate for two-dimensional IGEA. The AHBA includes genome-wide microarray-based expression covering the entire brain through systematic sampling of regional tissue. Expression profiles for eight health human brains have been released, including two full brains and six right hemispheres. One goal of AHBA is to combine genomics with the neuroanatomy to better understand the connections between genes and brain functioning. As an early report indicated that individuals share as much as 95 % gene expression profile [28], in this study, we only included one full brain (H0351.2001) to construct GS–BC modules. First all the brain samples (900) were mapped to MarsBaR AAL atlas, which included 116 brain ROIs. Due to many-to-one mapping from brain samples to AAL ROIs, there are >1 samples for each ROI. Following [27], samples located in the same ROI were merged using the mean statistics. Probes were then merged to genes using the same strategy. Finally, the preprocessed gene-ROI profiles were normalized for each ROI. As a result, the expression matrix contained 16,076 genes over 105 ROIs. We use to denote this expression matrix, where is the expression level of gene i across all the 105 ROIs in , and is the expression profile of ROI j across all the 16,076 genes in . Given two genes and , we use the Pearson correlation coefficient to define their dissimilarity as follows:Similarly, given two ROIs and , we define their dissimilarity as follows: We performed a 2D cluster analysis on to identify interesting GS–BC modules. First, we calculated the distance matrices for both genes and ROIs, using Eqs. (1) and (2), respectively. Next, two dendrograms were constructed by applying hierarchical clustering to two distance matrices separately, using the UPGMA (Unweighted Pair Group Method with Arithmetic Mean) algorithm [22]. After that, in the genomic domain, as most enrichment analyses placed constraints on genetic pathways of sizes from 10 to 200 [18], we cut the dendrogram at half of its height to build genetic clusters (i.e., GSs) whose sizes are mostly within the above range. Finally, in the imaging domain, we also employed the same parameter to construct ROI clusters (i.e., BCs). Let be a GS–BC module with n genes and m ROIs, where is the expression level of gene i across all the m ROIs in , and is the expression profile of ROI j across all the n genes in . For each pair of genes in , i.e., , we calculate its correlation coefficient. For each pair of ROIs in , i.e., , we also calculate its correlation coefficient. After that, we transform each of these correlation coefficients, say c, to Fisher’s z-statistic z(c) using the following Eq. (3): We then define , the gene-based average Fisher’s z-statistics of correlation coefficient of , as follows: Similarly, we define , the ROI-based average Fisher’s z-statistics of correlation coefficient of , as follows:Based on these average gene-based and ROI-based z-statistics, respectively, we select the top 20 % of all the GS–BC modules and include those in our subsequent analyses, to ensure our studied modules have comparatively high co-expression profiles. Thus, in this work, we focus on the analysis of the following three types of GS–BC modules with top z-statistics: Gene-based These are the modules with relatively high co-expression profiles between genes, i.e., is ranked in the top 20 % of all the scores. ROI-based These are the modules with relatively high co-expression profiles between ROIs, i.e., is ranked in the top 20 % of all the scores. Gene and ROI-based Both (1) and (2) hold. Eight unique brain circuits (BCs) identified from IGEA. ROIs belonging to each BC are colored in red

Imaging Genetic Enrichment Analysis (IGEA)

Pathway enrichment analysis has been extensively employed to genomic domain to analyze the genetic findings associated with a specific imaging QT. In this study, our goal is to identify high-level associations between gene sets and brain circuits, which typically include multiple genes and multiple QTs. In this study, we propose the threshold-based IGEA by extending the existing threshold-based enrichment analysis. SNP-level findings have been mapped to gene level findings in Sect. 2.1. The GWAS findings are a list L of gene–QT associations, where we have a set of genes and a set of QTs in our analysis. From Sect. 2.2, GS–BC modules have been constructed, where either relevant genes share similar expression profiles across relevant ROIs, or relevant ROIs share similar expression profiles across relevant genes, or both. Given an interesting GS–BC module with gene set and QT set , IGEA aims to determine whether the target GS–BC module is enriched in L. Now we describe our threshold-based IGEA method. We have N gene–QT pairs from GWAS. Out of these, pairs (the set A) are significant ones with GWAS p value passed a certain threshold. We also have (the set P) gene–QT pairs from a given GS–BC module, and k significant pairs are from P. Using Fisher’s exact test for independence, the enrichment p value for the given GS–BC module is calculated as follows:Here, we use Pr to denote the probability function.

Evaluation of the identified GS–BC modules

For evaluation purpose, we tested the statistical significance of the IGEA results. We hypothesize that the gene–QT associations from BWGWAS of the original data should be overrepresented in certain GS–BC modules, and the BWGWAS results on permuted data should not be enriched in a similar number of GS–BC modules. We performed the IGEA analysis on permuted BWGWAS results, and estimated the p value for the number of significant GS–BC modules discovered from the original data using a t-distribution with degrees of freedom. Given a BWGWAS result R, let Prop(R) be the proportion of modules which are significantly enriched by R. Let be the original BWGWAS result, and be the ith permuted BWGWAS result. Let be the set of these proportion values for all the permuted results. Then the p value is estimated using Eq. (7).where is the t-distribution with degrees of freedom, is the sample mean of S, and is the sample standard deviation of S. To determine the functional relevance of the enriched GS–BC modules, we also tested whether genes from each module are overrepresented for specific neurobiological functions, signaling pathways, or complex neurodegenerative diseases. We performed pathway enrichment tests using gene ontology (GO) terms, KEGG pathways and OMIM (Online Mend-elian Inheritance in Man) database. Twenty-five significantly enriched GS–BC modules from IGEA See also Sect. 3.2 and Fig. 3 for details about relevant GSs and BCs, respectively
Fig. 3

Eight unique brain circuits (BCs) identified from IGEA. ROIs belonging to each BC are colored in red

aTo indicate whether the top 20 % modules are selected based on the gene-based, ROI-based, or gene&ROI-based strategy bG: Gene-based cR: ROI-based dG&R: Gene&ROI-based Brain maps of four brain circuits (BCs) identified from IGEA

Results

Significant GS–BC modules

By performing hierarchical clustering on both genetic and imaging domains, 171 out of 216 genetic clusters (only those with size ranging from 10 to 200) and 9 imaging clusters (with size ranging from 4 to 23, no clusters are excluded) were identified. 1539 GS–BC modules were generated by combining each pair of genetic and ROI clusters. Two sets of 308 (20 % of 1539) modules were selected according to gene-based and ROI-based z-statistics, respectively. Among them, 90 modules were among top 20 % in both gene-based and ROI-based ranking results. We used a moderate size threshold for the selection, to avoid the exclusion of potentially interesting candidates. For the BWGWAS results, we obtained gene–QT associations after mapping SNP-based p values to genes. Out of these, 1402 gene–QT associations passed the BWGWAS p value of . Figure 2 shows the gene-based GWAS result of an example QT (i.e., the average amyloid deposition in the right precuneus). Precuneus amyloid concentration has been demonstrated to be associated with disordered activity in Alzheimer’s Disease [8].
Fig. 2

Manhattan plot of imaging quantitative genome-wide association for Alzheimer’s Disease individuals based on Precuneus (right) measurement from amyloid imaging data. The x axis represents the chromosomes and the y axis represents , where P is the gene-based significance

Three sets of constructed GS–BC modules (308, 308, and 90 with top z-statistics using gene-based, ROI-based, and gene&ROI-based strategies, respectively, see Sect. 2.2) were tested separately for whether they could be enriched by BWGWAS results using IGEA. Across three sets, totally 25 modules turned out to be significant after Bonferroni correction (see Table 1), of which 15, 17, and 9 are from gene-based, ROI-based, and both gene&ROI-based categories, respectively. We also tested the significance of the number of identified GS–BC modules. Compared to the permuted BWGWAS results, the analysis on the original data yielded a significantly larger number of enriched GS–BC modules with estimated p values of , and , corresponding to gene-based, ROI-based, and gene&ROI-based strategies, respectively, indicating that imaging genetic associations existed in these enriched GS–BC modules.
Table 1

Twenty-five significantly enriched GS–BC modules from IGEA

Module IDTop 20 % Co-expresseda BC ID# of ROIsGS ID# of genesCorrected P value (gene-based)Corrected P value (ROI-based)Corrected P value (Gene&ROI-based)
01Rc BC078GS01812.61E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-06
02G, R, G&Rd BC024GS021689.06E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-069.06E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-069.06E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-06
03Gb BC0311GS021682.54E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-11
04G, R, G&RBC045GS021681.44E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-061.44E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-061.44E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-06
05GBC0514GS021686.42E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-06
06RBC0613GS021685.91E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-07
07RBC0823GS021685.65E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-22
08G, R, G&RBC014GS03551.38E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-061.38E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-061.38E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-06
09GBC024GS03554.39E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-13
10RBC045GS03551.41E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-15
11GBC0514GS03551.01E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-14
12RBC0613GS03551.72E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-08
13RBC078GS03552.40E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-21
14RBC078GS04664.00E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-07
15G, R, G&RBC014GS05193.83E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-053.83E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-053.83E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-05
16G, R, G&RBC024GS05196.88E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-096.88E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-096.88E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-09
17G, R, G&RBC045GS05192.64E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-102.64E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-102.64E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-10
18RBC0613GS05192.26E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-11
19G, R, G&RBC078GS05191.54E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-141.54E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-141.54E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-14
20G, R, G&RBC024GS06284.87E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-084.87E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-084.87E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-08
21GBC024GS07247.69E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-05
22G&RBC014GS08331.97E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-04
23GBC024GS08331.11E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-07
24RBC045GS08337.39E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-09
25GBC024GS091114.07E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-05

See also Sect. 3.2 and Fig. 3 for details about relevant GSs and BCs, respectively

aTo indicate whether the top 20 % modules are selected based on the gene-based, ROI-based, or gene&ROI-based strategy

bG: Gene-based

cR: ROI-based

dG&R: Gene&ROI-based

Top enriched OMIM diseases of identified GSs * Significantly enriched Across all 25 identified modules, there are 9 and 8 unique GS and BC entities, respectively. Figure 3 shows the 8 unique identified BCs with corresponding ROI names, and Fig. 4 maps four of those onto the brain. For example, BC03 and BC04 include structures that are major spots for amyloid accumulation in AD (e.g., cingulum, precuneus). BC05 involves structures responsible for motivated behaviors (e.g., caudate, pallidum, putamen) and sensory information processing (e.g., thalamus). BC08 involves various frontal regions responsible for executive functions. Details of all 25 modules are listed in Table 1. We can find that some modules share common gene sets with different brain circuits, and some share the same brain circuits with different gene sets. This illustrates the complex associations among multiple genes and multiple brain ROIs.
Fig. 4

Brain maps of four brain circuits (BCs) identified from IGEA

Results of KEGG pathway enrichment for identified GSs. The x axis represents unique GS ID, and y axis represents −log p value of enrichment significance of KEGG pathways. Marked cell represents significant enrichment (p value )

Pathway analysis of identified GS–BC modules

To explore and analyze functional relevance of our identified GS–BC modules, we performed pathway enrichment analysis from three aspects including GO terms, functional pathways and diseases using Gene Ontology (GO), KEGG pathways, and OMIM diseases databases, respectively. Figure 5 shows the KEGG pathway enrichment results which were mapped to 15 categories. From the results, most identified GSs had a number of significant functional enrichments. Several of them were directly related to the neurodegenerative disease and its development, e.g., Alzheimer’s Disease enriched in GS05 and Parkinson’s Disease enriched in GS01. Another major part of them were also related to the neurodegenerative diseases and their development. For instance, caffeine as the most widely used psychoactive substance, its metabolism (from GS09 located in Module 25) can affect brain metabolism and has potential benefits on Parkinson’s Disease treatment [16]. There are also several enriched pathways related to oxidative stress, which is a critical factor for a range of neurodegenerative disorders. For example, glycolysis and gluconeogenesis (from GS02 located in Modules 02-07) are associated with hypoxia, ischemia, and AD [2]. Gap junctions (from GS03 located in Modules 08-13) can couple various kinds of cells in the central nervous system (CNS) which play an important role in maintaining normal function. Signaling transduction, like calcium signaling pathway (from GS03 located in Modules 08-13) playing key role in short- and long-term synaptic plasticity has shown abnormality in many neurodegenerative disorders including Alzheimer’s Disease, Parkinson’s disease, amyotrophic lateral sclerosis (ALS), Huntington’s disease, spinocerebellar ataxias (SCA), and so on [1].
Fig. 5

Results of KEGG pathway enrichment for identified GSs. The x axis represents unique GS ID, and y axis represents −log p value of enrichment significance of KEGG pathways. Marked cell represents significant enrichment (p value )

Table 2 shows the OMIM disease enrichment results. Several neurodegeneration-related and age-related diseases and complex disorders were enriched in various gene sets, such as Alzheimer’s Diease from GS03 and GS05, Encephalopathy from GS01 and GS02, and Anomalies from GS05. Besides neurodegeneration diseases and disorders, several cancer-related entities are detected including breast cancer from GS02 and leukemia from GS03. These findings provided potential evidence for the studies that focused on investigating the relationship between cancer and neurodegeneration, with abnormal cell growth and cell loss in common.
Table 2

Top enriched OMIM diseases of identified GSs

GS ID# of geneOMIM Disease P value
GS0181Encephalopathy4.2E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-2*
Dementia3.6E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-2*
GS02168Encephalopathy5.0E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-2
Breast cancer9.5E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-2
GS0355Leukemia2.7E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-2*
Alzheimer’s disease8.9E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-2
GS0466Hypertension5.0E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-2
GS0519Anomalies2.4E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-2*
Alzheimer’s disease4.5E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-2*
GS0628Ectodermal dysplasia2.0E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-2*
GS0724Hypertension3.4E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-2*
Spinocerebellar ataxia4.3E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-2*
GS0833Glycogen storage disease1.6E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-2*
GS09111Immunodeficiency1.4E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-2*

* Significantly enriched

Top enriched GO terms of GSs from identified GS–BC modules Gene Ontology (GO) enrichment indicates the relationship between identified gene sets and GO terms from three categories including biological process (BP), cellular component (CC), and molecular function (MF) (http://geneontology.org/). For the GO enrichment of all 9 gene sets, 163 various GO terms were significantly enriched. Top enriched terms were selected and grouped to 7 categories including behavior, cell communication, mitochondrion, metabolic process, neurological system process, response to stimulus, and signal transduction, as shown in Table 3. A large number of these terms have direct or indirect relationships with neurodegenerative diseases or phenotypes.
Table 3

Top enriched GO terms of GSs from identified GS–BC modules

GroupGS ID# of genesGO CategoryCorrected p value
BehaviorGS0355Behavior2.2E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-2
Learning or memory4.4E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-2
Cell communicationGS0181Regulation of synaptic transmission2.7E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-6
Neuron-neuron Synaptic transmission2.9E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-3
GS0355Synaptic transmission1.7E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-4
Metabolic processGS0519Fat-soluble vitamin metabolic process4.3E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-2
Organic hydroxy compound biosynthetic process4.8E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-2
GS0628Regulation of translational termination2.8E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-2
MitochondrionGS02168Mitochondrial membrane part2.5E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-3
Mitochondrial respiratory chain complex I4.9E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-3
Neurological system processGS0355Associative learning1.1E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-2
Learning4.5E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-6
GS09111Detection of chemical stimulus involved in sensory perception1.1E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-4
Olfactory receptor activity1.9E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-5
Response to stimulusGS0355Response to amphetamine2.0E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-3
Visual behavior4.5E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-3
GS0519Response to cholesterol3.6E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-2
Response to sterol3.7E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-2
GS09111Detection of chemical stimulus1.6E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-4
Signal transductionGS0181Glutamate receptor signaling pathway7.3E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-4
GS0355Adenylate cyclase-activating dopamine receptor signaling pathway3.1E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-3
Dopamine receptor signaling pathway1.4E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-2
GS0519Transmembrane receptor protein kinase activity4.4E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-2
GS09111Olfactory receptor activity1.9E\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-$$\end{document}-5

Discussion

We have presented a two-dimensional imaging genetic enrichment analysis (IGEA) framework to explore the high-level imaging genetic associations by integrating whole brain genomic, transcriptomic, and neuroanatomic data. Traditional pathway enrichment analysis focused on investigating genetic findings of a single phenotype one at a time, and relationships among imaging QTs could be ignored. Such approach could be inadequate to provide insights into the mechanisms of complex diseases that involve multiple genes and multiple QTs. In this paper, we have proposed a novel enrichment analysis paradigm IGEA to detect high-level associations between gene sets and brain circuits. By jointly considering the complex relationships between interlinked genetic markers and correlated brain imaging phenotypes, IGEA provides additional power for extracting biological insights on neurogenomic associations at a systems biology level. For example, let us take a look at GS03-BC05, an identified module significantly enriched by our GWAS findings. Several ROIs (e.g., caudate, pallidum, and putamen) from BC05 have been indicated responsible for motivated behaviors [3]. Meanwhile both KEGG and GO functional enrichment results of GS03 show high relevance to behavior and normal function maintaining (see Fig. 5; Table 3). These observations suggest that this high-level imaging genetic pattern could be relevant to the behavior mechanism. It warrants further investigation to perform analyses targeted at these genes and ROIs in independent cohorts to better understand the underlying mechanism from the imaging genetic perspective. The real power of IGEA, however, can be affected by several aspects. First, the constructed GS–BC modules should reflect the real relationships among genes as well as brain ROIs. Thus, it is crucial to define meaningful gene sets and brain circuits. In our paper, GSs and BCs were separately extracted from AHBA brain wide expression data based on hierarchical clustering, which were then combined to provide GS–BC modules. This strategy was based on the idea that interlinked genetic markers (or brain ROIs) would conserve similar expression pattern, i.e., would be highly co-expressed. Second, the statistical measure of enrichment evaluation can be based on different strategies. We adopted hypergeometric test in our experiment to estimate the over-representation of our defined GS–BC modules to the list of gene–QT pair. Based on these two considerations, our proposed paradigm can be further improved. From our GS–BC module construction, GSs (or BCs) are clustered together based on their co-expression pattern across all the ROIs in the whole brain (or across all the genes in the genome). Although statistical measures were calculated using Fisher’s z-transformation to restrict our analyses on only highly co-expressed modules from our bi-clustering results, we could be missing other highly co-expressed GSs (or BCs) if they only had similar expression patterns on a small set of ROIs (or genes). In other words, our module construction strategy considered the global expression pattern but ignored the local ones. It is worth for further investigation to try other reasonable strategies by applying prior knowledge such as pre-defined genetic pathways/networks or brain circuits, or by using different co-clustering algorithms (e.g., [26]) to take into consideration of relevant local expression patterns. Hypergeometric test requires a pre-defined threshold to determine the list of gene–QT pairs. Another limitation is that it considers only the count of significant gene–QT pairs, but ignores the strength of gene–QT associations. There are a number of rank-based enrichment analysis methods (e.g., GSEA [23]) that can be employed in our two-dimensional enrichment analysis to overcome these disadvantages. Another issue is that we used the smallest SNP-level p value within the gene to represent the gene-based p value. Therefore, another possible future direction is to explore other set-based methods for calculating gene-based p values such as VEGAS [13], GATES [12], and so on. Besides, from mathematical perspective, associating GS–BC modules and gene–QT findings can be seen as a similarity discovery over two matrices. Thus, another future direction could be to study this problem using machine learning approaches similar to that proposed by Wang et al. [25].
  26 in total

1.  Global functional profiling of gene expression.

Authors:  Sorin Draghici; Purvesh Khatri; Rui P Martins; G Charles Ostermeier; Stephen A Krawetz
Journal:  Genomics       Date:  2003-02       Impact factor: 5.736

2.  Expander: from expression microarrays to networks and functions.

Authors:  Igor Ulitsky; Adi Maron-Katz; Seagull Shavit; Dorit Sagir; Chaim Linhart; Ran Elkon; Amos Tanay; Roded Sharan; Yosef Shiloh; Ron Shamir
Journal:  Nat Protoc       Date:  2010-01-28       Impact factor: 13.491

3.  Large-scale cellular-resolution gene profiling in human neocortex reveals species-specific molecular signatures.

Authors:  Hongkui Zeng; Elaine H Shen; John G Hohmann; Seung Wook Oh; Amy Bernard; Joshua J Royall; Katie J Glattfelder; Susan M Sunkin; John A Morris; Angela L Guillozet-Bongaarts; Kimberly A Smith; Amanda J Ebbert; Beryl Swanson; Leonard Kuan; Damon T Page; Caroline C Overly; Ed S Lein; Michael J Hawrylycz; Patrick R Hof; Thomas M Hyde; Joel E Kleinman; Allan R Jones
Journal:  Cell       Date:  2012-04-13       Impact factor: 41.582

4.  Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles.

Authors:  Aravind Subramanian; Pablo Tamayo; Vamsi K Mootha; Sayan Mukherjee; Benjamin L Ebert; Michael A Gillette; Amanda Paulovich; Scott L Pomeroy; Todd R Golub; Eric S Lander; Jill P Mesirov
Journal:  Proc Natl Acad Sci U S A       Date:  2005-09-30       Impact factor: 11.205

5.  Molecular pathways involved in neuronal cell adhesion and membrane scaffolding contribute to schizophrenia and bipolar disorder susceptibility.

Authors:  C O'Dushlaine; E Kenny; E Heron; G Donohoe; M Gill; D Morris; A Corvin
Journal:  Mol Psychiatry       Date:  2010-02-16       Impact factor: 15.992

6.  Implication of the immune system in Alzheimer's disease: evidence from genome-wide pathway analysis.

Authors:  Jean-Charles Lambert; Benjamin Grenier-Boley; Vincent Chouraki; Simon Heath; Diana Zelenika; Nathalie Fievet; Didier Hannequin; Florence Pasquier; Olivier Hanon; Alexis Brice; Jacques Epelbaum; Claudine Berr; Jean-Francois Dartigues; Christophe Tzourio; Dominique Campion; Mark Lathrop; Philippe Amouyel
Journal:  J Alzheimers Dis       Date:  2010       Impact factor: 4.472

7.  Genome-wide pathway analysis implicates intracellular transmembrane protein transport in Alzheimer disease.

Authors:  Mun-Gwan Hong; Andrey Alexeyenko; Jean-Charles Lambert; Philippe Amouyel; Jonathan A Prince
Journal:  J Hum Genet       Date:  2010-07-29       Impact factor: 3.172

Review 8.  Multifunctional roles of enolase in Alzheimer's disease brain: beyond altered glucose metabolism.

Authors:  D Allan Butterfield; Miranda L Bader Lange
Journal:  J Neurochem       Date:  2009-09-23       Impact factor: 5.372

9.  GSA-SNP: a general approach for gene set analysis of polymorphisms.

Authors:  Dougu Nam; Jin Kim; Seon-Young Kim; Sangsoo Kim
Journal:  Nucleic Acids Res       Date:  2010-05-25       Impact factor: 16.971

10.  Motivation-dependent responses in the human caudate nucleus.

Authors:  M R Delgado; V A Stenger; J A Fiez
Journal:  Cereb Cortex       Date:  2004-04-27       Impact factor: 5.357

View more
  8 in total

1.  Regional imaging genetic enrichment analysis.

Authors:  Xiaohui Yao; Shan Cong; Jingwen Yan; Shannon L Risacher; Andrew J Saykin; Jason H Moore; Li Shen
Journal:  Bioinformatics       Date:  2020-04-15       Impact factor: 6.937

Review 2.  Imaging Genetics in Epilepsy: Current Knowledge and New Perspectives.

Authors:  Ge Wang; Wenyue Wu; Yuchen Xu; Zhuanyi Yang; Bo Xiao; Lili Long
Journal:  Front Mol Neurosci       Date:  2022-05-30       Impact factor: 6.261

3.  Mining Regional Imaging Genetic Associations via Voxel-wise Enrichment Analysis.

Authors:  Xiaohui Yao; Shan Cong; Jingwen Yan; Shannon L Risacher; Andrew J Saykin; Jason H Moore; Li Shen
Journal:  IEEE EMBS Int Conf Biomed Health Inform       Date:  2019-09-12

4.  Volumetric GWAS of medial temporal lobe structures identifies an ERC1 locus using ADNI high-resolution T2-weighted MRI data.

Authors:  Shan Cong; Xiaohui Yao; Zhi Huang; Shannon L Risacher; Kwangsik Nho; Andrew J Saykin; Li Shen
Journal:  Neurobiol Aging       Date:  2020-07-14       Impact factor: 4.673

5.  Polygenic mediation analysis of Alzheimer's disease implicated intermediate amyloid imaging phenotypes.

Authors:  Eng Yingxuan; Xiaohui Yao; Kefei Liu; Shannon L Risacher; Andrew J Saykin; Qi Long; Yize Zhao; Li Shen
Journal:  AMIA Annu Symp Proc       Date:  2021-01-25

6.  Overcoming false-positive gene-category enrichment in the analysis of spatially resolved transcriptomic brain atlas data.

Authors:  Ben D Fulcher; Aurina Arnatkeviciute; Alex Fornito
Journal:  Nat Commun       Date:  2021-05-11       Impact factor: 14.919

7.  GC-CNNnet: Diagnosis of Alzheimer's Disease with PET Images Using Genetic and Convolutional Neural Network.

Authors:  Morteza Amini; Mir Mohsen Pedram; AliReza Moradi; Mahdieh Jamshidi; Mahshad Ouchani
Journal:  Comput Intell Neurosci       Date:  2022-08-09

8.  Mining High-Level Imaging Genetic Associations via Clustering AD Candidate Variants with Similar Brain Association Patterns.

Authors:  Ruiming Wu; Jingxuan Bao; Mansu Kim; Andrew J Saykin; Jason H Moore; Li Shen
Journal:  Genes (Basel)       Date:  2022-08-24       Impact factor: 4.141

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.