| Literature DB >> 27814671 |
Le Shu1, Yuqi Zhao1, Zeyneb Kurt1, Sean Geoffrey Byars2,3, Taru Tukiainen4, Johannes Kettunen4, Luz D Orozco5, Matteo Pellegrini5, Aldons J Lusis6, Samuli Ripatti4, Bin Zhang7, Michael Inouye2,3,8, Ville-Petteri Mäkinen9,10,11,12, Xia Yang13,14.
Abstract
BACKGROUND: Complex diseases are characterized by multiple subtle perturbations to biological processes. New omics platforms can detect these perturbations, but translating the diverse molecular and statistical information into testable mechanistic hypotheses is challenging. Therefore, we set out to create a public tool that integrates these data across multiple datasets, platforms, study designs and species in order to detect the most promising targets for further mechanistic studies.Entities:
Keywords: Blood glucose; Cholesterol; Functional genomics; Gene networks; Integrative genomics; Key drivers; Mergeomics; Multidimensional data integration
Mesh:
Substances:
Year: 2016 PMID: 27814671 PMCID: PMC5097440 DOI: 10.1186/s12864-016-3198-9
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Main modules, data flow between them and examples of data types that can be integrated by Mergeomics
Fig. 2Schematic illustration of the concept of a key driver gene (a) and local hubs with overlapping neighborhoods (b)
Fig. 3Comparison of three pathway enrichment methods across three GWAS. Performance is evaluated by sensitivity (a), specificity (b), positive likelihood ratio (sensitivity/(1-specificity)) (c) and receiver operating characteristic curve (d–f). Sensitivity was defined as the proportion of positive control pathways detected at FDR < 25 %. Specificity was defined as the proportion of negative controls rejected at FDR ≥ 25 %. Error bars denote the standard error of simulation results
Fig. 4Comparison of performance of SNP-level meta-analysis and pathway-level meta-analysis using simulated gene-sets. Results are produced in the same workflow as stated in Table 1. a Sensitivity. b Specificity. c Positive likelihood ratio (Sensitivity/(1-Specificity)). d Receiver operating characteristic curve. Error bars denote the standard error of simulation results
Top 15 pathways associated with cholesterol levels out of 1346 canonical pathways tested in three GWAS datasets
| Pathway | MSEA | Meta-MSEA | Meta-GWAS | |||
|---|---|---|---|---|---|---|
| Finnish ( | Framingham ( | GLGC ( | Without GLGC | With GLGC | ||
| Lipid digestion, mobilization and transport | 4.16 | 5.46 | 6.15 | 8.67 | 13.76 | 5.00 |
| Lipoprotein metabolism | 4.67 | 4.82 | 5.94 | 8.59 | 13.49 | 5.41 |
| Chylomicron-mediated lipid transport | 4.88 | 4.87 | 4.72 | 8.85 | 12.61 | 5.03 |
| Metabolism of lipids and lipoproteins | 3.15 | 1.71 | 6.15 | 4.00 | 8.53 | 3.56 |
| Cytosolic tRNA aminoacylation | 3.58 | 2.09 | 1.92 | 4.77 | 5.86 | 2.70 |
| Binding and Uptake of Ligands by Scavenger Receptors | 1.88 | 2.29 | 3.36 | 3.46 | 5.86 | 2.92 |
| Scavenging by Class A Receptors | 1.83 | 2.22 | 3.22 | 3.33 | 5.62 | 3.47 |
| Metabolism | 1.83 | 1.48 | 3.94 | 2.65 | 5.36 | 2.98 |
| PPARA Activates Gene Expression | 1.66 | 2.22 | 2.83 | 3.17 | 5.13 | 1.33 |
| Retinoid metabolism and transport | 1.01 | 2.75 | 3.04 | 2.84 | 4.94 | 1.42 |
| Regulation of Lipid Metabolism by Peroxisome proliferator-activated receptor alpha (PPARalpha) | 1.32 | 2.02 | 2.79 | 2.64 | 4.52 | 1.60 |
| Fatty acid, triacylglycerol and ketone body metabolism | 1.48 | 1.65 | 2.49 | 2.49 | 4.13 | 1.56 |
| Clathrin derived vesicle budding | 1.91 | 1.27 | 2.36 | 2.50 | 4.05 | 1.30 |
| Diseases associated with visual transduction | 1.41 | 1.89 | 2.18 | 2.62 | 4.03 | 2.34 |
| ABC transporters | 1.77 | 0.89 | 3.16 | 1.97 | 4.01 | 2.75 |
The results are listed as − log10 P-values, and the full table is available in Additional file 3. MSEA was run with top 50 % of markers and LD cutoff r2 < 50 %. The column ‘Meta-GWAS’ was estimated according to inverse-variance meta-analysis of the cohort specific P-values at individual SNP level, followed by MSEA. The Bonferroni-adjusted 5 % significance level for 1346 independent tests is at − log10 P = 4.43
Fig. 5Performance comparison between wKDA and the unweighted key driver analysis. Two empirical subnetworks (Lipid I & II) were obtained from a previous publication [23], and a canonical metabolism of lipids and lipoproteins pathway was obtained from the Reactome database (R-HSA-556833). The methods were tested by projecting the three functional subnetworks onto two independent adipose networks (a–c) and two independent liver regulatory networks (d–f). The adipose and liver networks were constructed from a collection of Bayesian tissue-specific network models (Additional file 1: Table S3). Overlap between the tissue-specific key driver signals across two independent regulatory networks was defined according to the Jaccard index. Overlap ratio was calculated for both original networks and networks with 25, 50, 75 or 100 % rewiring of edges
Key drivers for cholesterol-associated gene subnetworks
| Subnetworks | −log10 P | Functional annotation | Top adipose KDs | Top liver KDs | ||||
|---|---|---|---|---|---|---|---|---|
| Key driver | −log10 P | Co-hubs | Key driver | −log10 P | Co-hubs | |||
| Subnetwork 1 Lipoprotein | 16.0 | Lipid transport; cholesterol metabolism; lipoprotein; blood plasma | - | - | - |
| 9.5 |
|
|
| 4.5 |
| ||||||
| Subnetwork 2 Lipid metabolism | 8.1 | Lipid metabolism; metalloprotein; oxidoreductase; endoplasmic reticulum |
| 33.7 |
|
| 49.0 |
|
|
| 26.8 |
|
| 37.4 |
| |||
|
| 24.0 |
|
| 26.9 |
| |||
|
| 23.3 |
|
| 23.9 |
| |||
|
| 23.0 |
|
| 18.8 |
| |||
| Subnetwork 3 Immunoglobulin | 6.1 | Immunoglobulin V-set |
| 12.4 | - |
| 21.4 |
|
|
| 9.4 |
|
| 11.0 |
| |||
|
| 8.8 |
|
| 10.4 |
| |||
|
| 8.3 |
|
| 9.9 |
| |||
|
| 7.2 |
|
| 9.0 |
| |||
| Subnetwork 4 ABC transport | 5.0 | ATP-binding cassette genes | - | - | - |
| 12.0 |
|
|
| 4.3 |
| ||||||
|
| 3.2 |
| ||||||
| Subnetwork 5 Retinoid metabolism | 4.5 | Retinoid metabolism; Visual transduction | - | - | - |
| 11.2 |
|
|
| 3.2 |
| ||||||
|
| 2.9 |
| ||||||
| Subnetwork 6 Transcription | 3.8 | Transcription regulation; fatty acid metabolism; acyltransferase |
| 18.2 |
|
| 23.6 |
|
|
| 17.7 |
|
| 19.0 |
| |||
|
| 15.9 | - |
| 12.2 |
| |||
|
| 15.1 | - |
| 11.6 |
| |||
|
| 13.7 |
|
| 10.7 |
| |||
Initially, canonical pathways were evaluated for the enrichment of genetic perturbations to circulating cholesterol. As these pathways overlap with each other, non-redundant “subnetworks” were constructed that represent the most shared core genes between overlapping pathways. To verify the association with cholester, enrichment was re-evaluated for the subnetworks (second column in the table). Statistical significance was estimated as described in Table 1. Functional annotations were determined with the DAVID Bioinformatics Tool [45]. Key drivers and co-hubs were determined with the wKDA module within Mergeomics. Bayesian networks from multiple mouse studies were combined to create weighted adipose and liver consensus networks [43, 44]. Gene symbols were translated to human when available
Fig. 6Visualization of adipose (a) and liver (b) networks around top key drivers that were identified for cholesterol-associated subnetworks. Top key drivers (nodes with the largest size) are selected as the top five independent key regulatory genes (genes whose neighbourhood has less than 25 % overlap with the neighbourhood of other independent hubs) for subnetwork 2 and subnetwork 6. Subnetwork member genes are denoted as medium size nodes and non-member genes as small size nodes. Top co-hubs (co-hubs with FDR < 10−10 in wKDA) are highlighted by yellow circles. Only edges that were supported by at least two studies are drawn
Pathways associated with fasting glucose across human and mouse association datasets
| Pathway | MSEA | Meta-MSEA | ||||
|---|---|---|---|---|---|---|
| Human | Mouse | Mouse | Mouse | |||
| GWAS | GWAS | TWAS | EWAS | Value | FDR | |
| Glycolysis/Gluconeogenesis | 2.56 | 0.88 | 3.84 | 0.63 | 4.73 | 2.22 % |
| Starch and sucrose metabolism | 3.67 | 1.37 | 3.29 | 0.17 | 4.57 | 2.22 % |
| Regulation And Function Of ChREBP in Liver | 3.10 | 0.93 | 2.74 | 0.41 | 4.08 | 3.60 % |
| Nuclear Receptors in Lipid Metabolism and Toxicity | 5.58 | 0.48 | 1.99 | 0.35 | 4.00 | 3.60 % |
| Regulation of gene expression in beta cells | 4.16 | 1.75 | 1.48 | 0.19 | 3.97 | 3.60 % |
| Type II diabetes mellitus | 2.11 | 1.09 | 1.48 | 1.08 | 3.66 | 6.00 % |
| Integration of energy metabolism | 2.42 | 0.33 | 2.17 | 1.09 | 3.34 | 10.82 % |
| Steroid biosynthesis | 1.10 | 2.04 | 1.27 | 0.76 | 3.10 | 14.34 % |
| alpha-linolenic acid (ALA) metabolism | 3.40 | 0.32 | 1.72 | 0.69 | 3.09 | 14.34 % |
| Incretin Synthesis, Secretion and Inactivation | 3.24 | 1.14 | 0.09 | 2.06 | 3.02 | 14.34 % |
| Adipocytokine signaling pathway | 2.55 | 0.41 | 1.13 | 1.26 | 2.94 | 14.34 % |
| Chylomicron-mediated lipid transport | 0.43 | 0.89 | 2.89 | 1.26 | 2.92 | 14.34 % |
| Glucose transport | 5.57 | 1.31 | 0.27 | 0.29 | 2.92 | 14.34 % |
The results are listed as − log10 P-values, and the full table is available in Additional file 5. MSEA was run with top 50 % of markers, and an LD cutoff r2 < 50 % was applied to the GWAS. For the human GWAS, SNPs were assigned to genes based on a 20 kb window in the genome sequence. For the mouse GWAS, liver eQTL data were used for gene assignment. The Bonferroni-adjusted 5 % significance level for 1346 independent tests is at − log10P = 4.43