| Literature DB >> 25033284 |
Ville-Petteri Mäkinen1, Mete Civelek2, Qingying Meng3, Bin Zhang4, Jun Zhu4, Candace Levian3, Tianxiao Huan5, Ayellet V Segrè6, Sujoy Ghosh7, Juan Vivar8, Majid Nikpay9, Alexandre F R Stewart10, Christopher P Nelson11, Christina Willenborg12, Jeanette Erdmann13, Stefan Blakenberg14, Christopher J O'Donnell15, Winfried März16, Reijo Laaksonen17, Stephen E Epstein18, Sekar Kathiresan19, Svati H Shah20, Stanley L Hazen21, Muredach P Reilly22, Aldons J Lusis2, Nilesh J Samani11, Heribert Schunkert23, Thomas Quertermous24, Ruth McPherson9, Xia Yang3, Themistocles L Assimes24.
Abstract
The majority of the heritability of coronary artery disease (CAD) remains unexplained, despite recent successes of genome-wide association studies (GWAS) in identifying novel susceptibility loci. Integrating functional genomic data from a variety of sources with a large-scale meta-analysis of CAD GWAS may facilitate the identification of novel biological processes and genes involved in CAD, as well as clarify the causal relationships of established processes. Towards this end, we integrated 14 GWAS from the CARDIoGRAM Consortium and two additional GWAS from the Ottawa Heart Institute (25,491 cases and 66,819 controls) with 1) genetics of gene expression studies of CAD-relevant tissues in humans, 2) metabolic and signaling pathways from public databases, and 3) data-driven, tissue-specific gene networks from a multitude of human and mouse experiments. We not only detected CAD-associated gene networks of lipid metabolism, coagulation, immunity, and additional networks with no clear functional annotation, but also revealed key driver genes for each CAD network based on the topology of the gene regulatory networks. In particular, we found a gene network involved in antigen processing to be strongly associated with CAD. The key driver genes of this network included glyoxalase I (GLO1) and peptidylprolyl isomerase I (PPIL1), which we verified as regulatory by siRNA experiments in human aortic endothelial cells. Our results suggest genetic influences on a diverse set of both known and novel biological processes that contribute to CAD risk. The key driver genes for these networks highlight potential novel targets for further mechanistic studies and therapeutic interventions.Entities:
Mesh:
Year: 2014 PMID: 25033284 PMCID: PMC4102418 DOI: 10.1371/journal.pgen.1004502
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Figure 1Schematic overview of the study design.
A) The SNP set enrichment analysis (SSEA) comprised four steps. First, gene sets from knowledge-driven pathways and data-driven co-expression modules were collected. Second, the gene sets were converted to expression SNP (eSNP) sets according to genetics of gene expression or eQTL studies. Third, P-values from CAD GWAS were extracted for each eSNP. Fourth, the GWAS P-values within eSNP sets were compared against random expectation to derive pathways and network modules enriched for CAD genetic signals. B) Overlapping CAD-associated gene sets were merged and trimmed into non-overlapping supersets. C) Integration of Bayesian gene-gene network models with CAD-associated supersets to determine key driver genes based on network topology.
Knowledge-based grouping of canonical pathways that were significantly enriched for CAD genetic loci.
| Category | Selected pathways | All eSNPs | Adipose | Liver | Blood | HAEC |
| Control | GWAS Catalog | 29.0 | 9.0 | 25.1 | 17.2 | 7.3 |
| CADGene | 12.0 | 11.9 | 9.1 | 1.3 | 1.7 | |
| Lipids (9) | Metabolism of lipids and lipoproteins (Reactome) | 10.0 | 9.8 | 2.4 | 0.7 | 1.0 |
| Fatty acid metabolism (KEGG) | 5.2 | 10.3 | 1.6 | 0.4 | 0.7 | |
| Recycling of bile acids and salts (Reactome) | 5.3 | - | 8.5 | - | - | |
| Immune system (24) | Immunoregulation between lymphoid and other cells (Reactome) | 9.4 | 11.4 | 8.6 | 1.8 | 1.6 |
| Antigen processing and presentation (KEGG) | 8.9 | 8.1 | 10.8 | 2.9 | 3.0 | |
| Th1/Th2 differentiation (Biocarta) | 6.6 | 5.1 | 5.3 | 2.0 | 0.2 | |
| Adhesion and diapesis of lymphocytes (Biocarta) | 3.3 | 6.0 | - | 0.8 | 3.6 | |
| Adhesion and diapedesis of granulocytes (Biocarta) | 3.3 | 3.8 | - | 0.9 | 3.7 | |
| Cellular stress response (6) | VEGF, hypoxia and angiogenesis (Biocarta) | 4.5 | 7.7 | 5.9 | 3.2 | 1.4 |
| Erythropoietin mediated neuroprotection through NF-kB (Biocarta) | 3.0 | 4.8 | 4.6 | 2.2 | 2.6 | |
| Hypoxia-inducible factor in the cardiovascular system (Biocarta) | 1.5 | 3.8 | 2.6 | 1.9 | 1.1 | |
| Cell cycle and growth (18) | Notch-HLH transcription (Reactome) | 2.5 | 3.3 | - | - | 0.6 |
| NRAGE signals death through JNK (Reactome) | 3.2 | 3.7 | 2.5 | 5.3 | 0.5 | |
| EGF signaling pathway (Biocarta) | 1.7 | 2.7 | 1.7 | 1.8 | 1.2 | |
| G1/S transition (Reactome) | 1.2 | 0.6 | 5.3 | 0.1 | 0.2 | |
| DNA and RNA (7) | Double-strand break repair (Reactome) | 3.1 | 2.5 | 2.3 | 3.3 | 0.7 |
| Spliceosome (KEGG) | 1.7 | 0.6 | 0.3 | 5.9 | 0.5 | |
| Protein metabolism (6) | Metabolism of proteins (Reactome) | 2.1 | 4.2 | 1.1 | 2.6 | 1.5 |
| Proteasome (KEGG) | 0.9 | 0.3 | 3.6 | 0.2 | 0.7 | |
| Post-translational protein modifications (Reactome) | 1.3 | 3.6 | 2.3 | 0.7 | 1.5 | |
| Other (6) | Bioactive peptide induced signaling (Biocarta) | 3.2 | 5.9 | 0.3 | 1.5 | 0.2 |
| PPAR signaling pathway (KEGG) | 3.1 | 5.9 | 0.6 | 0.6 | 0.7 | |
| Glycine, serine and threonine metabolism (KEGG) | 2.4 | 2.2 | 1.3 | 2.4 | 3.4 |
The enrichment score was defined as the mean of negative log-transformed Kolmogorov-Smirnov and Fisher P-values for over-representation of high-ranking GWAS SNPs among the eSNPs that affect the expression of the pathway member genes. The number in parentheses in the first column indicates the number of CAD-associated pathways (detailed in Table S1).
*FDR<20% in Stage 1 and 2 respectively, and FDR<5% in combined Stage 1 & 2.
CAD enrichment scores for selected non-overlapping supersets after the merging of CAD-associated canonical pathways and co-expression modules.
| Superset | Overlap with known processes | All eSNPs | Adipose | Liver | Blood | HAEC |
| Lipid I | Lipid, fatty acid and steroid metabolism; oxidoreductase; PPAR signaling; mitochondrial beta-oxidation; branched-chain amino acid degradation; cholesterol biosynthesis; unsaturated fatty acid biosynthesis | 5.4 | 9.4 | 0.4 | 0.4 | 0.7 |
| Lipid II | Lipid, fatty acid and steroid metabolism; oxidoreductase; vesicles; xenobiotics; complement and coagulation system | 10.3 | 11.0 | 1.9 | 1.8 | 0.1 |
| Antigen | Human leukocyte antigens; bone reabsorption | 10.3 | 9.5 | 8.6 | 3.7 | 1.1 |
| Immunity | Wound and inflammatory responses; cell activation | 6.1 | 7.4 | 8.7 | 1.5 | 1.4 |
| Unknown I | - | 3.5 | 7.4 | 4.4 | 1.2 | 0.2 |
| Unknown II | - | 2.9 | 6.4 | 3.9 | 2.2 | 0.2 |
The enrichment score was defined as the mean of negative log-transformed Kolmogorov-Smirnov and Fisher P-values for over-representation of high-ranking GWAS SNPs among the eSNPs that affect the expression of the superset member genes.
*P<0.05 in either Fisher's exact test or Kolmogorov-Smirnov test after Bonferroni correction for the 3,539 original gene sets.
Top five genes whose eSNPs show strongest association with CAD in GWAS (termed “GWAS signal genes”) and key driver genes for selected CAD-associated supersets.
| Superset | GWAS signal genes | Key driver genes |
| Lipid I |
|
|
| Lipid II |
|
|
| Immunity |
|
|
| Antigen |
|
|
| Unknown I |
|
|
| Unknown II |
|
|
*Genes within superset whose eSNPs (i.e. putative functional SNPs that affect gene expression) show best association with CAD in the GWAS meta-analysis.
The key driver genes were ascertained by combining key driver analyses of all available Bayesian networks, and taking into account both the consistency across datasets and the KDA statistics.
Figure 2Key driver genes of six CAD-associated supersets, and their adjacent regulatory partners.
Key driver genes were denoted as larger nodes in the network. Genes were colored based on their membership in the six CAD-associated supersets. A) ‘Lipid II’ superset in red. B) ‘Lipid I’ superset in yellow. C) ‘Unknow II’ superset in lime. D) ‘Immunity’ superset in green. E) ‘Antigen’ superset in blue. F) ‘Unknown I’ superset in magenta. Only edges that were present in at least two Bayesian networks constructed from independent studies were included.