| Literature DB >> 29467471 |
Harri Lempiäinen1, Ingrid Brænne2, Tom Michoel3,4, Vinicius Tragante5, Baiba Vilne6,7, Tom R Webb8, Theodosios Kyriakou9, Johannes Eichner10, Lingyao Zeng6, Christina Willenborg2, Oscar Franzen11, Arno Ruusalepp4, Anuj Goel9, Sander W van der Laan12, Claudia Biegert10, Stephen Hamby8, Husain A Talukdar13, Hassan Foroughi Asl13, Gerard Pasterkamp12,14, Hugh Watkins9, Nilesh J Samani8, Timo Wittenberger10, Jeanette Erdmann2, Heribert Schunkert6,7, Folkert W Asselbergs5,15, Johan L M Björkegren16,17,18.
Abstract
Genome-wide association studies (GWAS) have identified over two hundred chromosomal loci that modulate risk of coronary artery disease (CAD). The genes affected by variants at these loci are largely unknown and an untapped resource to improve our understanding of CAD pathophysiology and identify potential therapeutic targets. Here, we prioritized 68 genes as the most likely causal genes at genome-wide significant loci identified by GWAS of CAD and examined their regulatory roles in 286 metabolic and vascular tissue gene-protein sub-networks ("modules"). The modules and genes within were scored for CAD druggability potential. The scoring enriched for targets of cardiometabolic drugs currently in clinical use and in-depth analysis of the top-scoring modules validated established and revealed novel target tissues, biological processes, and druggable targets. This study provides an unprecedented resource of tissue-defined gene-protein interactions directly affected by genetic variance in CAD risk loci.Entities:
Mesh:
Year: 2018 PMID: 29467471 PMCID: PMC5821758 DOI: 10.1038/s41598-018-20721-6
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Principal analysis steps to identify and score gene/protein subnetwork modules containing CAD candidate genes. (A) Analysis steps to identify subnetwork modules with CAD candidate genes. (I) In step 1, 171 tissue-specific and cross-tissue co-expression networks inferred from the Stockholm Atherosclerosis Gene Expression (STAGE) study[7,35]. (II) In step II to account also for gene-protein and protein-protein interactions (PPIs), the ConsensusPathDB[8] was used to add protein nodes to the STAGE gene networks conserving tissue integrity. (III) In step III, to extract smaller, likely functional, units Girvan-Newman algorithm was used to identify gene/protein modules within each networks resulting in 953 modules. (IV) In step IV, 286 modules affected by genome-wide significant loci (p < 5 × 10−8) were selected (Supplementary Table 1b). Squares indicate genes nodes from STAGE data. Diamonds represent protein nodes from ConsensusPathDB database. Color-coding highlight different modules. Yellow nodes are CAD candidate genes (LDLR, CETP, APOB and PCSK9 (p < 5 × 10−8), APOE and MAPK14 (FDR ≤ 5%)). (B), Principles for scoring gene/protein subnetwork modules with CAD candidate genes. The scoring theme was set to prioritize modules in relevant tissues and biological processes harboring druggable targets against CAD. Specifically, individual nodes were scored according to (I) distance to CAD candidate gene in module, (II) genetic modification of the mouse ortholog displaying an atherosclerotic phenotype (III) expression in CAD relevant tissues (green, positive score) or in tissues commonly displaying drug toxicity (red, negative score), and (IV) the CAD druggability potential of the gene. The final CAD-feasibility score for each module was calculated from the sum of individual gene/protein node scores divided by the total number of nodes/module. Several figures in panels II-IV have been obtained and adapted from Servier Medical Art (www.servier.com) which are distributed under Creative Commons license (https://creativecommons.org/licenses/by/3.0/).
Key features of the top 25 modules.
| Module ID | Size | CAD-feasibility score | Main tissue | CAD candidate genes | Main GO biological process term | Drug target nodes ( | Drug target enrichment ( | Cardiometabolic drug targeted nodes | Cardiometabolic target enrichment ( |
|---|---|---|---|---|---|---|---|---|---|
| 119_6 | 10 | 5.71 | VF | LDLR* | Chemotaxis | 9 | 3.31E-08 | 2 | 4.25E-03 |
| 18_4 | 6 | 5.65 | VF | COL4A1 | Platelet degranulation | 5 | 1.12E-04 | 1 | 5.83E-02 |
| 130_2 | 65 | 5.43 | SM | IGF1R, SHC1, IL6R | Epidermal growth factor receptor signaling pathway | 45 | 2.16E-27 | 0 | 1.00E + 00 |
| 36_4 | 9 | 5.37 | SM | ADM | Vein smooth muscle contraction | 6 | 1.46E-04 | 1 | 8.62E-02 |
| 142_4 | 14 | 5.35 | AAW | LRP1, *FN1 | Complement activation | 11 | 1.29E-08 | 4 | 9.33E-06 |
| 139_5 | 4 | 5.16 | LIV | SLC22A5, SLC22A3 | Quaternary ammonium group transport | 3 | 5.64E-03 | 0 | 1.00E + 00 |
| 1_5 | 5 | 5.09 | IMA, AAW, WB | ADM | Complement activation, alternative pathway | 3 | 1.29E-02 | 1 | 4.88E-02 |
| 134_1 | 38 | 4.98 | IMA, SM | RELA, TNF, *SHC1, LRP1* | Innate immune response | 22 | 8.60E-12 | 4 | 5.67E-04 |
| 17_3 | 12 | 4.93 | IMA, WB, VF | LRP1* | Extracellular matrix disassembly | 8 | 1.02E-05 | 4 | 4.69E-06 |
| 82_4 | 37 | 4.93 | AAW, SF | LRP1, *FN1, COL4A1 | Extracellular matrix organization | 25 | 1.78E-15 | 6 | 1.85E-06 |
| 171_2 | 59 | 4.85 | SF | SHC1, MRAS | epidermal growth factor receptor signaling pathway | 41 | 3.39E-25 | 0 | 1.00E + 00 |
| 108_3 | 75 | 4.83 | VF | PLCG1, LRP1, *ITGB5, SHC1 | Blood coagulation | 43 | 2.04E-21 | 4 | 6.95E-03 |
| 163_1 | 46 | 4.78 | SF | TNF, *FN1 | Extracellular matrix organization | 32 | 5.17E-20 | 4 | 1.18E-03 |
| 137_2 | 111 | 4.76 | SM | PLCG1, BCAR1, SHC1, FLT1 | Peptidyl-tyrosine phosphorylation | 61 | 2.72E-28 | 3 | 1.00E-01 |
| 126_2 | 44 | 4.74 | SM | LRP1, *FN1, COL4A1 | Extracellular matrix organization | 28 | 3.89E-16 | 6 | 5.27E-06 |
| 89_3 | 58 | 4.73 | SM | FN1, COL4A1 | Extracellular matrix organization | 34 | 1.13E-17 | 8 | 1.32E-07 |
| 143_5 | 56 | 4.72 | IMA | RELA, NOS3, *IGF1R, SMAD3 | Innate immune response | 36 | 1.51E-20 | 2 | 1.08E-01 |
| 91_4 | 29 | 4.70 | IMA; SF, VF, SM | LDLR, *APOE, *SCARB1, NOS3* | Receptor-mediated endocytosis | 17 | 1.55E-09 | 3 | 3.01E-03 |
| 116_1 | 10 | 4.68 | LIV, SF | FURIN | Caveolin-mediated endocytosis | 8 | 1.16E-06 | 1 | 9.53E-02 |
| 84_3 | 63 | 4.65 | VF | FN1, COL4A1 | Blood coagulation | 50 | 4.06E-35 | 19 | 7.30E-23 |
| 10_3 | 15 | 4.57 | AAW, LIV, SF,VF | PLG* | Extracellular matrix disassembly | 10 | 7.37E-07 | 4 | 1.26E-05 |
| 72_6 | 7 | 4.56 | IMA | EDNRA* | Cell division | 3 | 3.77E-02 | 1 | 6.77E-02 |
| 130_3 | 108 | 4.56 | SM | MAPK14, RELA, TNF, *FN1 | Positive regulation of NF-kappaB transcription factor activity | 58 | 2.76E-26 | 3 | 9.44E-02 |
| 69_5 | 31 | 4.54 | AAW | FN1: COL4A2: COL4A1 | Extracellular matrix organization | 16 | 5.53E-08 | 1 | 2.67E-01 |
| 124_1 | 43 | 4.52 | SM | MAPK14: RELA: SMAD3 | Activation of MAPKK activity | 23 | 2.68E-11 | 1 | 3.50E-01 |
The table is sorted by module CAD-feasibility score and shows the tissue(s) of the module, CAD candidate genes in the module with genes targeted by cardiometabolic drugs marked with asterisk, the most highly enriched Gene Ontology biological processes (GOBP) category, and number of genes in the module that are targeted by cardiometabolic drugs and by all known drugs. P value of the enrichments is based on Fisher’s exact test.
Figure 2Correlation between CAD-feasibility score and cardiometabolic drug target gene/protein enrichment. (A) The plot shows the 286 modules divided in to five equal size (57–58 modules in each) groups based on the CAD-feasibility score and the 25 top-scoring modules (which is a sub-group of 5th quintile). For each module group the arithmetic mean of the % of nodes targeted by cardiometabolic drugs in each module is shown together with the standard deviation. The score range for the each quintile is shown below the bars. The statistical difference between the quintile groups were tested with Kolmogorov-Smirnov two-group test; the statistically significant comparison are indicated with the arches above the bars: *p < 0.05, **p < 0.01, ***p < 0.001. (B) The plot shows the 286 modules divided in to equal size quintiles (57–58 modules in each) groups based on the CAD-feasibility score, and the 25 top-scoring modules (which is a sub-group of 5th quintile). For each module group the arithmetic mean of the the ratio of cardiometabolic drugs targets versus all other drugs targets is shown together with the standard deviation. The statistical difference between the quintile groups were tested with Kolmogorov-Smirnov two-group test; the statistically significant comparison are indicated with the arches above the bars: **p < 0.01.
ATC groups most represented by drugs that target gene products of the top 25 modules.
| ATC group | ATC group code | Significant modules | |
|---|---|---|---|
|
| ID | ||
| Antineoplastic and immunomodulating agents | L | 13 | 18_4, 130_2, 134_1, 82_4, 171_2, 108_3, 137_2, 126_2, 143_5, 91_4, 116_1, 130_3, 124_1 |
| Cardiovascular system | C | 4 | 36_4, 17_3, 84_3, 72_6 |
| Musculoskeletal system | M | 4 | 17_3, 82_4, 163_1, 69_5 |
| Blood and blood-forming organs | B | 2 | 84_3, 10_3 |
| Nervous system | N | 1 | 124_1 |
| Genitourinary system and sex hormones | G | 1 | 130_3 |
| Sensory organs | S | 1 | 163_1 |
| Alimentary tract and metabolism | A | 0 | |
| Dermatologicals | D | 0 | |
| Systemic hormonal preparations, excluding sex hormones and insulins | H | 0 | |
| Anti-infectives for systemic use | J | 0 | |
| Antiparasitic products, insecticides, and repellents | P | 0 | |
| Respiratory system | R | 0 | |
| Various | V | 0 | |
Listed are the ATC groups and the number and IDs of the modules significantly enriched (Fisher’s exact test, right-tail P < 0.05) in targets of ATC drug groups. ATC, Anatomical Therapeutic Chemical Classification System.
Figure 3Examples of subnetwork modules. (A) Cholesterol and lipoprotein metabolism and homeostasis (Module 91_4); (B) Extracellular matrix organization and regulation, blood coagulation and platelet activation (Module 82_4). (C) Innate immune response (Module 134_1). (D) Cellular signaling (Module 130_2). In the figures, the GO biological process term with the lowest P value (Benjamini-Hochberg; Fisher’s exact test) is shown. IMA, internal mammary artery; SF, subcutaneous fat; SM, skeletal muscle; VF, visceral fat.