Anida Sarajlić, Vuk Janjić, Neda Stojković, Djordje Radak, Nataša Pržulj.
Abstract
The structure of protein-protein interaction (PPI) networks has already been successfully used as a source of new biological information. Even though cardiovascular diseases (CVDs) are a major global cause of death, many CVD genes still await discovery. We explore ways to utilize the structure of the human PPI network to find important genes for CVDs that should be targeted by drugs. The hope is to use the properties of such important genes to predict new ones, which would in turn improve the choice of therapy. We propose a methodology that examines the PPI network wiring around genes involved in CVDs. We use the methodology to identify a subset of CVD-related genes that are statistically significantly enriched in drug targets and "driver genes." We seek such genes, since driver genes have been proposed to drive the onset and progression of a disease. Our identified subset of CVD genes has a large overlap with the Core Diseasome, which has been postulated to be the key to disease formation and hence should be the primary object of therapeutic intervention. This indicates that our methodology identifies "key" genes responsible for CVDs. Thus, we use it to predict new CVD genes and we validate over 70% of our predictions in the literature. Finally, we show that our predicted genes are functionally similar to currently known CVD drug targets, which supports the potential utility of our methodology for improving therapy for CVDs.
Year: 2013 PMID: 23977067 PMCID: PMC3744556 DOI: 10.1371/journal.pone.0071537
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1. Flowchart of our approach.
Parallelograms denote inputs and outputs. Rectangles denote analyses. Rhombuses denote choices to be made.
Figure 2. 73 Graphlets and Graphlet Degree Vector (GDV) of a node.
Above: Graphlets with up to five nodes, denoted by $G_0, G_1, \ldots, G_{29}$. They contain 73 “symmetry groups,” denoted by $0, 1, \ldots, 72$. Within a graphlet, nodes belonging to the same symmetry group are of the same shade [33]. Below: An illustration of the GDV of node $v$. $GDV(v) = (2, 1, 1, 0, 0, 1, 0, \ldots, 0)$, meaning that $v$ is touched by two edges (orbit 0), illustrated in the left panel; is an end-node of one graphlet $G_1$ (orbit 1), illustrated in the middle panel; is the middle node of one graphlet $G_1$ (orbit 2), illustrated in the left panel again; touches no nodes of a triangle (orbit 3 in graphlet $G_2$); is no end-node of graphlet $G_3$ (orbit 4); is one middle node of graphlet $G_3$ (orbit 5), illustrated in the right panel; and touches no other orbits [22]. Reproduced by permission of The Royal Society of Chemistry (http://pubs.rsc.org/en/content/articlehtml/2012/mb/c2mb25230a).
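The orbit counts described in the Figure 2 caption can be reproduced on a toy graph. The following is a minimal brute-force sketch, not the authors' implementation: it counts only orbits 0 to 5 (the ones the caption walks through), and the function names, the node labels (`v`, `a`, `b`, `c`), and the four-node example graph are assumptions chosen so that the node `v` has the orbit counts stated in the caption.

```python
from itertools import combinations

def induced_edges(adj, nodes):
    """Edges of the subgraph induced on `nodes` (adj maps node -> set of neighbours)."""
    return [(a, b) for a, b in combinations(nodes, 2) if b in adj[a]]

def orbits_0_to_5(adj, v):
    """Brute-force counts of orbits 0..5 for node v (illustrative only)."""
    counts = [0] * 6
    counts[0] = len(adj[v])                      # orbit 0: degree of v
    others = [u for u in adj if u != v]

    # Three-node graphlets containing v: induced path or triangle.
    for pair in combinations(others, 2):
        edges = induced_edges(adj, (v,) + pair)
        deg_v = sum(v in e for e in edges)
        if len(edges) == 3:
            counts[3] += 1                       # orbit 3: node of a triangle
        elif len(edges) == 2:
            if deg_v == 2:
                counts[2] += 1                   # orbit 2: middle of a 3-node path
            else:
                counts[1] += 1                   # orbit 1: end of a 3-node path

    # Four-node induced paths containing v.
    for triple in combinations(others, 3):
        nodes = (v,) + triple
        edges = induced_edges(adj, nodes)
        if len(edges) != 3:
            continue
        degs = {n: sum(n in e for e in edges) for n in nodes}
        if sorted(degs.values()) == [1, 1, 2, 2]:  # degree sequence of a 4-node path
            if degs[v] == 1:
                counts[4] += 1                   # orbit 4: end of a 4-node path
            else:
                counts[5] += 1                   # orbit 5: middle of a 4-node path
    return counts

# Hypothetical toy graph: path b - v - a - c.
toy = {"v": {"a", "b"}, "a": {"v", "c"}, "b": {"v"}, "c": {"a"}}
print(orbits_0_to_5(toy, "v"))  # [2, 1, 1, 0, 0, 1]
```

Enumerating all node subsets is only practical for tiny graphs; for a full PPI network a dedicated graphlet counter would be needed.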
Table 1. The Ten Key Cardiovascular Disease Genes.
| Entrez ID | Gene name | GO term | Cardiovascular disease (CVD) |
| 25 | ABL1 | Intracellular signaling cascade (BP), Signal transducer activity (MF) | Viral myocarditis. |
| 6464 | SHC1 | Intracellular signaling cascade (BP), Signal transducer activity (MF) | Atherosclerosis. |
| 6667 | SP1 | Enzyme binding (MF) | Thrombophlebitis. |
| 367 | AR | Intracellular signaling cascade (BP), Intracellular receptor-mediated signaling pathway (BP), Signal transducer activity (MF) | Atherosclerosis. |
| 1499 | CTNNB1 | Intracellular signaling cascade (BP), Intracellular receptor-mediated signaling pathway (BP), Enzyme binding (MF), Signal transducer activity (MF) | Arrhythmogenic right ventricular cardiomyopathy (ARVC). |
| 2534 | FYN | Intracellular signaling cascade (BP) | Viral myocarditis. |
| 60 | ACTB | Enzyme binding (MF) | Arrhythmogenic right ventricular cardiomyopathy (ARVC), Hypertrophic cardiomyopathy (HCM), Viral myocarditis, Dilated cardiomyopathy (DCM). |
| 10014 | HDAC5 | | Heart failure. |
| 1956 | EGFR | Intracellular signaling cascade (BP), Enzyme binding (MF), Signal transducer activity (MF) | Thrombophlebitis, Stroke. |
| 2099 | ESR1 | Intracellular signaling cascade (BP), Intracellular receptor-mediated signaling pathway (BP), Signal transducer activity (MF) | Stroke, Atherosclerosis, Cerebrovascular disorder. |
The first two columns: ten Key CVD genes (Entrez Gene IDs and Official Gene Symbols respectively). The third column: GO terms that the genes are annotated with. We only take into consideration GO terms in which this set of 10 genes is statistically significantly enriched. We only list GO terms that correspond to biological functions that the three drug mechanisms of interest rely on. BP denotes “biological process,” while MF denotes “molecular function” of GO. The fourth column: CVDs that the genes are associated with.
Table 2. Predicted CVD genes.
| Entrez ID | Gene name | GO term | Reference PubMed ID |
| 1387 | CREBBP | Receptor binding (MF), Signal transduction (BP). | 14724353 |
| 4193 | MDM2 | Enzyme binding (MF). | 18375498, 22821713 |
| 3065 | HDAC1 | Enzyme binding (MF). | 22226905 |
| 4088 | SMAD3 | Enzyme binding (MF), Receptor binding (MF), Enzyme linked receptor protein signaling pathway (BP). | 22167769, 22633655 |
| 4087 | SMAD2 | Enzyme binding (MF), Receptor binding (MF), Signal transduction (BP), Intracellular signaling cascade (BP), Enzyme linked receptor protein signaling pathway (BP). | 20829218, 22049534 |
| 3725 | JUN, c-JUN | Signal transduction (BP), Response to drug (BP), Enzyme linked receptor protein signaling pathway (BP). | 22664133 |
| 672 | BRCA1 | Enzyme binding (MF), Receptor binding (MF), Signal transduction (BP), Intracellular signaling cascade (BP). | 22186889 |
| 4609 | MYC | | 22402364 |
| 6714 | SRC | Signal transduction (BP), Intracellular signaling cascade (BP), Enzyme linked receptor protein signaling pathway (BP). | 22287273 |
| 2033 | EP300 | Receptor binding (MF), Signal transduction (BP), Response to drug (BP). | 20375365 |
| 7157 | TP53 | Enzyme binding (MF), Signal transduction (BP), Intracellular signaling cascade (BP), Response to drug (BP). | 23074332, 22189267 |
| 2885 | GRB2 | Receptor binding (MF), Signal transduction (BP), Intracellular signaling cascade (BP), Enzyme linked receptor protein signaling pathway (BP). | 12639989 |
| 8517 | IKBKG | Signal transduction (BP), Intracellular signaling cascade (BP). | – |
| 3320 | HSP90AA1, HSP90AA2 | Signal transduction (BP). | – |
| 5295 | PIK3R1 | Enzyme binding (MF), Receptor binding (MF), Signal transduction (BP), Intracellular signaling cascade (BP), Enzyme linked receptor protein signaling pathway (BP). | – |
| 7543 | YWHAZ | Signal transduction (BP), Response to drug (BP). | – |
| 10971 | YWHAQ | Signal transduction (BP), Intracellular signaling cascade (BP). | – |
The first two columns: predicted CVD genes (Entrez Gene IDs and Official Gene Symbols respectively). The third column: GO terms that the genes are annotated with. We only take into consideration GO terms in which this set of 17 genes is statistically significantly enriched. We only list GO terms that correspond to biological functions that the three drug mechanisms of interest rely on. BP denotes “biological process,” while MF denotes “molecular function” of GO. The fourth column: if we validate that the predicted gene is associated with a CVD, we give the PubMed ID of the corresponding reference; “–” means that we found no literature validation.
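The "over 70%" validation rate quoted in the abstract can be checked against the table above. The short sketch below simply transcribes the Reference PubMed ID column (an empty list stands for “–”) and computes the validated fraction; the variable names are illustrative and not from the source.

```python
# PubMed IDs per predicted gene, transcribed from the predicted CVD genes table.
# An empty list corresponds to "–" (no literature validation found).
references = {
    "CREBBP": ["14724353"],
    "MDM2": ["18375498", "22821713"],
    "HDAC1": ["22226905"],
    "SMAD3": ["22167769", "22633655"],
    "SMAD2": ["20829218", "22049534"],
    "JUN": ["22664133"],
    "BRCA1": ["22186889"],
    "MYC": ["22402364"],
    "SRC": ["22287273"],
    "EP300": ["20375365"],
    "TP53": ["23074332", "22189267"],
    "GRB2": ["12639989"],
    "IKBKG": [],
    "HSP90AA1": [],
    "PIK3R1": [],
    "YWHAZ": [],
    "YWHAQ": [],
}

validated = sum(1 for pmids in references.values() if pmids)
total = len(references)
fraction = validated / total
print(f"{validated} of {total} predictions validated ({fraction:.1%})")
```

With 12 of the 17 predicted genes carrying at least one reference, the fraction is about 70.6%, consistent with the abstract.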
Figure 3. The distribution of GDV similarity of protein pairs in the human PPI network.
Horizontal axis represents GDV-similarities of node pairs in the network in bins of . Vertical axis represents percentages of protein pairs that have a particular GDV-similarity.
Figure 4. Summary of the results.
The Core Diseasome of [22] is overlaid with the results of this study. Green nodes are the Key CVD Genes (from Table 1), which are in the Core Diseasome. Blue nodes are predicted CVD genes (from Table 2) that we validated in the literature and that are in the Core Diseasome. Red nodes are non-validated CVD gene predictions (from Table 2) that are in the Core Diseasome. Triangular nodes are drug targets. Driver genes are bordered in red.
The Key Cardiovascular Disease Genes that are known drug targets.
| Entrez ID | Gene name | Number of Drugs |
| 367 | AR | 40 |
| 2099 | ESR1 | 61 |
| 25 | ABL1 | 11 |
| 1499 | CTNNB1 | 1 |
| 2534 | FYN | 2 |
| 1956 | EGFR | 10 |
The first column: Entrez Gene ID. The second column: Official Gene Symbol. The third column: the number of drugs from Drugbank that target the corresponding gene.
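The table above lists 6 of the 10 Key CVD genes as known drug targets, and the abstract describes this gene set as statistically significantly enriched in drug targets. This page does not state which statistical test was used; a hypergeometric upper-tail test is one common choice for this kind of question, and the sketch below assumes it. The background sizes (`N` genes in the network, `K` of them drug targets) are hypothetical placeholders, not values from the source.

```python
from math import comb

def hypergeom_upper_tail(N, K, n, k):
    """P(X >= k) when n genes are drawn without replacement from N genes,
    K of which are drug targets (hypergeometric upper tail)."""
    total = comb(N, n)
    hits = sum(comb(K, i) * comb(N - K, n - i) for i in range(k, min(n, K) + 1))
    return hits / total

# n = 10 Key CVD genes, k = 6 of them drug targets (from the tables on this page).
# N and K below are HYPOTHETICAL background sizes chosen only for illustration.
p_value = hypergeom_upper_tail(N=10000, K=1500, n=10, k=6)
print(f"p = {p_value:.3g}")
```

A small p-value would indicate that observing at least 6 drug targets among 10 genes is unlikely under random sampling from the assumed background; the actual value depends entirely on the real background counts.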