| Literature DB >> 24743187 |
Anirban Mukhopadhyay1, Ujjwal Maulik2.
Abstract
Protein-protein interaction network-based study of viral pathogenesis has been gaining popularity among computational biologists in recent days. In the present study we attempt to investigate the possible pathways of hepatitis-C virus (HCV) infection by integrating the HCV-human interaction network, human protein interactome and human genetic disease association network. We have proposed quasi-biclique and quasi-clique mining algorithms to integrate these three networks to identify infection gateway host proteins and possible pathways of HCV pathogenesis leading to various diseases. Integrated study of three networks, namely HCV-human interaction network, human protein interaction network, and human proteins-disease association network reveals potential pathways of infection by the HCV that lead to various diseases including cancers. The gateway proteins have been found to be biologically coherent and have high degrees in human interactome compared to the other virus-targeted proteins. The analyses done in this study provide possible targets for more effective anti-hepatitis-C therapeutic involvement.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24743187 PMCID: PMC3990553 DOI: 10.1371/journal.pone.0094029
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1The diagrammatic representation of the proposed study.
The orange circles represent the HCV proteins. The blue circles represent the human proteins. The pink circles represent the diseases. The green edges represent the interaction between HCV proteins and human proteins. The black edges represent the interactions among human proteins. The violet edges represent the associations between human proteins and diseases. The quasi-bicliques and bicliques are shown also. The quasi-biclique in the HCV-human bipartite network overlaps with the quasi-clique in the human protein interaction network. The quasi-clique in the human protein interaction network overlaps with the quasi-biclique in the human protein-disease association network.
Figure 2Distribution of interactions in the Hepatitis-C-Human bipartite interaction network with respect to the 11 HCV proteins.
The HCV protein NS3 interacts with maximum number of human proteins (218), whereas NS2 is found to interact with minimum number of human proteins (8). Among the other HCV proteins, NS5A and CORE have reasonable number of interactions with the human proteins (115 and 94, respectively).
Figure 3Distribution of associations in the human gene-disease association network.
The left hand side figure shows the distribution of associations with respect to all the disease. The right hand side figure shows the distribution of associations with respect to all the genes.
Quasi-bicliques found from HCV-human protein interaction database.
| Quasi-biclique | HCV proteins | Human proteins | Density |
|
| Count: 3 | Count: 28 | |
| CORE, NS3, NS5A | EFEMP1, EIF2AK2, FBLN2, FBLN5, FTH1, HIVEP2, HNRNPK, JAK1, KPNA1, LTBP4, MAGED1, NAP1L1, NAP1L2, PSMB9, PSME3, RNF31, SMAD3, STAT1, STAT3, TBP, TLR2, TP53, TP53BP2, TRADD, TRAF2, TXNDC11, VIM, VWF | 0.6786 | |
|
| Count: 5 | Count: 10 | |
| E1, E2, NS2, NS4A, NS5B | CALR, CANX, CD209, CLEC4M, HOXD8, HSPA5, LTF, NR4A1, SETD2, UBQLN1 | 0.5400 |
The HCV proteins and human proteins involved in the quasi-bicliques are reported along with the densities of the quasi-bicliques.
Figure 4Human protein interactome induced by first quasi-biclique QB1.
The interactome consists of 120 human proteins and 509 interactions among them. The density of the interactome is nearly 0.07.
Quasi-cliques found from human protein interactome that overlap with the human proteins involved in the first quasi-biclique of Table 1.
| Quasi-clique | Human proteins | Density | Overlapping proteins with first quasi-clique |
|
| Count: 8 | ||
| POMP, PSMA2, PSMB10, PSMB7, PSMB8, PSMB9, PSME3, RFWD2 | 0.6786 | PSMB9, PSME3 | |
|
| Count: 14 | ||
| BIRC2, BIRC3, CASP8, FADD, GATA5, MAP3K5, RIPK1, TNFRSF1A, TNFRSF1B, TRADD, TRAF1, TRAF2, UBC, VIM | 0.6484 | TRADD, TRAF2, VIM | |
|
| Count: 23 | ||
| CIP, EDF1, GTF2A1, GTF2A2, GTF2B, GTF2E1, GTF2F1, HNRNPK, MYST1, SETD7, SF3A2, TAF1, TAF10, TAF11, TAF12, TAF13, TAF2, TAF2E, TAF3, TAF4, TAF5, TAF7, TBP | 0.6324 | HNRNPK, TBP | |
|
| Count: 8 | ||
| HDAC1, HIPK2, MDM2, MDM4, SUMO1, TP53, UBE2I, USp7 | 0.6429 | TP53 | |
|
| Count: 8 | ||
| EGFR, IL6ST, JAK1, PIAS3, SRC, STAT1, STAT2, STAT3 | 0.7143 | JAK1, STAT1, STAT3 |
The human proteins involved in the quasi-cliques are reported along with the densities of the quasi-cliques and the overlapping human proteins with the first quasi-biclique.
Figure 5Human protein interactome induced by second quasi-biclique QB2.
The interactome consists of 79 human proteins and 693 interactions among them. The density of the interactome is nearly 0.22.
Quasi-cliques found from human protein interactome that overlap with the human proteins involved in the second quasi-biclique of Table 1.
| Quasi-clique | Human proteins | Density | Overlapping proteins with second quasi-biclique |
|
| Count: 4 | ||
| PLOD1, PLOD2, PLOD3, SETD2 | 0.8333 | SETD2 | |
|
| Count: 5 | ||
| NBL1, PSMD4, UBA52, UBC, UBQLN1 | 0.7000 | UBQLN1 | |
|
| Count: 45 | ||
| BCL2, CD3D, CREBBP, EP300, ESR1, ESR2, ESRRA, ESRRB, ESRRG, FOSB, GNG2, HNF4A, HNF4G, MAPK7, MEF2D, NFATC2, NR0B2, NR1D1, NR1D2, NR1H2, NR2C1, NR2C2, NR2C2AP, NR2E1, NR2F1, NR2F6, NR4A1, NR4A2, NR5A1, NRBP1, POMC, PPARA, PPARD, PPARG, RARA, RARB, RARG, RORA, RORB, RORC, RXRA, RXRG, THRA, THRB, VDR | 0.6364 | NR4A1 | |
|
| Count: 5 | ||
| APOB, C1QA, C1QB, C1QC, CALR | 0.7000 | CALR |
The human proteins involved in the quasi-cliques are reported along with the densities of the quasi-cliques and the overlapping human proteins with the second quasi-biclique.
The significant important GO terms and KEGG pathways found in the quasi-cliques.
| Quasi-clique | Significant GO terms | KEGG Pathway | ||
| Biological Process | Molecular Function | Cellular Component | ||
|
| negative regulation of ubiquitin-protein ligase activity during mitotic cell cycle | threonine-type endopeptidase activity | proteasome complex | Proteasome |
| (p-value: 4.6e-11, 75%) | (p-value: 6.1e-11, 62.5%) | (p-value: 6.4e-14, 87.5%) | (p-value: 3.1e-12, 87.5%) | |
|
| apoptosis | death domain binding | membrane raft | Apoptosis |
| (p-value: 8.9e-14, 85.7%) | (p-value: 5.0e-3, 14.3%) | (p-value: 3.9e-8, 42.9%) | (p-value: 1.1e-10, 57.1%) | |
| programmed cell death | death-inducing signaling complex | pathways in cancer | ||
| (p-value: 1.1e-13, 85.7%) | (p-value: 5.5e-6, 21.4%) | (p-value: 3.6e-4, 42.9%) | ||
|
| transcription initiation from RNA polymerase II promoter | general RNA polymerase II transcription factor activity | DNA-directed RNA polymerase II, holoenzyme | Basal transcription factors |
| (p-value: 6.0e-29, 71.4%) | (p-value: 4.9e-20, 52.4%) | (p-value: 4.1e-30, 76.2%) | (p-value: 1.8e-29, 71.4%) | |
|
| negative regulation of transcription | enzyme binding | PML body | p53 signaling pathway |
| (p-value: 1.0e-8, 87.5%) | (p-value: 1.2e-3, 50.0%, ) | (p-value: 1.6e-7, 50.0%) | (p-value: 1.0e-3, 37.5%) | |
| Chronic myeloid leukemia | ||||
| (p-value: 1.3e-3, 37.5%) | ||||
|
| protein kinase cascade | protein tyrosine kinase activity | dendrite | Jak-STAT signaling pathway |
| (p-value: 2.8e-9, 87.5%) | (p-value: 3.3e-3, 37.5%) | (p-value: 7.4e-2, 25.0%) | (p-value: 4.9e-7, 75.0%) | |
| Pancreatic cancer | ||||
| (p-value: 9.1e-5, 50.0%) | ||||
|
| oxidation reduction | procollagen-lysine 5-dioxygenase activity | endoplasmic reticulum | Lysine degradation |
| (p-value: 1.0e-4, 100.0%) | (p-value: 1.1e-7, 75.0%) | (p-value: 5.6e-3, 75.0%) | (p-value: 6.0e-7, 100.0%) | |
|
| anaphase-promoting complex-dependent proteasomal ubiquitin-dependent protein catabolic process | structural constituent of ribosome | cytosolic small ribosomal subunit | |
| (p-value: 2.3e-5, 60.0%) | (p-value: 2.6e-2, 40.0%) | (p-value: 1.2e-2, 40.0%) | – | |
| proteasome complex | ||||
| (p-value: 1.9e-2, 40.0%) | ||||
|
| regulation of transcription, DNA-dependent | steroid hormone receptor activity | nuclear lumen | Pathways in cancer |
| (p-value: 4.3e-27, 84.4%) | (p-value: 6.1e-75, 73.3%) | (p-value: 1.4e-3, 20.0%) | (p-value: 1.7e-5, 20.0%) | |
| transcription factor activity | transcription factor complex | PPAR signaling pathway | ||
| (p-value: 7.7e-37, 84.4%) | (p-value: 4.7e-3, 8.9%) | (p-value: 1.3e-4, 11.1%) | ||
|
| protein maturation | carbohydrate binding | extracellular space | Prion diseases |
| (p-value: 2.8e-6, 80.0%) | (p-value: 5.4e-2, 40.0%) | (p-value: 5.9e-4, 80.0%) | (p-value: 1.4e-4, 60.0%) | |
| humoral immune response mediated by circulating immunoglobulin | Complement and coagulation cascades | |||
| (p-value: 3.0e-5, 60.0%) | (p-value: 5.4e-4, 60.0%) | |||
| Quasi-clique | Significant GO terms | KEGG Pathway | ||
| Biological Process | Molecular Function | Cellular Component | ||
|
| negative regulation of ubiquitin-protein ligase activity during mitotic cell cycle | threonine-type endopeptidase activity | proteasome complex | Proteasome |
| (p-value: 4.6e-11, 75%) | (p-value: 6.1e-11, 62.5%) | (p-value: 6.4e-14, 87.5%) | (p-value: 3.1e-12, 87.5%) | |
|
| apoptosis | death domain binding | membrane raft | Apoptosis |
The significant terms are mentioned along with their significance p-values and percentage of proteins associated with each term. DAVID online tool has been used to perform the significance tests.
Quasi-bicliques found for human protein-disease association network corresponding to four quasi-cliques.
| Quasi-biclique | Corresponding | Human proteins | Diseases | Density |
|
|
| Count: 2 | Count: 5 | |
| PSMB8, PSMB9 | Graves disease, diabetes (type 1), interferon response, psoriasis, malaria; hypoglycemia; hyperparasitemia | 0.7000 | ||
|
|
| Count: 2 | Count: 12 | |
| TNFRSF1A, TNFRSF1B | Crohn's disease, ulcerative colitis, cystic fibrosis, Lupus, Rheumatoid Arthritis, diabetes (type 2), amyloidosis, breast cancer, Tumor necrosis factor receptor-associated periodic syndrome, bone density, bone mass, obesity | 0.7083 | ||
|
|
| Count: 2 | Count: 9 | |
| TP53, MDM2 | DNA Damage | Lung Neoplasms, B-Cell Chronic Lymphocytic Leukemia, bladder cancer, breast cancer, colorectal cancer, endometrial cancer, liver cancer, lung cancer, stomach cancer | 1.000 | ||
|
|
| Count: 2 | Count: 5 | |
| EGFR, MDM2 | colorectal cancer, lung cancer, Acute Coronary Syndrome, Breast Neoplasms Carcinoma | Non-Small-Cell Lung | Exanthema | Lung Neoplasms | 0.7000 |
The human proteins and diseases associated with each quasi-biclique are reported along with the densities of the quasi-bicliques.
Significant GO-BP and KEGG pathway terms for viral-human gateway proteins.
| Significant GO-BP terms |
| cytokine-mediated signaling pathway (p-value: 3.7e-5) |
| regulation of apoptosis (p-value: 5.2e-5) |
| regulation of programmed cell death (p-value: 5.5e-5) |
| regulation of cell death (p-value: 5.6e-5) |
| positive regulation of macromolecule metabolic process (p-value: 7.4e-5) |
|
|
| Pancreatic cancer (p-value: 5.5e-4) |
| Pathways in cancer (p-value: 5.6e-3) |
The significant terms are mentioned along with their significance p-values. DAVID online tool has been used to perform the significance tests.