| Literature DB >> 33965637 |
Jayanta Kumar Das1, Subhadip Chakraborty2, Swarup Roy3.
Abstract
Understanding the molecular mechanism of COVID-19 pathogenesis helps in the rapid therapeutic target identification. Usually, viral protein targets host proteins in an organized fashion. The expression of any viral gene depends mostly on the host translational machinery. Recent studies report the great significance of codon usage biases in establishing host-viral protein-protein interactions (PPI). Exploring the codon usage patterns between a pair of co-evolved host and viral proteins may present novel insight into the host-viral protein interactomes during disease pathogenesis. Leveraging the similarity in codon usage patterns, we propose a computational scheme to recreate the host-viral protein-protein interaction network. We use host proteins from seventeen (17) essential signaling pathways for our current work towards understanding the possible targeting mechanism of SARS-CoV-2 proteins. We infer both negatively and positively interacting edges in the network. Further, extensive analysis is performed to understand the host PPI network topologically and the attacking behavior of the viral proteins. Our study reveals that viral proteins mostly utilize codons, rare in the targeted host proteins (negatively correlated interaction). Among them, non-structural proteins, NSP3 and structural protein, Spike (S), are the most influential proteins in interacting with multiple host proteins. While ranking the most affected pathways, MAPK pathways observe to be the worst affected during the SARS-CoV-2 infection. Several proteins participating in multiple pathways are highly central in host PPI and mostly targeted by multiple viral proteins. We observe many potential targets (host proteins) from the affected pathways associated with the various drug molecules, including Arsenic trioxide, Dexamethasone, Hydroxychloroquine, Ritonavir, and Interferon beta, which are either under clinical trial or in use during COVID-19.Entities:
Keywords: Bipartite graph; Cell signaling; Centrality; Codon usage bias; Drugs; Protein interaction network; Relative synonymous codon usage
Mesh:
Year: 2021 PMID: 33965637 PMCID: PMC8102073 DOI: 10.1016/j.jbi.2021.103801
Source DB: PubMed Journal: J Biomed Inform ISSN: 1532-0464 Impact factor: 8.000
SARS-CoV-2 proteins considered for host-viral PPI construction.
| Structural | 4 | Spike (S), Envelope (E), Membrane (M), Nucleocapsid (N) |
| Non-structural | 16 | Nsp1,Nsp2, |
| Accessory | 6 | Orf3a, Orf6, Orf7a, Orf7b, Orf8, Orf10 |
Candidate signaling pathways and the number of host proteins (or genes) participating in the pathway.
| NF-κ B signaling pathway | 105 | Th17 cell differentiationl | 108 |
| Cytokine-cytokine receptor interaction | 295 | TGF-β signaling pathway | 95 |
| TNF signaling pathway | 113 | Toll-like receptor signaling pathway | 105 |
| IL-17 signaling pathway | 95 | HIF-1 signaling pathway | 110 |
| RIG-I-like receptor signaling pathway | 70 | Apoptosis | 137 |
| MAPK signaling pathway | 295 | Insulin signaling pathway | 138 |
| Chemokine signaling pathway | 190 | mTOR signaling pathway | 156 |
| PI3K-Akt signaling pathway | 355 | Adipocytokine signaling pathway | 70 |
| JAK-STAT signaling pathway | 163 |
Fig. 1The host-viral codon usage (RSCU) patterns. The scatter plot shows the RSCU value of 59 codons for viral (X-axis) and host (Y-axis) proteins. The regression line represents the trend of RSCU patterns. Viral protein S showing positive correlation () with host protein TANK and negative correlation () with host protein GDF15.
Performance assessment of the proposed scheme on few virus-host interaction networks.
| SARS-CoV-2 | Network-1 | 23 | 294 | 294 | 60 |
| Network-2 | 21 | 884 | 1106 | 43 | |
| Network-3 | 25 | 203 | 517 | 59 | |
| Epstein-Barr | Virhostome | 19 | 435 | 1486 | 60 |
| Hepatitis-C | VirusMINT | 2 | 15 | 15 | 85 |
| Influenza-A | VirusMINT | 2 | 4 | 4 | 75 |
http://interactome.dfci.harvard.edu/V_hostome/idex.php.
Fig. 2Distribution of RSCU scores for 59 codons for all (a) SARS-CoV-2 proteins; (b) Host proteins.
Fig. 3The host-viral interactions network showing host proteins, which are connected to a single viral protein. In the network, the yellow-color represents viral nodes, whereas the blue and green colors represents host nodes, represent positive and negative interactions, respectively. As shown in the figure, 09 viral proteins (Nsp1, Nsp2, Nsp6, Nsp7, Nsp9, M, N, Orf3a, and Orf7a), 08 viral proteins (Nsp3, Nsp5, Nsp10, Nsp11, Nsp12, Nsp16, Orf8, and Orf10), and 03 viral proteins (Nsp13, Nsp14, and Nsp15) interactions with host proteins are positive, negative and both, respectively.
The list of top few host proteins targeted by number of SARS-CoV-2 interacting viral proteins. For each host protein (Hp), number of interacting viral proteins (IVP) count and calculated average correlation value () are shown. There are total of 40 host proteins (20 for positive interactions and 20 for negative interactions).
| 1 | COL4A5 | 21 | 0.62 | Nsp1, Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp10, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, M, N, Orf3a, Orf7a, Orf8 |
| 2 | STAM2 | 21 | 0.63 | Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp10, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, M, N, Orf3a, Orf6, Orf7a, Orf8 |
| 3 | LIFR | 21 | 0.64 | Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp10, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, M, N, Orf3a, Orf6, Orf7a, Orf8 |
| 4 | IFNAR1 | 20 | 0.59 | Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp10, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, M, N, Orf3a, Orf7a, Orf8 |
| 5 | PPM1B | 20 | 0.61 | Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp10, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, M, N, Orf3a, Orf7a, Orf8 |
| 6 | RPS6KA6 | 20 | 0.62 | Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp10, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, N, Orf3a, Orf6, Orf7a, Orf8 |
| 7 | SOS2 | 20 | 0.63 | Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp10, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, M, N, Orf3a, Orf7a, Orf8 |
| 8 | PKN2 | 20 | 0.66 | Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp10, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, M, N, Orf3a, Orf7a, Orf8 |
| 9 | IRAK4 | 20 | 0.69 | Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp10, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, M, N, Orf3a, Orf7a, Orf8 |
| 10 | IL13RA2 | 19 | 0.61 | Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, M, N, Orf3a, Orf7a, Orf8 |
| 11 | APAF1 | 19 | 0.61 | Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, M, N, Orf3a, Orf7a, Orf8 |
| 12 | CUL2 | 19 | 0.61 | Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, M, N, Orf3a, Orf7a, Orf8 |
| 13 | DNM1L | 19 | 0.63 | Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, M, N, Orf3a, Orf7a, Orf8 |
| 14 | MIOS | 19 | 0.64 | Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp8, Nsp9, Nsp10, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, M, N, Orf3a, Orf7a, Orf8 |
| 15 | BIRC2 | 19 | 0.65 | Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, M, N, Orf3a, Orf7a, Orf8 |
| 16 | RPS6KA3 | 19 | 0.68 | Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp8, Nsp9, Nsp10, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, M, N, Orf3a, Orf7a, Orf8 |
| 17 | PPP1R3A | 19 | 0.70 | Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp10, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, N, Orf3a, Orf7a, Orf8 |
| 18 | SGK3 | 18 | 0.61 | Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp8, Nsp9, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, M, N, Orf3a, Orf7a, Orf8 |
| 19 | PPP3CB | 18 | 0.62 | Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp8, Nsp9, Nsp10, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, N, Orf3a, Orf7a, Orf8 |
| 20 | HIF1A | 18 | 0.62 | Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp8, Nsp9, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, M, N, Orf3a, Orf7a, Orf8 |
| 1 | GDF15 | 19 | −0.62 | Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp10, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, M, Orf3a, Orf6, Orf8 |
| 2 | FGF4 | 19 | −0.60 | Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp10, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, M, Orf3a, Orf6, Orf8 |
| 3 | SHC2 | 19 | −0.58 | Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp10, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, M, Orf3a, Orf6, Orf8 |
| 4 | CEBPB | 18 | −0.59 | Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp10, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, Orf3a, Orf6, Orf8 |
| 5 | IRS2 | 18 | −0.59 | Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp10, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, Orf3a, Orf6, Orf8 |
| 6 | JUN | 18 | −0.58 | Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp8, Nsp9, Nsp10, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, Orf3a, Orf6, Orf817 |
| 7 | EFNA2 | 18 | −0.58 | Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp10, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, Orf3a, Orf6, Orf8 |
| 8 | LPAR5 | 18 | −0.58 | Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp10, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, Orf3a, Orf6, Orf8 |
| 9 | GDF7 | 18 | −0.58 | Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp10, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, Orf3a, Orf6, Orf8 |
| 10 | FZD1 | 18 | −0.58 | Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp10, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, Orf3a, Orf818 |
| 11 | FZD9 | 18 | −0.57 | Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp10, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, Orf3a, Orf6, Orf8 |
| 12 | PPP2R3B | 18 | −0.57 | Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp10, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, Orf3a, Orf6, Orf8 |
| 13 | MAPK8IP2 | 18 | −0.57 | Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp10, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, Orf3a, Orf6, Orf8 |
| 14 | DDIT4 | 18 | −0.57 | Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp10, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, Orf3a, Orf6, Orf8 |
| 15 | NOG | 18 | −0.57 | Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp10, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, Orf3a, Orf6, Orf8 |
| 16 | SMAD6 | 18 | −0.57 | Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp10, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, Orf3a, Orf6, Orf8 |
| 17 | WNT6 | 18 | −0.57 | Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp10, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, Orf3a, Orf6, Orf8 |
| 18 | GREM2 | 18 | −0.57 | Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp10, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, Orf3a, Orf6, Orf8 |
| 19 | BMP7 | 18 | −0.56 | Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp10, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, Orf3a, Orf6, Orf8 |
| 20 | FZD8 | 18 | −0.56 | Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp10, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, Orf3a, Orf6, Orf8 |
Fig. 4(a) Frequency distribution of positive (right) and negative (left) correlation scores for interacting proteins in terms of RSCU based codon usage similarity. (b) Box plot showing the range of correlation values for each viral protein while associated with its target proteins.
Fig. 5(a) The bar chart represents the host protein count for each viral protein based on correlation analysis (p-value ). Pc-positive count, Nc-negative count, positive and negative count are based on positive and negative correlations. (b) Degree distribution of 859 host proteins in terms of number of associated viral proteins (degree) count (x-axis) with host protein frequency (y-axis).
Fig. 6Ranking of 17 candidate signaling pathways. The pathway ranking is done by observing the host protein percentage from pathways that interact with any of the SARS-CoV-2 (26) proteins.
Fig. 7(a) Participation host protein count of central proteins in candidate pathways; (b) Degree distribution of 859 interacting host proteins in terms of number of associated signaling pathways (candidate).
Few top pathway central proteins with the number of pathways they are participating (out of 17 pathways), PPI centrality score and number of viral proteins (Vp) targeting the proteins.
| IKBKB | 13 | 552 | 2 |
| CHUK | 12 | 462 | 11 |
| MAPK3 | 12 | 337 | 16 |
| RELA | 12 | 859 | 10 |
| AKT1 | 11 | 886 | 11 |
| AKT2 | 11 | 113 | 12 |
| AKT3 | 11 | 61 | 15 |
| IKBKG | 11 | 959 | 9 |
| TNF | 11 | 497 | 11 |
| MAPK8 | 9 | 444 | 12 |
| MAPK9 | 9 | 260 | 15 |
| NFKBIA | 9 | 501 | 8 |
| PIK3CA | 9 | 190 | 19 |
| PIK3CB | 9 | 82 | 17 |
| PIK3CD | 9 | 28 | 17 |
| PIK3R1 | 9 | 684 | 5 |
| PIK3R2 | 9 | 190 | 16 |
Few COVID-19 drugs with their actual number of target host proteins from our inferred network, number of targets that are highly central and number of targets involved in candidate pathways.
| 25 | 11 | 8 | 16 | |
| 64 | 10 | 5 | 12 | |
| 9 | 5 | 1 | 11 | |
| 5 | 4 | 0 | 9 | |
| 15 | 3 | 0 | 10 |
Fig. 8Bipartite graph showing 19 drugs linked with 17 signaling pathways. Left and right panel are showing drug name and middle panel is showing signaling pathway name.