Literature DB >> 33965637

A scheme for inferring viral-host associations based on codon usage patterns identifies the most affected signaling pathways during COVID-19.

Jayanta Kumar Das1, Subhadip Chakraborty2, Swarup Roy3.   

Abstract

Understanding the molecular mechanism of COVID-19 pathogenesis helps in the rapid therapeutic target identification. Usually, viral protein targets host proteins in an organized fashion. The expression of any viral gene depends mostly on the host translational machinery. Recent studies report the great significance of codon usage biases in establishing host-viral protein-protein interactions (PPI). Exploring the codon usage patterns between a pair of co-evolved host and viral proteins may present novel insight into the host-viral protein interactomes during disease pathogenesis. Leveraging the similarity in codon usage patterns, we propose a computational scheme to recreate the host-viral protein-protein interaction network. We use host proteins from seventeen (17) essential signaling pathways for our current work towards understanding the possible targeting mechanism of SARS-CoV-2 proteins. We infer both negatively and positively interacting edges in the network. Further, extensive analysis is performed to understand the host PPI network topologically and the attacking behavior of the viral proteins. Our study reveals that viral proteins mostly utilize codons, rare in the targeted host proteins (negatively correlated interaction). Among them, non-structural proteins, NSP3 and structural protein, Spike (S), are the most influential proteins in interacting with multiple host proteins. While ranking the most affected pathways, MAPK pathways observe to be the worst affected during the SARS-CoV-2 infection. Several proteins participating in multiple pathways are highly central in host PPI and mostly targeted by multiple viral proteins. We observe many potential targets (host proteins) from the affected pathways associated with the various drug molecules, including Arsenic trioxide, Dexamethasone, Hydroxychloroquine, Ritonavir, and Interferon beta, which are either under clinical trial or in use during COVID-19.
Copyright © 2021 Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Bipartite graph; Cell signaling; Centrality; Codon usage bias; Drugs; Protein interaction network; Relative synonymous codon usage

Mesh:

Year:  2021        PMID: 33965637      PMCID: PMC8102073          DOI: 10.1016/j.jbi.2021.103801

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   8.000


Introduction

The entire world is passing through an unprecedented pandemic situation due to massive outbreak of Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) infected viral disease, COVID-19. SARS-CoV-2 belongs to the Coronaviridae family, and members of this family are enveloped, non-segmented, and have single-stranded, positive-sense large RNA genomes [1]. This virus rapidly spreads from person to person through respiratory droplets during close physical contact. Other than respiratory system, it is reported to attack the immune system, and other vital cellular machinery leading to multi-organ failure [2]. The need for the hour is utmost crucial for the scientific community to understand the disease pathogenesis of SARS-CoV-2 at genomic and proteomic levels for the rapid development of effective drugs or vaccines to control the COVID-19. Many recent works use host-viral protein–protein interaction network as an input to elucidate potential drug targets or repurposed drug molecules [3], [4], [5]. Host-pathogen protein interactions provide essential insights into the molecular mechanisms of pathogenesis [6]. The host defense mechanism activates signal transduction molecules that initiate signals, which activate immune effector mechanisms to protect the host from any pathogenic infections. Studies show that viral immune modulators perturb the human PPI network by targeting signaling pathways [7] to suppress the immunity in mammalian hosts [8]. To understand the molecular mechanism of pathogenicity of SARS-CoV-2 during COVID-19 disease, investigation of the host-viral protein interactions is important. Knowledge gained through understanding the interactions among viral and host proteins involved in signaling pathways may translate into effective therapies and vaccines. There are four (04) basic categories of chemical signaling found in multi-cellular organisms namely; paracrine signaling, autocrine signaling, endocrine signaling, and signaling by direct contact. Various regulatory signaling pathways are involved in signaling transduction and cellular interactions, many of which are playing an important role during COVID-19. Signal transduction focuses on molecular and functional aspects of viral interactions with host cell signaling, important for the anti-viral response, the viral life cycle, viral pathogenesis, and cell transformation [9]. We aim to study the interaction pattern of SARS-CoV-2 with its target host proteins involved in signaling pathways [10], [11], [12], [13], [14], [15] (see Materials and Method section). Working with them can help in deciphering the possible involvement of pathways and key genes during COVID-19 pathogenesis. The virus-host interactome is essential for understanding virulence factors influencing SARS-CoV-2 pathogenesis [16]. Recent studies reported the use of SARS-CoV-2 and host PPI networks to study the pathogenesis of SARS-CoV-2 and identify repurposed drugs [17], [4], [18]. Several studies have shown that viral proteins interact with hubs in complex host PPI networks [19], [20]. Considering different features of proteins such as sequence homology, gene co-expression, or phylogenetic profiles [21], [22], [23], the pairwise similarity is computed between a pair of proteins to predict a possible interaction between them. In addition to non-structural information, structural data about a pair of proteins appears to be more effective in improving prediction [24], [25], [4]. Several computational methods have been developed to predict PPIs by focusing on protein sequence features [26], [27], [28]. In reality, predicting whether two given proteins are physically interacting or not based on the similarity of different structural and non-structural features is challenging and not feasible due to the expensive experimental setup. In the case of viral genome study, codon usage biases play an important role [29], [30], [31]. Viral gene depends largely on the host translational machinery for their expression [32]. Viral proteins are co-evolved with host proteins. Several studies have reported that physically interacting or functionally associated protein pairs have similar codon usage bias [33], [34], [35], [36], [37], [38], [39]. Therefore, codon usage bias can be utilized in establishing host-viral protein interactions [40], [41]. State-of-the-art methods for inferring interactions do not consider the co-expression or co-adaptation between a pair of virus and host proteins for drawing possible attacking mechanisms of a virus [42], [43]. According to the genome hypothesis proposed by Grantham et al. [44], the pattern of codon usage is species-specific and in some way unique. Interestingly, even in the same genome, the codon usage varies significantly among genes with different expression levels [39], functions [45], and tissue-specific patterns [46]. Few works hypothesized that viral proteins enrich with few codons that are rare in their target host proteins [47], [48]. In this work, we explore host-viral interactions by leveraging the inherent correlation (co-expression or co-adaptation) between viral and host proteins’ codon usage patterns, which applies to any nucleotide (CDS) sequences. To the best of our knowledge, no prior work explored the codon usage similarity to infer host-viral PPI network. We capture both positive and negative interactions in the host-viral PPI. We use host proteins involved in different human cellular signaling pathways that might be affected during COVID-19 disease pathogenesis. Topologically, we try to establish the relevance of the host proteins and highlight a few essential proteins in the network, which are also useful as potential drug targets for certain reported drugs during COVID-19.

Materials and method

This section discusses the proposed scheme for constructing a host-viral PPI network using pair-wise codon usage patterns of host and viral proteins. To analyze the interaction mechanism of SARS-CoV-2 viral proteins in host signaling pathways, we select all the genes involved in 17 candidate signaling pathways. We calculate the RSCU similarity score for all pairs of host-viral candidate proteins to build the network. We further analyze the host PPI network to list highly connected host proteins and highlighted a few potential drugs targeting those proteins for possible repairing of affected pathways during COVID-19.

Data acquisition and processing

Structurally, SARS-CoV-2 consists of three categories of proteins, structural, non-structural, and accessory proteins. We select four (04) structural proteins, sixteen (16) non-structural proteins, and six (06) accessory proteins reported in [49], [50]. The details of the viral proteins are listed in Table 1 (NCBI accession numbers for SARS-CoV-2 proteins: MN908947.3, NC_045512).
Table 1

SARS-CoV-2 proteins considered for host-viral PPI construction.

Protein categoryCountProtein Name
Structural4Spike (S), Envelope (E), Membrane (M), Nucleocapsid (N)
Non-structural16Nsp1,Nsp2,,Nsp16
Accessory6Orf3a, Orf6, Orf7a, Orf7b, Orf8, Orf10
SARS-CoV-2 proteins considered for host-viral PPI construction. As discussed before, we consider seventeen (17) signaling pathways, namely TGF-β, JAK-STAT, PI3K-Akt, MAPK, HIF-1, TNF, NF-κ B, Cytokine-cytokine receptor interaction, Apoptosis, Th17 cell differentiation, Chemokine, Toll-like receptor, RIG-like receptor, IL-17, Insulin Signaling, mTOR, and Adipocytokine, which are reported to associate with COVID-19 and other viral diseases [10], [11], [12], [13], [14], [15], [51], [52], [53]. We search the Kyoto Encyclopedia of Genes and Genomes (KEGG) database 1 for the set of human host proteins (genes) that participated in our selected candidate pathways. We observe a total of 2600 genes involved in the above pathways (Supplementary-A,Table S1). We use 1313 unique genes participating exclusively in our 17 target pathways (Supplementary-B). Our proposed scheme is relying on the codon usage pattern, for which the nucleotide sequence (coding region) of each host protein is obtained from the NCBI database. A good number of genes (total 1274) are also involved in more than one pathways. We summarize our target pathways and the number of genes involved in each pathway in the Table 2 .
Table 2

Candidate signaling pathways and the number of host proteins (or genes) participating in the pathway.

Pathways#Genes involvedPathways#Genes involved
NF-κ B signaling pathway105Th17 cell differentiationl108
Cytokine-cytokine receptor interaction295TGF-β signaling pathway95
TNF signaling pathway113Toll-like receptor signaling pathway105
IL-17 signaling pathway95HIF-1 signaling pathway110
RIG-I-like receptor signaling pathway70Apoptosis137
MAPK signaling pathway295Insulin signaling pathway138
Chemokine signaling pathway190mTOR signaling pathway156
PI3K-Akt signaling pathway355Adipocytokine signaling pathway70
JAK-STAT signaling pathway163
Candidate signaling pathways and the number of host proteins (or genes) participating in the pathway.

Computing Relative Synonymous Codon Usage (RSCU)

The genetic code describes how the 64-nucleotide triplets specify only twenty (20) different translated amino acids. These alternative codons for the same amino acids are termed as synonymous codons. However, most of the amino acids have at least two synonymous codons that are not used at the same frequencies in different genomes. Differences in the frequency of occurrence on synonymous codons in coding DNA is termed as synonymous codon usage bias [54]. Several indices are available to quantify the synonymous codon usage bias [55]. Effective Number of Codons (ENC) focuses on GC content, Rare Codon (RC) focuses on the abundance of low usage codon, and Codon Adaptation Index (CAI) calculates the frequency of the overall codons based on a reference set. They are either partially capturing the usage or generating values based on the relative whole reference set. We are looking for a normalized frequency of codon usage for comparing the variation of usage between host and viral proteins. RSCU is one of the indices for measuring codon bias used to examine synonymous codon usage without the confounding influence of the amino acid composition of different gene products [55]. It is widely used to estimate the codon usage bias [56], [57], [58], [59]. It can be used to quantify the similarity between any two gene sequences by applying any classical proximity measure between a pair of RSCU vectors. The similarity between RSCU vectors may reflect the possible interactions between a couple of proteins in the PPI [58], [60], [57]. RSCU is the ratio between the observed number of occurrences of codons and expected during uniform usage of synonymous codons and calculated as follows.where, is the number of occurrences of the codon for the amino acid, which is encoded by synonymous codons. The RSCU score of a codon more than 1.0 indicates excess usage (biased) of the codon, and less than 1.0 marks low usage of that particular codon. We generate a 59-dimensional RSCU feature vector for each coding protein. We consider the usage pattern of only 59 codons (out of 64 available codons). We ignore 03 stop codons and uniquely coded codons ATG and TGG coded for Met and Trp amino acids, respectively [61]. For RSCU calculation, we use the CAI package [62] available free at 2 . Using the feature vectors, we try to draw the similarity between host and viral proteins to form a network, as discussed next.

Inferring host-viral protein interaction network

Protein–Protein Interactions (PPI) are usually studied computationally from a graph-theoretic perspective [63]. Interactions among different organisms, such as a host and its pathogen, are primarily driven by interactions among the host proteins and pathogen proteins. These interactions can also be represented as host-pathogen PPI. Host-pathogen PPI is usually represented as a bipartite graph where any given interacting pair of nodes (proteins) does not belong to the same organism. This network essentially provides the known interactions of host proteins with pathogen proteins. Pearson’s correlation coefficient () is used to calculate relationship between two variables with different magnitudes [64], [65]. Assume and are the RSCU vectors for a pair of viral and host proteins, respectively. Based on and can be calculated as follows.where, and and are the mean of the vectors and respectively. To determine significantly correlated pair of RSCU vectors, we use 2-tailed p measurement [66]. Two proteins are strongly connected if the p is less than certain cutoff threshold, , i.e. . We use SciPy version 1.5.0 (sipy.stats) 3 for calculating and p value. We consider two (02) kinds of interactions, positive and negative, between a host and viral proteins while inferring the network. Positive interaction indicates possible similar codon usage, whereas a negative score signifies possible rare codon usage by SARS-CoV-2 proteins compared to its interacting host proteins. The Pearson correlation coefficient () is used to determine the possible sign of the inferred edges. An example is shown in Fig. 1 for negative and positive correlations computed between two pairs of host-viral proteins.
Fig. 1

The host-viral codon usage (RSCU) patterns. The scatter plot shows the RSCU value of 59 codons for viral (X-axis) and host (Y-axis) proteins. The regression line represents the trend of RSCU patterns. Viral protein S showing positive correlation () with host protein TANK and negative correlation () with host protein GDF15.

The host-viral codon usage (RSCU) patterns. The scatter plot shows the RSCU value of 59 codons for viral (X-axis) and host (Y-axis) proteins. The regression line represents the trend of RSCU patterns. Viral protein S showing positive correlation () with host protein TANK and negative correlation () with host protein GDF15. Given a set of viral proteins, and host proteins we can create a bipartite graph in the form of adjacency matrix using the above and p values as follows.Next, we investigate the interaction mechanism of SARS-CoV-2 proteins with the proteins involved in certain signaling pathways given in Table 2.

Results and discussion

Benchmarking

To assess the effectiveness of codon usage bias measure in predicting possible viral and host protein interactions, the reported host-viral networks with physically verified interactions are considered. For example, 332 host protein interactions with 27 SARS-CoV-2 proteins [3] (network-1) are reported that utilized affinity purification mass spectrometry (AP-MS) based method to infer the physical interactions. It reports host-viral interactions forming star-like topology, where one host is exclusively interacting with one viral node. Few other similar studies report SARS-CoV-2 viral proteins interactions with more than 1100 [67](network-2) and 200 [16](network-3) host proteins. Altogether, there are total of 294 interactions (network-1), 1106 interactions (network-2), and 517 interactions (network-3) in the above networks, which are available in BioGRID database [68]. We apply our method to these three networks, and we observe approximately interactions (average) for the above three networks (Table 3 ). In addition, we use three (03) other viral-host networks, such as Epstein-Barr from Virhostome 4 , Hepatitis-C and Influenza-A from VirusMINT [69] for validation. We report the performance in Table 3. For more detailed results, one can refer to Supplementary-c.
Table 3

Performance assessment of the proposed scheme on few virus-host interaction networks.

VirusSource#Viral proteins#Host proteins#Host-virus interactionsMatching (%)
SARS-CoV-2Network-1 [3]2329429460
Network-2 [67]21884110643
Network-3 [16]2520351759



Epstein-BarrVirhostomea19435148660
Hepatitis-CVirusMINT [69]2151585
Influenza-AVirusMINT [69]24475

http://interactome.dfci.harvard.edu/V_hostome/idex.php.

Performance assessment of the proposed scheme on few virus-host interaction networks. http://interactome.dfci.harvard.edu/V_hostome/idex.php. Unlike reported physical interactions methods, codon usage infers a possible co-expression between a pair of host and viral proteins computed quantitatively using pairwise RSCU score similarity. Possibly this might be a possible reason for inferring low matching interactions with the true networks. We consider for inferring the above networks.

Comparison of RSCU patterns among viral and host proteins

We report codon usage distribution of 59 codons across 26 viral proteins (SARS-CoV-2) and 1313 host proteins in Fig. 2 (a) and (b), respectively, involved in our candidate signaling pathways. We observe that GGT, AGA, GCT, CCT, GTT, TCT, ACA, CTT, TTA, ACT are the highly used (median RSCU score for each codon) codons in SARS-CoV-2 proteins. On the other hand, CGA, AGC, ACC, CGG, CTG, CCG, ACG, GCG, TCG, GGG rarely used codons. In the host proteins (from 17 signaling pathways), codons such as CTG, GTG, ATC, GCC, CAG, ACC, AGC, GGC, and CCC are highly used (median RSCU score for each codon). The distribution margins of RSCU values of those codons are relatively wider (Fig. 2 (b)). However, CCG, GTT, CGT, GCG, TCG, CAA, CTA, ATA, GTA, TTA rarely used codons in host proteins. It is worth mentioning that for SARS-CoV-2 proteins, highly used codons are ending (third position of codon) with T or A that shows similar characteristics with Nipah virus [54], SARS-CoV [70], and coronavirus N genes [71]. But for host proteins from candidate signaling pathways, the highly used codons are ending with G or C at the third position of the codons.
Fig. 2

Distribution of RSCU scores for 59 codons for all (a) SARS-CoV-2 proteins; (b) Host proteins.

Distribution of RSCU scores for 59 codons for all (a) SARS-CoV-2 proteins; (b) Host proteins.

Analysis of host-viral inferred networks

We predict the host-viral (SARS-CoV-2) interaction graph based on the Eq. 3 involving 26 SARS-CoV-2 proteins with 1313 host proteins participating in 17 signaling pathways. Out of 34138 () maximum possible interactions, our method infers 9412 ( 36%) strong interactions. In our network, 859 distinct host proteins ( 66%) are connected to at least one viral protein. We set for deciding the strong interaction (edge) between two proteins. Interestingly, our inferred network reveals that out of 859 host proteins, a total of 779 proteins is targeted by more than one viral proteins. A snapshot of isolated networks with one (viral) to many (host) interactions are shown in Fig. 3 between viral and host proteins.
Fig. 3

The host-viral interactions network showing host proteins, which are connected to a single viral protein. In the network, the yellow-color represents viral nodes, whereas the blue and green colors represents host nodes, represent positive and negative interactions, respectively. As shown in the figure, 09 viral proteins (Nsp1, Nsp2, Nsp6, Nsp7, Nsp9, M, N, Orf3a, and Orf7a), 08 viral proteins (Nsp3, Nsp5, Nsp10, Nsp11, Nsp12, Nsp16, Orf8, and Orf10), and 03 viral proteins (Nsp13, Nsp14, and Nsp15) interactions with host proteins are positive, negative and both, respectively.

The host-viral interactions network showing host proteins, which are connected to a single viral protein. In the network, the yellow-color represents viral nodes, whereas the blue and green colors represents host nodes, represent positive and negative interactions, respectively. As shown in the figure, 09 viral proteins (Nsp1, Nsp2, Nsp6, Nsp7, Nsp9, M, N, Orf3a, and Orf7a), 08 viral proteins (Nsp3, Nsp5, Nsp10, Nsp11, Nsp12, Nsp16, Orf8, and Orf10), and 03 viral proteins (Nsp13, Nsp14, and Nsp15) interactions with host proteins are positive, negative and both, respectively. Similar researches on SARS-CoV-2 host protein interactions [3] shows viral protein oriented star-like topology only and unable to report any host protein oriented multiple interactions. We report a list of such highly connected host proteins with the viral proteins (total of 15) in Table 4 . Many (viral) to one (host) interactions are also reported (Supplementary-D).
Table 4

The list of top few host proteins targeted by number of SARS-CoV-2 interacting viral proteins. For each host protein (Hp), number of interacting viral proteins (IVP) count and calculated average correlation value () are shown. There are total of 40 host proteins (20 for positive interactions and 20 for negative interactions).

Sl. No.HpIVP countAvg. (ρ)SARS-CoV-2 interacting viral proteins
1COL4A5210.62Nsp1, Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp10, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, M, N, Orf3a, Orf7a, Orf8
2STAM2210.63Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp10, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, M, N, Orf3a, Orf6, Orf7a, Orf8
3LIFR210.64Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp10, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, M, N, Orf3a, Orf6, Orf7a, Orf8
4IFNAR1200.59Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp10, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, M, N, Orf3a, Orf7a, Orf8
5PPM1B200.61Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp10, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, M, N, Orf3a, Orf7a, Orf8
6RPS6KA6200.62Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp10, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, N, Orf3a, Orf6, Orf7a, Orf8
7SOS2200.63Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp10, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, M, N, Orf3a, Orf7a, Orf8
8PKN2200.66Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp10, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, M, N, Orf3a, Orf7a, Orf8
9IRAK4200.69Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp10, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, M, N, Orf3a, Orf7a, Orf8
10IL13RA2190.61Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, M, N, Orf3a, Orf7a, Orf8
11APAF1190.61Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, M, N, Orf3a, Orf7a, Orf8
12CUL2190.61Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, M, N, Orf3a, Orf7a, Orf8
13DNM1L190.63Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, M, N, Orf3a, Orf7a, Orf8
14MIOS190.64Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp8, Nsp9, Nsp10, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, M, N, Orf3a, Orf7a, Orf8
15BIRC2190.65Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, M, N, Orf3a, Orf7a, Orf8
16RPS6KA3190.68Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp8, Nsp9, Nsp10, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, M, N, Orf3a, Orf7a, Orf8
17PPP1R3A190.70Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp10, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, N, Orf3a, Orf7a, Orf8
18SGK3180.61Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp8, Nsp9, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, M, N, Orf3a, Orf7a, Orf8
19PPP3CB180.62Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp8, Nsp9, Nsp10, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, N, Orf3a, Orf7a, Orf8
20HIF1A180.62Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp8, Nsp9, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, M, N, Orf3a, Orf7a, Orf8



1GDF1519−0.62Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp10, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, M, Orf3a, Orf6, Orf8
2FGF419−0.60Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp10, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, M, Orf3a, Orf6, Orf8
3SHC219−0.58Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp10, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, M, Orf3a, Orf6, Orf8
4CEBPB18−0.59Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp10, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, Orf3a, Orf6, Orf8
5IRS218−0.59Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp10, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, Orf3a, Orf6, Orf8
6JUN18−0.58Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp8, Nsp9, Nsp10, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, Orf3a, Orf6, Orf817
7EFNA218−0.58Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp10, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, Orf3a, Orf6, Orf8
8LPAR518−0.58Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp10, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, Orf3a, Orf6, Orf8
9GDF718−0.58Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp10, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, Orf3a, Orf6, Orf8
10FZD118−0.58Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp10, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, Orf3a, Orf818
11FZD918−0.57Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp10, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, Orf3a, Orf6, Orf8
12PPP2R3B18−0.57Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp10, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, Orf3a, Orf6, Orf8
13MAPK8IP218−0.57Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp10, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, Orf3a, Orf6, Orf8
14DDIT418−0.57Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp10, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, Orf3a, Orf6, Orf8
15NOG18−0.57Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp10, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, Orf3a, Orf6, Orf8
16SMAD618−0.57Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp10, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, Orf3a, Orf6, Orf8
17WNT618−0.57Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp10, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, Orf3a, Orf6, Orf8
18GREM218−0.57Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp10, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, Orf3a, Orf6, Orf8
19BMP718−0.56Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp10, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, Orf3a, Orf6, Orf8
20FZD818−0.56Nsp2, Nsp3, Nsp4, Nsp5, Nsp6, Nsp7, Nsp8, Nsp9, Nsp10, Nsp12, Nsp13, Nsp14, Nsp15, Nsp16, S, Orf3a, Orf6, Orf8
The list of top few host proteins targeted by number of SARS-CoV-2 interacting viral proteins. For each host protein (Hp), number of interacting viral proteins (IVP) count and calculated average correlation value () are shown. There are total of 40 host proteins (20 for positive interactions and 20 for negative interactions).

Distribution of correlation scores

Statistically, it is also important to study the distribution of correlation values (both positive and negative) between pairs of proteins in terms of codon usage patterns. From the distribution plot given in Fig. 4 reveals that the host-viral codon usage pattern (edge correlation) shows non-normal distribution pattern (with for positive correlation, and for negative correlation based on normality test, performed using SciPy.stats.normaltest3) [72], [73]. The negative correlation is varied in the range [-.73, −4.18], which covers 6325 (67%) interactions, and a positive correlation is varied in the range [4.18, 8.44], which covers 3087 (33%) interactions. So, positive correlation exhibits a wider range of values than the negative range.
Fig. 4

(a) Frequency distribution of positive (right) and negative (left) correlation scores for interacting proteins in terms of RSCU based codon usage similarity. (b) Box plot showing the range of correlation values for each viral protein while associated with its target proteins.

(a) Frequency distribution of positive (right) and negative (left) correlation scores for interacting proteins in terms of RSCU based codon usage similarity. (b) Box plot showing the range of correlation values for each viral protein while associated with its target proteins. We further look into the correlation value distribution of a viral protein interacting with its target proteins. We report the correlation value range (both positive and negative) for 26 viral proteins in Fig. 4. While fixing at high (significance level) value, correlation values also appear to be significant which are ranging between ±.05 and above. Except few, most of the viral proteins are participated in the network, both positively and negatively. Viral proteins, Orf10 and Nsp10 are interacted with their targets, negatively. Similarly, viral proteins like N, Nsp1 and Orf7b are interacted, positively. Based on correlation analysis, we may confirm that while a viral protein targets its host, it mimics similar codon usage of its target to uphold the expression of target host protein. Similarly, viral proteins use a set of codons that are rarely used in their targets to down-regulate the expression of its target. We observe in the case of host proteins, involved in signaling pathways, the majority of SARS-CoV-2 proteins aimed to break down the normal pathways by down regulating the key proteins involved in such pathways.

Degree distribution of host and viral proteins

In any interacting network, the node’s degree conveys essential information about the node’s influence within the network. In the case of host-viral PPI, a high degree viral protein (highly connected) may be a critical protein that influences the functional activities of a number of host proteins. Pharmacologically, identifying such (hub) proteins may help in designing a small molecule that may bind with it to inhibit its influence during disease pathogenesis. The same may be applicable to host proteins. If host protein have a high degree, it indicates that more viral proteins target the host protein. However, it may require further investigation about its importance in its own network, i.e., host–host protein network. If a host protein is significant concerning its degree, suitable repurposed drug molecules may be identified for the same. While focusing on highly interacting viral proteins, interestingly, we observe that the maximum number of highly interacting proteins belongs to the non-structural family. In the case of structural proteins, S is a highly interacting (more than 600) protein. Out of accessory proteins, Orf8 shows a maximum interaction count next to protein S. We report the degree distribution for each of the viral proteins from our network in Fig. 5 (a). From the figure, it can be observed that majority of the viral proteins carrying a high node degree. Out of all the SARS-CoV-2 proteins, Nsp3 shows the maximum degree ( 700), which interacts with more than 80% of the candidate host proteins involved in 17 different signaling pathways. Concerning negative edges, Nsp3 is still on top, followed by Nsp16, Nsp13, and few others. While considering positive edges, S, Nsp6, and Orf7a are found to be highly interactive. Few viral proteins like Nsp11, Orf7b, E, Nsp1, Orf10, and Nsp11 are comparatively less interactive.
Fig. 5

(a) The bar chart represents the host protein count for each viral protein based on correlation analysis (p-value ). Pc-positive count, Nc-negative count, positive and negative count are based on positive and negative correlations. (b) Degree distribution of 859 host proteins in terms of number of associated viral proteins (degree) count (x-axis) with host protein frequency (y-axis).

(a) The bar chart represents the host protein count for each viral protein based on correlation analysis (p-value ). Pc-positive count, Nc-negative count, positive and negative count are based on positive and negative correlations. (b) Degree distribution of 859 host proteins in terms of number of associated viral proteins (degree) count (x-axis) with host protein frequency (y-axis). We show the degree distribution of 859 host proteins in Fig. 5 (b), interacting with 26 viral proteins. From the distribution plot, we can observe that majority (82) of the host proteins are connected with only one viral node. While considering host proteins targeted by multiple viral proteins, we observe less than ten (10) such proteins are highly connected proteins with the degree 21 (maximum within the network). Even though our network is a bipartite graph, we observe that the number of low-degree nodes is high, fav minor in number, which is somehow follows the scale-free properties [74] of a complex network. We observe relatively good host nodes possessing a degree within the range of 11 to 18.

Ranking highly targeted signaling pathways during COVID-19

To study the most affected pathways in our 17 candidate set of pathways, we rank them based on the percentage of host proteins targeted by any SARS-CoV-2 viral protein (out of total proteins involved in those pathways) and report in Fig. 6 .
Fig. 6

Ranking of 17 candidate signaling pathways. The pathway ranking is done by observing the host protein percentage from pathways that interact with any of the SARS-CoV-2 (26) proteins.

Ranking of 17 candidate signaling pathways. The pathway ranking is done by observing the host protein percentage from pathways that interact with any of the SARS-CoV-2 (26) proteins. The topmost pathway is the Mitogen-Activated Protein Kinase (MAPK) signaling pathway. Different SARS-CoV-2 proteins target more than 50% of proteins from this pathway. This pathway is associated with the COVID-19 immune response [75] and involves in papain-like protease activation of promoter as observed in the SARS coronavirus [76]. MAPK proteins communicate signals from a receptor on the cell’s surface to the DNA in the cell’s nucleus, essential from a viral infection point of view. Further, MAPK proteins are involved in a series of vital signal transduction pathways that regulate processes such as cell proliferation, cell differentiation, and cell death in humans. Besides MAPK, other ranked signaling pathways are significantly affected during COVID-19 infection. Under physiological conditions, adipokines act mainly in adipose tissue (paracrine or autocrine) or circulate through the blood to distant target organs, regulating their growth and development, metabolism, and tissue remodeling. However, adipokines’ synthesis and secretion are disordered under pathological conditions, leading to obesity, diabetes, heart disease, and other metabolic disorders. Our results show that the adipocytokine pathway is affected by COVID-19. It implicates that patients with comorbid conditions like diabetes and heart disease may show the worst disease aggression, which is already observed in various reports. The mTOR pathway [12] is a central regulator of mammalian metabolism and physiology, with essential roles in tissues’ function, including liver, muscle, white and brown adipose tissue, and the brain. It is dysregulated in human diseases, such as diabetes, obesity, depression, aging-related problems, and certain cancers. Our result corroborates with the same, and it has reported that aged patients are more prone to the infection due to the dysregulation of the mTOR pathway or some other unknown reasons. It has been observed that some COVID-19 affected deaths are due to multiple organ failures. HIF1 [89], [90] and RIG1 [91] like receptor pathways are involved in normal immunoregulation and various organ functioning. Dysregulation may cause immune compromisation and multiple organ failure through ischaemic heart disease, acute lung injury, pulmonary hypertension, pulmonary fibrosis, chronic obstructive pulmonary disease (COPD), acute liver failure, liver fibrosis, and acute kidney injury, etc. Our result also supports these findings. In our ranking, the fourth most affected pathway is the TGF- (Transforming growth factor-beta) [92], which is a multi-functional cytokine belonging to the transforming growth factor superfamily that includes three different mammalian isoforms (TGF- 1 to 3, HGNC symbols TGFB1, TGFB2, TGFB3) and many other signaling proteins. All-white blood cell lineages produce TGF-β proteins. This pathway activates different downstream substrates and regulatory proteins, inducing transcription of various target genes that function in differentiation, chemotaxis, proliferation, and activation of many immune cells.

Centrality analysis of targeted host proteins and candidate signaling pathways

Studies on human host-viral protein interactions reveal that viruses tends to target- attacks towards host proteins [77], [19], [78] by interacting with key (central) host proteins. We consider a host protein important if it interacts with many other host proteins in host-host protein network. We use BioGRID [68] to calculate the centrality score of our candidate host proteins 5 . We report the top 100 central proteins in Supplementary-A (Table S2). We observe that a good number of interacting host proteins in our network are highly central in their own (host) PPI. A common set of viral proteins targets central genes, and such proteins are involved in multiple pathways. For instance, if we consider few top central proteins, MYC (2843), TRIM25 (2656), EGFR (2452), BRCA1 (2236), MDM2 (2219), NTRK1 (2030), KRAS (1944), ELAVL1 (1914) and HSP90AA1 (1734), they are found to be targeted jointly by the viral proteins such Nsp2, Nsp3, Nsp4, Nsp5, Nsp8, Nsp10, Nsp12, Nsp13. If we consider the most central proteins in our candidate pathways, we observe that the PI3K-Akt signaling pathway (36 target proteins) and MAPK signaling pathway (35 target proteins) contain most of the central proteins targeted by the viral proteins. A pathway may be more crucial from the disease pathogenesis perspective if it contains highly central proteins targeted by viral proteins. Moving one step ahead, we may rank our 17 pathways based on the number of participating central proteins (out of the top 100 centrality list) in the above pathways and shown in Fig. 7 (a). More details about the top 100 central host proteins are listed in Supplementary-A (Table S2). Interestingly, in terms of the number of target proteins, which are also central in host-host PPI, the signaling pathway MAPK is one of the worst affected pathways among 17 candidate pathways. In addition to PPI centrality, we study the pathway centrality of the host proteins regarding our 17 signaling pathways. Prior researches also identified an exciting fact that viral proteins target host proteins that are pathway central, i.e., participating in multiple pathways [78]. The degree distribution of host proteins in terms of their density of participation in 17 pathways is reported in Fig. 7(b). We observe a nice power-law [74] like distribution where the majority of proteins are participating in only one pathway, and fewer numbers are having high participation in multiple pathways. We list a few top highly pathway-central proteins and few interesting facts in Table 5 . The table shows that the pathway-central proteins are also highly connected in their own PPI and mostly targeted by multiple viral proteins.
Fig. 7

(a) Participation host protein count of central proteins in candidate pathways; (b) Degree distribution of 859 interacting host proteins in terms of number of associated signaling pathways (candidate).

Table 5

Few top pathway central proteins with the number of pathways they are participating (out of 17 pathways), PPI centrality score and number of viral proteins (Vp) targeting the proteins.

Host protein#Pathway centralityPPI centralityInteracting Vp count
IKBKB135522
CHUK1246211
MAPK31233716
RELA1285910
AKT11188611
AKT21111312
AKT3116115
IKBKG119599
TNF1149711
MAPK8944412
MAPK9926015
NFKBIA95018
PIK3CA919019
PIK3CB98217
PIK3CD92817
PIK3R196845
PIK3R2919016
(a) Participation host protein count of central proteins in candidate pathways; (b) Degree distribution of 859 interacting host proteins in terms of number of associated signaling pathways (candidate). Few top pathway central proteins with the number of pathways they are participating (out of 17 pathways), PPI centrality score and number of viral proteins (Vp) targeting the proteins.

Quantitative association of key pathway proteins and drugs

To investigate further the significance of key proteins in our network, we analyse protein-drug association. We primarily consider approved or under-trial drugs that are in use during COVID-19. We searched online drug target resource database 6 to count hits with different key proteins in our network (Supplementary-A, Table S3). We ranked those drugs based on their counts of protein targets in our network (Table 6 ). A good number of drugs are also observed to be associated with central proteins that are not reported so far used in COVID-19. The list of such drugs is given in Supplementary-A (Table S3). It can be observed from Table 6 that a single drug is having targets in multiple pathways forming a bipartite graph as shown in Fig. 8 .
Table 6

Few COVID-19 drugs with their actual number of target host proteins from our inferred network, number of targets that are highly central and number of targets involved in candidate pathways.

Drug Name#Actual targets#Targets in#Targets (central) in#Involved
inferred networkstop-100 host PPIpathway
Arsenic trioxide2511816
Dexamethasone6410512
Hydroxychloroquine95111
Interferon beta5409
Ritonavir153010
Fig. 8

Bipartite graph showing 19 drugs linked with 17 signaling pathways. Left and right panel are showing drug name and middle panel is showing signaling pathway name.

Few COVID-19 drugs with their actual number of target host proteins from our inferred network, number of targets that are highly central and number of targets involved in candidate pathways. Bipartite graph showing 19 drugs linked with 17 signaling pathways. Left and right panel are showing drug name and middle panel is showing signaling pathway name. We discuss below the drugs that are associated with COVID-19 and having potential target host proteins involved in our inferred host-viral networks. Arsenic trioxide is a widely known chemical used for multiple disease conditions. The Ministry of Ayush 7 , Govt. Of India, advised for Arsenicum album 30 as a potential homeopath drug for COVID-19. Arsenic trioxide is a mother tincture of Arsenicum album 30, used as a homeopath medicine. Symptoms like severe respiratory adverse effects frequently occur in patients with promyelocytic leukemia. Arsenic trioxide could be used in consolidation therapy [79], [80]. If we consider central genes (present in the top 100 list) involved in MAPK pathways, we observe that six target proteins (RARA, FGFR1, IKBKB, CCND1, CDKN1A, JUN, MAPK3, AKT1) are the excellent target of this chemical. Interestingly, all such proteins are targeted negatively by viral proteins. In a comorbid situation where these signaling pathway genes are already perturbed, arsenic trioxide may play a protective role in boosting up the immunity and other unknown vital regulators that are yet to discover. In addition to MAPK, several targets are present in PI3K-Akt, TNF, and Apoptosis signaling pathways. Dexamethasone is another most widely used COVID-19 drug with 64 target genes8 . This is the first drug to show life-saving efficacy in patients infected with COVID-19 [81] and widely utilized in a large trial in the UK [82] Our result shows that NTRK1, HSPA8, SMAD3, VCAM1, and RARA are the targets (central) for the drug involved in MAPK, PI3K-Akt, Th17 cell differentiation, TGF-β, and NF-κ B. In addition to that, Dexamethasone also targets a few other interacting host proteins (low centrality), JUNB, LIF, CD86, SLC2A4, and IRS2. Dexamethasone is predicted to maintain these signaling pathways’ normal functioning and shows protection against COVID-19 symptoms, as we assume from our results. Hydroxychloroquine is another important drug, has been widely utilized for COVID-19 treatment [83], [84]. The only central target is TNF, which is present in several signaling pathways. It can rapidly be transcribed in various cell types following exposure to a broad range of pathogens and signals of inflammation and stress [85]. Other low centrality targets are TLR3, TLR7, PTGS2, and TLR9. The other two important drugs recommended by WHO 9 are Ritonavir , and Interferon Alfa B observed in our list used for the COVID-19 clinical trial [86], [87], [88]. Interestingly, we found that ritonavir shows three target central genes (CXCL10, TLR4, IFNL3) in our study. These genes share NF κ B, HIF1, toll-like receptor, PI3K-AKT, JAK-STAT, cytokine-cytokine receptor, TNF, IL7, RIG1 receptor, chemokine signaling pathways. Interferon Alfa 1B is another option for solidarity trials with targets like IFNAR1, IFNAR2, and IL13 genes. These genes participate in PI3K-AKT, toll-like receptor, cytokine-cytokine receptor, JAK-STAT, Il17 pathways. These pathways are essential for maintaining normal immunological functioning, which is thought to dysregulate in COVID-19. We believe that a pathway targeted by different SARS-CoV-2 proteins and involvement of highly central proteins in its own PPI is the most crucial (affected) pathway. It is worth mentioning that the above-highlighted drug molecules are based on quantitative analysis of host proteins from our inferred host-viral network and their hits with the existing drug target database. Hence, our proposed scheme is not a new drug repurposing methodology and needs due attention while designing therapeutic solutions pharmacologically.

Conclusion

This work introduced a novel effort into recreating host-viral PPI. Proposed work explored the codon usage pattern similarity between host proteins that are participating in a few major signaling pathways and SARS-CoV-2 viral proteins. Both positive and negative edges between interacting proteins were inferred, which depict an essential association between viral and host proteins. The inferred network was analyzed topologically, considering nodes’ degree distribution and node centrality. An interesting fact has been observed on how viral proteins are targeting their host proteins. Our analysis highlighted a few drugs already in use for COVID-19, having potential targets in some of the essential host proteins involved in important candidate signaling pathways such as MAPK and PI3K-Akt. Several central proteins were identified (AKT1, CCND1, CDKN1A, FGFR1, HSPA8, IKBKB, JUN, MAPK3, NTRK1, RARA, SMAD3, TNF, and VCAM1), which are involved in critical signaling pathways and targeted by few drug molecules. The topmost few drug molecules highlighted by this study are Arsenic trioxide, Dexamethasone, and Hydroxychloroquine, which might play an influential role in preventing COVID-19 mortality. Our method is generic and useful to draw a more extensive network, covering genes from all critical pathways. Even the method can be applied to any set of host-viral proteins (other than SARS-CoV-2 or Human). Currently, we used a correlation score to measure the similarity between two RSCU vectors. However, other measures such as cosine similarity, mutual information, and ensemble approach might improve the result. One may consider a multi-layer network approach considering viral-viral and host–host networks that may shed better light on the possible viral-host interaction patterns.

Acknowledgement

This research is partly supported by the Department of Science & Technology (DST), Govt. of India under DST-ICPS Data Science program [DST/ICPS/Cluster/Data Science/General]. Thanks to Ms Arpita Roy, for reviewing and correcting the final draft of the manuscript.

CRediT authorship contribution statement

Jayanta Kumar Das: Conceptualization, Data curation, Methodology, Software, Formal analysis, Visualization, Writing - original draft, Writing - review & editing. Subhadip Chakraborty: Software, Validation, Visualization, Writing - original draft, Writing - review & editing. Swarup Roy: Conceptualization, Supervision, Visualization, Writing - original draft, Writing - review & editing.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
  80 in total

1.  Error and attack tolerance of complex networks

Authors: 
Journal:  Nature       Date:  2000-07-27       Impact factor: 49.962

2.  Clinical efficacy of lopinavir/ritonavir in the treatment of Coronavirus disease 2019.

Authors:  X-T Ye; Y-L Luo; S-C Xia; Q-F Sun; J-G Ding; Y Zhou; W Chen; X-F Wang; W-W Zhang; W-J Du; Z-W Ruan; L Hong
Journal:  Eur Rev Med Pharmacol Sci       Date:  2020-03       Impact factor: 3.507

Review 3.  TGF-beta: a master immune regulator.

Authors:  Christopher Larson; Bryan Oronsky; Corey A Carter; Arnold Oronsky; Susan J Knox; David Sher; Tony R Reid
Journal:  Expert Opin Ther Targets       Date:  2020-04-12       Impact factor: 6.902

4.  Multilevel proteomics reveals host perturbations by SARS-CoV-2 and SARS-CoV.

Authors:  Alexey Stukalov; Virginie Girault; Vincent Grass; Ozge Karayel; Valter Bergant; Christian Urban; Darya A Haas; Yiqi Huang; Lila Oubraham; Anqi Wang; M Sabri Hamad; Antonio Piras; Fynn M Hansen; Maria C Tanzer; Igor Paron; Luca Zinzula; Thomas Engleitner; Maria Reinecke; Teresa M Lavacca; Rosina Ehmann; Roman Wölfel; Jörg Jores; Bernhard Kuster; Ulrike Protzer; Roland Rad; John Ziebuhr; Volker Thiel; Pietro Scaturro; Matthias Mann; Andreas Pichlmair
Journal:  Nature       Date:  2021-04-12       Impact factor: 49.962

5.  Structure-based prediction of protein-protein interactions on a genome-wide scale.

Authors:  Qiangfeng Cliff Zhang; Donald Petrey; Lei Deng; Li Qiang; Yu Shi; Chan Aye Thu; Brygida Bisikirska; Celine Lefebvre; Domenico Accili; Tony Hunter; Tom Maniatis; Andrea Califano; Barry Honig
Journal:  Nature       Date:  2012-09-30       Impact factor: 49.962

6.  An improved method for scoring protein-protein interactions using semantic similarity within the gene ontology.

Authors:  Shobhit Jain; Gary D Bader
Journal:  BMC Bioinformatics       Date:  2010-11-15       Impact factor: 3.169

7.  When the human viral infectome and diseasome networks collide: towards a systems biology platform for the aetiology of human diseases.

Authors:  Vincent Navratil; Benoit de Chassey; Chantal Rabourdin Combe; Vincent Lotteau
Journal:  BMC Syst Biol       Date:  2011-01-21

8.  Comprehensive analysis of correlations among codon usage bias, gene expression, and substitution rate in Arachis duranensis and Arachis ipaënsis orthologs.

Authors:  Hui Song; Hongjun Gao; Jing Liu; Pei Tian; Zhibiao Nan
Journal:  Sci Rep       Date:  2017-11-01       Impact factor: 4.379

9.  Cryo-EM structure of the 2019-nCoV spike in the prefusion conformation.

Authors:  Daniel Wrapp; Nianshuang Wang; Kizzmekia S Corbett; Jory A Goldsmith; Ching-Lin Hsieh; Olubukola Abiona; Barney S Graham; Jason S McLellan
Journal:  Science       Date:  2020-02-19       Impact factor: 47.728

10.  Prediction of human-virus protein-protein interactions through a sequence embedding-based machine learning method.

Authors:  Xiaodi Yang; Shiping Yang; Qinmengge Li; Stefan Wuchty; Ziding Zhang
Journal:  Comput Struct Biotechnol J       Date:  2019-12-26       Impact factor: 7.271

View more
  6 in total

Review 1.  Emerging landscape of molecular interaction networks:Opportunities, challenges and prospects.

Authors:  Gauri Panditrao; Rupa Bhowmick; Chandrakala Meena; Ram Rup Sarkar
Journal:  J Biosci       Date:  2022       Impact factor: 2.795

2.  Analyzing host-viral interactome of SARS-CoV-2 for identifying vulnerable host proteins during COVID-19 pathogenesis.

Authors:  Jayanta Kumar Das; Swarup Roy; Pietro Hiram Guzzi
Journal:  Infect Genet Evol       Date:  2021-05-15       Impact factor: 3.342

3.  Novel informatics approaches to COVID-19 Research: From methods to applications.

Authors:  Hua Xu; David L Buckeridge; Fei Wang; Peter Tarczy-Hornoch
Journal:  J Biomed Inform       Date:  2022-02-16       Impact factor: 8.000

4.  Analysis of SARS-CoV-2 synonymous codon usage evolution throughout the COVID-19 pandemic.

Authors:  Ezequiel G Mogro; Daniela Bottero; Mauricio J Lozano
Journal:  Virology       Date:  2022-02-02       Impact factor: 3.616

5.  A Comprehensive Review of Artificial Intelligence in Prevention and Treatment of COVID-19 Pandemic.

Authors:  Haishuai Wang; Shangru Jia; Zhao Li; Yucong Duan; Guangyu Tao; Ziping Zhao
Journal:  Front Genet       Date:  2022-04-26       Impact factor: 4.772

6.  Human genes with relative synonymous codon usage analogous to that of polyomaviruses are involved in the mechanism of polyomavirus nephropathy.

Authors:  Yu Fan; Duan Guo; Shangping Zhao; Qiang Wei; Yi Li; Tao Lin
Journal:  Front Cell Infect Microbiol       Date:  2022-09-08       Impact factor: 6.073

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.