Literature DB >> 34004362

Analyzing host-viral interactome of SARS-CoV-2 for identifying vulnerable host proteins during COVID-19 pathogenesis.

Jayanta Kumar Das1, Swarup Roy2, Pietro Hiram Guzzi3.   

Abstract

The development of therapeutic targets for COVID-19 relies on understanding the molecular mechanism of pathogenesis. Identifying genes or proteins involved in the infection mechanism is the key to shedding light on the complex molecular mechanisms. The combined effort of many laboratories distributed throughout the world has produced protein and genetic interactions. We integrated available results and obtained a host protein-protein interaction network composed of 1432 human proteins. Next, we performed network centrality analysis to identify critical proteins in the derived network. Finally, we performed a functional enrichment analysis of central proteins. We observed that the identified proteins are primarily associated with several crucial pathways, including cellular process, signaling transduction, neurodegenerative diseases. We focused on the proteins that are involved in human respiratory tract diseases. We highlighted many potential therapeutic targets, including RBX1, HSPA5, ITCH, RAB7A, RAB5A, RAB8A, PSMC5, CAPZB, CANX, IGF2R, and HSPA1A, which are central and also associated with multiple diseases.
Copyright © 2021 Elsevier B.V. All rights reserved.

Entities:  

Keywords:  COVID-19; Centrality; Disease; Pathways; Protein-protein interaction; SARS-CoV-2

Mesh:

Substances:

Year:  2021        PMID: 34004362      PMCID: PMC8123524          DOI: 10.1016/j.meegid.2021.104921

Source DB:  PubMed          Journal:  Infect Genet Evol        ISSN: 1567-1348            Impact factor:   3.342


Introduction

The world is experiencing an unprecedented pandemic due to a massive outbreak of Severe Acute Respiratory Syndrome Corona Virus 2 (SARS-CoV-2) infected viral disease, COVID-19. SARS-CoV-2 is a large enveloped coronavirus (family-Coronaviridae, subfamily-Coronavirinae) with non-segmented, single-stranded, and positive-sense RNA genomes (Wrapp et al., 2020), transmits rapidly through human to human contacts. Although SARS-CoV-2 is similar to other known coronaviruses, i.e., SARS-CoV or MERS-CoV (Perlman and Netland, 2009; De Groot et al., 2013; Weber et al., 2021), however it shows high rate of infection (Liu et al., 2020; Surveillances, 2020; Milano and Cannataro, 2020). Hence, there is a need to understand the disease pathogenesis of SARS-CoV-2 to develop effective therapies and vaccines. The SARS-CoV-2 virus causes damage in multiple organs as the disease progresses from an asymptomatic phase to a life-threatening disease (Servick, 2020; Cho et al., 2013). Therefore, accurate molecular diagnosis of COVID-19 disease is essential by collecting the proper respiratory tract specimen (Whetton et al., 2020; Ortuso et al., 2021). In this context, the integrated analysis (Antonelli et al., 2019) of various data-sets, including clinical and imaging data, may explain, and hopefully predict, the longitudinal effects of SARS-CoV-2 infection (Tang et al., 2020; Kumar Das et al., 2021). In particular, many independent projects, throughout the world have studied in genomics and proteomics levels (Kumar Das et al., 2021), and then they integrated these outcomes with clinical case studies. These works have produced data about the effect of the infection at a molecular scale, evidencing genes and proteins' role, such as the interactions among viral and human proteins. Interactions between a host and its pathogen are primarily driven by interactions among the host proteins and pathogen proteins, referred to as host-pathogen protein-protein interaction (PPI) network. The SARS-CoV-2 virus-host interactome have been studied computationally, focusing various virulence factors that are influencing SARS-CoV-2 pathogenesis and interacting mechanism (Guzzi et al., 2020; Hoffmann et al., 2021; Messina et al., 2020; Li et al., 2021). Further, many recent works also used host-viral protein-protein interaction network as an input to elucidate potential drug targets or repurposed drug molecules (Beck et al., 2020, Das et al., 2021, Gordon et al., 2020, Zhou et al., 2020). Host-pathogen protein interactions provide important insights into the molecular mechanisms of pathogenecity (Memišević et al., 2015) and for understanding virulence factors influencing SARS-CoV-2 pathogenesis (Li et al., 2021; Thiel et al., 2003). Protein-Protein Interactions (PPI) are usually modeled and analyzed with graph theory (Guzzi and Roy, 2020; Roy et al., 2019). In this formalism, the interactions are modeled as a graph whose nodes are proteins (or genes), and the edges are the interaction among them. Several studies have found that specific candidate proteins might play a crucial role (Li et al., 2013; Ferrari et al., 2018; Galicia et al., 2020; Lim et al., 2011). Protein-protein interaction networks are an essential ingredient for any systems-level understanding of cellular processes and modeling, and even drug discovery (Tucker et al., 2001; Thakur et al., 2015; Athanasios et al., 2017; Chautard et al., 2009; Nietzsche et al., 2016). The key genes/proteins involved in the different biological pathways can give valuable insight for in-depth characterization of disease progression (Lan et al., 2015; Safari-Alighiarloo et al., 2014; Wang et al., 2018; Jha et al., 2020). It is well accepted that all the viruses have evolved to target proteins that are central and have strong control over the human interactome (Jeong et al., 2001; Bösl et al., 2019; Albert et al., 2000; Navratil et al., 2011; Halehalli and Nagarajaram, 2015). Exploring the predicted interaction networks can suggest new directions for future experimental research and provide cross-species predictions for efficient interaction mapping (Xu and Li, 2006; Safari-Alighiarloo et al., 2014). The complete work-flow design of the current study. This study aimed to identify essential human host proteins based on topology analysis of the PPI of the host proteins targeted by SARS-CoV-2 proteins. We integrated available experimentally validated host-viral PPIs and obtained a set of host proteins that were participated in the integrated network. Network centrality analysis of those candidate host proteins were performed based on their interactions within host-host PPI. We performed enrichment analysis of the central host proteins to shed light on their significance in cellular, signaling, and disease pathways. The enriched host proteins, derived from above step were further analysized for their possible roles in any disease pathogenesis. Finally, a set of host proteins that having role in COVID-19 related diseases were reported as vulnerable proteins. The complete workflow of the current study is illustrated in Fig. 1.
Fig. 1

The complete work-flow design of the current study.

Materials and method

SARS-CoV-2 interacting (human) host proteins

We used recently reported host proteins that were physically verified using Affinity purification mass spectrometry (AP-MS) for their interactions with SARS-CoV-2 proteins (Gordon et al., 2020; Li et al., 2021; Stukalov et al., 2020; Cannataro and Harrison, 2021). The used host-viral protein interactions are also available in BioGRID1 (Stark et al., 2006). A total of 2489 host-viral interactions (consisting of 1432 unique host proteins interacting with 30 SARS-CoV-2 viral proteins) were obtained. In Fig. 2(a), we provided the number of interacting host proteins for each viral protein. It is to be noted that the majority of the host proteins are targeted by any specific viral protein (Fig. 2(b)).
Fig. 2

The quantitative information of host-viral interactions. (a) The abundance (percentage) of collected interacting human host protein for different SARS-CoV-2 viral proteins; (b) A host-viral interaction network pattern, where few host proteins interactions can be seen associated with multiple viral proteins although most of interactions are virus protein specific unique (more details can be seen from Supplementary-A).

The quantitative information of host-viral interactions. (a) The abundance (percentage) of collected interacting human host protein for different SARS-CoV-2 viral proteins; (b) A host-viral interaction network pattern, where few host proteins interactions can be seen associated with multiple viral proteins although most of interactions are virus protein specific unique (more details can be seen from Supplementary-A).

Reconstruction of host PPI network

Starting with the host proteins that are interacting with the virus proteins, we reconstructed the host PPI sub-network by querying the Search Tool for the Retrieval of Interacting Genes/Proteins (STRING, Version 10.02 ) (Szklarczyk et al., 2010). The host PPI sub-network is then visualized using Cytoscape,3 a general platform for complex network analysis and visualization (Shannon et al., 2003). We also used Cytoscape to obtain giant components of the host PPI network and calculated various network centrality scores.

Centrality analysis of host PPI sub-network

In network analysis, indicators of centrality identify the most critical nodes in the network (Bonacich, 1987). The centrality measure are used to characterize each node and edge in the PPI network. The degree measure is the most intuitive for topology analysis of the PPI network. Several other crucial factors that can influence network links are betweenness centrality, closeness centrality, clustering coefficient, topological coefficient, and neighborhood connectivity. where σ is the total number of shortest paths from node s to node t and σ (n) is the number of those paths that pass through n. where L(m,  n) is the length of shortest path between node n and m, and m denotes any other nodes that are reachable to node n. where k is the number of neighbors of n and e is the number of connected pairs between all neighbors of n. where J(n,  m) is defined for all nodes m that share at least one neighbor with n, and the value J(n,  m) is the number of neighbors shared between the nodes n and m, plus one if there is a direct link between n and m. Degree centrality(D: The degree centrality (simply degree) of a node n in a network is the number of direct neighbours of n. The densely connected nodes in a PPI network are considered as hub nodes (Han et al., 2004). Betweenness centrality(B: Betweenness centrality quantifies the number of times a node acts as a bridge along the shortest path between two other nodes (Yoon et al., 2006). The betweenness centrality of a node n is defined as: Closeness centrality(C: Closeness centrality is a way of detecting nodes that are able to spread information very efficiently through the network (Newman, 2005). It can be calculated as: Average shortest-path length(S: The shortest-path length between two nodes (say n and m) is defined as the number of minimum steps required reach node n from m!(Mao and Zhang, 2013). The average shortest path length of node n is the average value of all pair of nodes shortest path from the node n. Clustering coefficient(C: Clustering coefficient is a measure of the degree to which nodes in a graph tend to cluster together (Barabasi and Oltvai, 2004). In undirected networks, the clustering coefficient of a node n is defined as: Topological coefficient(T: Topological coefficient is a relative measure for the extent to which a node shares neighbors with other nodes (Goldberg and Roth, 2003). The topological coefficient T of a node n with k neighbors is computed as follows: Neighborhood connectivity(N: Neighborhood connectivity of a node n is defined as the average connectivity of all neighbors of n (Maslov and Sneppen, 2002). The neighborhood connectivity distribution gives the average of the neighborhood connectivities of all nodes n with k neighbors for k = 0, 1, ⋯n-1. We used NetworkAnalyzer (Shannon et al., 2003) to calculate the above centrality scores. NetworkAnalyzer calculates C (Closeness centrality) as the reciprocal of the average shortest path length. So, high C means highly central, and thus low S .

Gene ontology and pathway enrichment analysis

We performed enrichment analysis to determine the set of significant genes/proteins in different functional and biological pathways. We used KEGG (Kyoto Encyclopedia of Genes and Genomes) (Kanehisa and Goto, 2000) for elucidating pathway enrichment of a host protein and Gene Ontology (GO) for the assessment of protein functions (Ashburner et al., 2000). In this study, we performed GO and KEGG analysis using Enrichr software,4 a web-based suite of gene list enrichment analysis tools (Kuleshov et al., 2016).

Gene-disease association network

Complex diseases are caused by a group of genes known as disease genes. More often, a gene can participate in various disease conditions (Goh et al., 2007; W. T. C. C. Consortium, 2007). It helps unravel the disease pathogenesis, which helps in disease diagnosis, treatment, and disease prevention. We obtained gene-disease association network from DisGeNET (v7.0) database,5 which contains 1,134,942 gene-disease associations (GDAs), between 21,671 genes and 30,170 disease (Piñero et al., 2020). From this database, we considered curated gene-disease associations only.

Results and discussion

Here, we report the outcomes of intermediate steps to reach our objective of isolating key host proteins followed by their significance analysis.

Deriving PPI network for candidate host proteins

Our list of host proteins consists of 1432 distinct proteins that are targeted by SARS-CoV-2 during COVID-19. We rebuilt the PPI network centered around our candidate proteins using STRING DB. There are 7076 edges in the derived PPI network. We derived PPI by keeping only the interactions whose confidence scores were at least 0.7 (high confidence). The derived PPI network was then analyzed using Cytoscape. We identified the big connected component (also called gain/main component) of the PPI network. After discarding all disconnected components in the PPI network, we considered the giant component of the PPI network with 1111 nodes (approximately 78% of total candidate proteins) and 7043 edges (Fig. 3 ).
Fig. 3

The gain (connected) component of PPI network obtained from whole PPI network. The network is consisting of 1111 unique nodes (proteins) and 7043 interactions edges (more details can be seen from Supplementary-A).

The gain (connected) component of PPI network obtained from whole PPI network. The network is consisting of 1111 unique nodes (proteins) and 7043 interactions edges (more details can be seen from Supplementary-A).

Network topology analysis of gain component

We performed the topological analysis of the gain component using NetworkAnalyzer (Shannon et al., 2003). The degree distribution of all the candidate proteins in the gain component showed that the majority of the proteins in the gain component exhibit a higher degree of connectivity (Fig. 4 ). Few proteins with degree (shown within parentheses) more than 50 are CDK1(73), PPP2R1A(65), NOP56(60), POLR2B(60), RAB1A(59), RBX1(58), SKIV2L2(57), NAPA(57), RPS14(56), STX5(54), TGOLN2(54), TCEB1(53), DCTN2(53), TCEB2(52), HSPA9(51), and GNB2L1(50).
Fig. 4

The degree distribution of all 1111 nodes (proteins) in the gain component of PPI network. The X-axis indicates degree distribution, whereas Y-axis shows relative frequency distributions.

The degree distribution of all 1111 nodes (proteins) in the gain component of PPI network. The X-axis indicates degree distribution, whereas Y-axis shows relative frequency distributions. The histogram analysis of all the centrality measures (discussed in Section 2.3) showing non-random distribution (Supplementary-B). Out of seven centrality mesures we considered only those measures which were aligning in their decissions. To select the candidate centrality measures, we computed Pearson's correlation among all centrality scores (Table 1 ). The correlation between degree centrality (D ) and closeness centrality (C ) observed to be the highest (r = 0.759) in comparison to other measures. Although, we observed correlation between D and neighbourhood centrality (N ) is the third-highest (r = 0.557), but N and B showed less correlative (r = 0.1101). Overall, we observed that the correlation scores among the three centrality measures (D , B , C ) are quite similar. Therefore, we selected them in subsequent analysis. We identified 373 proteins in these criteria, which are considered highly central proteins (above the median score for all three selected parameters). When we considered all measures, we found only six common proteins (GEMIN4, DDX20, GOLGA3, FKBP15, PMPCA, and AK4) above the median score in each category of centrality measurement, which was the reason for selecting three centrality measures for our downstream analysis.
Table 1

Correlation analysis among all centrality parameters computed for 1111 proteins. The correlation score (>0.5) among the three centrality measures (Dc,Bc,Cc) are similar (highlighted in bold).

DcBcCcoefTcNc
Bc0.6031
Ccoef0.209−0.1681
Tc−0.32−0.2830.4511
Nc0.5570.11010.346−0.1391
Cc0.7590.51460.213−0.3290.684
Correlation analysis among all centrality parameters computed for 1111 proteins. The correlation score (>0.5) among the three centrality measures (Dc,Bc,Cc) are similar (highlighted in bold).

Pathway and gene ontology enrichment analysis of highly central proteins

We performed KEGG pathway analysis of selected 373 highly central proteins. We obtained a total of 84 enriched KEGG pathways within the significant level (adj − p < .05). The enriched pathways are involved in Cellular Processes (9), Environmental Information Processing (9), Genetic Information Processing (13), Human Disease (31), Organismal Systems (15), and Metabolism (7). The top seven pathways in each category were shown in Fig. 5 .
Fig. 5

The top 7 enriched pathways from each category of KEGG pathways. In each category, pathways are shown ordered by − log 10(p) value.

The top 7 enriched pathways from each category of KEGG pathways. In each category, pathways are shown ordered by − log 10(p) value. Our current study mainly focused on the proteins that are involved in cellular process, signaling transduction, and human disease (viral and neurodegenerative) pathways. The most affected thirty-one (31) pathways were considered in the context of COVID-19 disease (Seif et al., 2020; Ganesan et al., 2019; Luo et al., 2020; Grimes and Grimes, 2020). There are nine enriched pathways in cellular process (Endocytosis, Phagosome, Adherens junction, Tight junction, Cell cycle, Cellular senescence, Focal adhesion, Regulation of actin cytoskeleton, Lysosome), nine enriched pathways in Environmental Information Processing-signaling transduction (Ras signaling pathway, HIF-1 signaling pathway, Hippo signaling pathway, Apelin signaling pathway, MAPK signaling pathway, TGF-beta signaling pathway, AMPK signaling pathway, NF-kappa B signaling pathway), nine enriched pathways from human viral disease sub-category (Human immunodeficiency virus-1 infection, Human papillomavirus infection, Human cytomegalovirus infection, Hepatitis B, Human T-cell leukemia virus-1 infection, Influenza A, Hepatitis C, Measles) and four enriched pathways from neurodegenerative disease with sub-category (Huntington disease, Parkinson's disease, Alzheimer's disease, Prion diseases). A total of 141 distinct proteins (out of 373) were obtained from these pathways, which were then ranked based on their presence in selected enriched pathways. We found that 79 proteins were associated with our candidate pathways. All these proteins were then further studied for disease-gene association in the next. We also performed gene set enrichment analysis (Gene ontology). It is observed that the selected genes are primarily involved in Biological processes (Supplementary-C). The top ten terms in each category of gene ontology (Biological processes (BF), Molecular Function (MF), and Cellular Component (CC)) were shown in Fig. 6 that includes neutrophil-mediated immunity (GO:0002446), neutrophil activation involved in immune response (GO:0002283) and viral process (GO:0016032) from BP category; dolichyl-diphosphooligosaccharide-protein glycotransferase activity (GO:0004579), GDP binding (GO:0019003), cadherin binding (GO:0045296) and ATPase activity (GO:0016887) from MF category; and focal adhesion (GO:0005925) from CC category.
Fig. 6

The top 10 enriched terms in each category of gene ontology (BP-Biological process, MF-Molecular function, CC-Cellular component). In each category, terms are shown ordered by log10(combined score) value.

The top 10 enriched terms in each category of gene ontology (BP-Biological process, MF-Molecular function, CC-Cellular component). In each category, terms are shown ordered by log10(combined score) value.

Analysis of disease-gene associations

The shortlisted 141 proteins involved in four significant pathways (cellular process, signaling transduction, viral and neurodegenerative)were further screened by looking into their association with COVID-19 related diseases. We particularly focused on three highly influential diseases during COVID-19, namely cardiovascular, respiratory tract (Wu et al., 2020; Clerkin et al., 2020; Konturek et al., 2020; Cannataro et al., 2005) and immune system disease (Melenotte et al., 2020; Chowdhury et al., 2020). To obtain disease-gene association, we used DisGeNET database (Piñero et al., 2020) and selected CURETED source only. We found a total of 64 proteins (out of 141) playing roles in various COVID-19 related diseases such as Asthma, Pneumonitis, Pneumonia, Influenza, Lung diseases, Cardiomyopathies, Coronary, Arteriosclerosis, Coronary Artery Disease, Heart failure, HIV Infections, and others.(Supplementary-D). We compared proteins involved in all three disease categories and individual diseases in each category (Fig. 7 ). A total of 119, 37, and 48 unique diseases (total count-204) and 44, 17, and 24 distinct proteins are associated with the Cardiovascular, Respiratory, and Immune system disease category. Interestingly, we found few proteins that are associated with all three disease categories (AREG, CAV1, IFIH1, PARP1, PLAU, TGFB1, ATM, B2M, DDX58, ENO1, HSPA5, PRKDC, STAT6, TGFBR1, and TGFBR2). The top few proteins which are associated with ten or more diseases are PLAU(59), TGFB1(29), CAV1(17), PARP1(17), TGFBR2(13), ATP2A2(11), AREG(10), FASN(10), IFIH1(10), and ITGB1(10). Finally, we listed a total of 64 unique genes/proteins and their various quantitative scores (degree, disease count (out of 204), disease type count (out of 3), and pathway count (out of 31)) in Table 2 .
Fig. 7

Comparison of three groups of disease categories (Cardiovascular, Respiratory, Immune system) using venn-diagram. (a) based on number of proteins count in each category; (b) based on number of disease associated proteins (curated from database) among the observed proteins in each category.

Table 2

The shortlisted 64 host proteins with their degree in host-host PPI , disease count (out of 204), disease type count (out of 3), and pathway count (out of 31). Some of the proteins that are known to be targeted by other viruses were also highlighted.

Gene/ProteinDegree in PPI networkPathway countDisease categoryDisease countKnown target virus
ADAM17102Immune1
ALDOA261Cardiovascular1
AP3B1171Respiratory2
AREG202Respiratory, Cardiovascular, Immune10
ATM296Cardiovascular, Immune8
ATP2A2101Cardiovascular11
ATP6195Cardiovascular1Human SARS coronavirus, Bovine papillomavirus type 1, Human papillomavirus type 16
ATR175Respiratory2Human adenovirus 5
B2M254Cardiovascular, Immune4Hepatitis C virus genotype 1b (isolate Con1)
CANX312Cardiovascular1
CAPZB341Cardiovascular1
CAV1222Respiratory, Cardiovascular, Immune17Poliovirus type 1 (strain Sabin)
CD44191Immune1
COX293Cardiovascular1
CRKL115Cardiovascular4
DDX58106Respiratory, Cardiovascular2
ENO1131Cardiovascular, Immune2
EPHA292Cardiovascular5
FASN91Cardiovascular10
GAPDH232Cardiovascular1Hepatitis C virus genotype 1b (isolate Con1), Epstein-Barr virus (strain GD1)
GLA161Cardiovascular5
GNAQ145Cardiovascular1
GUSB161Immune1
HDAC2125Respiratory2Human herpesvirus 1 (strain 17), Human papillomavirus type 16, Human papillomavirus type 31
HLA-A158Immune5Epstein-Barr virus (strain GD1), Human papillomavirus type 16
HLA-C148Immune7
HMGCR111Immune4
HSPA1A305Cardiovascular1Epstein-Barr virus (strain GD1)
HSPA5461Respiratory, Cardiovascular3Epstein-Barr virus (strain GD1)
IFIH1103Respiratory, Cardiovascular, Immune10Sendai virus (strain Fushimi)
IGF2R312Respiratory1
ITCH461Immune1Epstein-Barr virus (strain B95-8)
ITGA6103Immune1
ITGB1295Cardiovascular10Hepatitis C virus genotype 1b (isolate Con1)
JAK2222Cardiovascular6
LDHA141Cardiovascular4
LDLR202Cardiovascular4
MET184Respiratory1
NDUFS2203Cardiovascular3
NF1162Cardiovascular3
NOTCH1233Cardiovascular4Hepatitis C virus genotype 1b (isolate Con1)
NOTCH2112Cardiovascular1
NOTCH3153Cardiovascular2
PARP1141Respiratory, Cardiovascular, Immune17Human herpesvirus 1 (strain 17)
PCNA253Immune1Human herpesvirus 1 (strain 17)
PDIA3143Cardiovascular1
PLAU241Respiratory, Cardiovascular, Immune59
PPP1CB144Cardiovascular2
PRKDC151Respiratory, Immune3Human herpesvirus 1 (strain 17)
PSMC5341Immune7Human adenovirus 5, Human adenovirus 12, Simian virus 40
PSMD6241Immune2
PTPN11221Cardiovascular6
RAB5A403Cardiovascular1
RAB7A412Cardiovascular1
RAB8A403Immune1
RBX1584Immune5
SERPINE1164Cardiovascular8
SLC9A1122Cardiovascular7
SORT1141Cardiovascular3
STAT6111Respiratory, Immune4
TGFB1297Respiratory, Cardiovascular, Immune29Hepatitis C virus genotype 1b (isolate Con1)
TGFBR1179Respiratory, Cardiovascular6
TGFBR2168Respiratory, Cardiovascular13
XPO1252Cardiovascular2
Comparison of three groups of disease categories (Cardiovascular, Respiratory, Immune system) using venn-diagram. (a) based on number of proteins count in each category; (b) based on number of disease associated proteins (curated from database) among the observed proteins in each category. The shortlisted 64 host proteins with their degree in host-host PPI , disease count (out of 204), disease type count (out of 3), and pathway count (out of 31). Some of the proteins that are known to be targeted by other viruses were also highlighted.

Viral proteins targeting key host proteins

We next looked into the source viral-host PPI network (refer Fig. 2) to identify the viral proteins that were targeting shortlisted 64 disease-associated proteins. We found 25 SARS-CoV-2 proteins were interacting 64 key host proteins (Fig. 8 ).Among 25 SARS-CoV-2 viral proteins, eight are accessory proteins (Orf3a, Orf7b, Orf6, Orf7a, Orf7b, Orf8, Orf9b, Orf10), four structural proteins (E,M,N,S) and thirteen non-structural poly-proteins (nsp1, nsp10, nsp12, nsp13, nsp14, nsp2, nsp3, nsp4, nsp5, nsp6, nsp7, nsp8, nsp9). It was observed that several host proteins were interacting with a single viral protein. Very few host proteins were interacting with more than one viral protein. The viral protein, Orf7b interacts with a maximum number of target host proteins, followed by Orf3a and M protein. Further, five host proteins were found to interact with both Orf3a and Orf7b.
Fig. 8

The interaction network represents the most influential host protein and viral protein. The network is consisting of sixty-four (64) host proteins interacting with twenty-five (25) SARS-CoV-2 viral proteins. The yellow colour node represents the viral proteins in the network, whereas the green one represents the host proteins.

The interaction network represents the most influential host protein and viral protein. The network is consisting of sixty-four (64) host proteins interacting with twenty-five (25) SARS-CoV-2 viral proteins. The yellow colour node represents the viral proteins in the network, whereas the green one represents the host proteins. We looked further for any other viruses that are targeting our 64 host proteins. We mined VirusMINT (Chatr-Aryamontri et al., 2009), a virus-host association database, to find the other related viral diseases. We found that the majority of the highlighted host proteins were also targeted by Hepatitis C virus genotype 1b, Poliovirus Type 1, Human herpesvirus 1, Human papillomavirus type 16 & 31, Simian virus 40, Sendai virus, Human adenovirus 5 & 12, Epstein-Barr virus, Human SARS coronavirus. Bovine papillomavirus type 1, and Epstein-Barr virus (Table 2). Further, a few genes (AREG, PLAU, MET, NF1, JAK2, SERPINE1) were observed to be improtant drug targets and associated with various signaling pathways as reported in recent research (Das et al., 2021). These proteins might be highly essential and need to put utmost importance on developing host-directed antiviral therapies for COVID-19.

Conclusion

In this study, we analyzed the human (host) protein-protein interaction network targeted by SARS-CoV-2 . Using topology and pathway enrichment analysis of important host PPI network , we reported 64 potential key SARS-CoV-2 interacting host proteins associated with disease pathways, like respiratory, cardiovascular, and immune system diseases. Among them, we highlighted a set of highly central host proteins, such as RBX1, HSPA5, ITCH, RAB7A, RAB5A, RAB8A, PSMC5, CAPZB, CANX, IGF2R, and HSPA1A, which might influence the whole PPI network. Some of the key host proteins are known to target different viruses and may be highly important for therapeutic solution for controlling COVID-19 severeness. We strongly believe that the highlighted key proteins are promising drug targets, which might play a crucial role in inhibiting COVID-19 progression.

Author contributions

Jayanta Kumar Das and Swarup Roy conceived and designed the study. Swarup Roy and Pietro H. Guzzi supervised the study. Jayanta Kumar Das performed the computational analysis and wrote the original draft, which is finally edited by Swarup Roy and Pietro H. Guzzi. All authors approved the final version of the draft.

Declaration of Competing Interest

We would like to declare that we do not have any conflicts of interest in connection with the submitted work.
  64 in total

1.  Assessing experimentally derived interactions in a small world.

Authors:  Debra S Goldberg; Frederick P Roth
Journal:  Proc Natl Acad Sci U S A       Date:  2003-04-03       Impact factor: 11.205

Review 2.  Interaction networks: from protein functions to drug discovery. A review.

Authors:  E Chautard; N Thierry-Mieg; S Ricard-Blum
Journal:  Pathol Biol (Paris)       Date:  2008-12-13

Review 3.  Protein-Protein Interaction (PPI) Network: Recent Advances in Drug Discovery.

Authors:  Alexiou Athanasios; Vairaktarakis Charalampos; Tsiamis Vasileios; Ghulam Md Ashraf
Journal:  Curr Drug Metab       Date:  2017       Impact factor: 3.731

Review 4.  A review on protein-protein interaction network of APE1/Ref-1 and its associated biological functions.

Authors:  S Thakur; M Dhiman; G Tell; A K Mantha
Journal:  Cell Biochem Funct       Date:  2015-03-19       Impact factor: 3.685

5.  Middle East respiratory syndrome coronavirus (MERS-CoV): announcement of the Coronavirus Study Group.

Authors:  Raoul J de Groot; Susan C Baker; Ralph S Baric; Caroline S Brown; Christian Drosten; Luis Enjuanes; Ron A M Fouchier; Monica Galiano; Alexander E Gorbalenya; Ziad A Memish; Stanley Perlman; Leo L M Poon; Eric J Snijder; Gwen M Stephens; Patrick C Y Woo; Ali M Zaki; Maria Zambon; John Ziebuhr
Journal:  J Virol       Date:  2013-05-15       Impact factor: 5.103

6.  Statistical and Network-Based Analysis of Italian COVID-19 Data: Communities Detection and Temporal Evolution.

Authors:  Marianna Milano; Mario Cannataro
Journal:  Int J Environ Res Public Health       Date:  2020-06-12       Impact factor: 3.390

7.  M-Finder: Uncovering functionally associated proteins from interactome data integrated with GO annotations.

Authors:  Young-Rae Cho; Marco Mina; Yanxin Lu; Nayoung Kwon; Pietro H Guzzi
Journal:  Proteome Sci       Date:  2013-11-07       Impact factor: 2.480

8.  Cryo-EM structure of the 2019-nCoV spike in the prefusion conformation.

Authors:  Daniel Wrapp; Nianshuang Wang; Kizzmekia S Corbett; Jory A Goldsmith; Ching-Lin Hsieh; Olubukola Abiona; Barney S Graham; Jason S McLellan
Journal:  Science       Date:  2020-02-19       Impact factor: 47.728

9.  A new coronavirus associated with human respiratory disease in China.

Authors:  Fan Wu; Su Zhao; Bin Yu; Yan-Mei Chen; Wen Wang; Zhi-Gang Song; Yi Hu; Zhao-Wu Tao; Jun-Hua Tian; Yuan-Yuan Pei; Ming-Li Yuan; Yu-Ling Zhang; Fa-Hui Dai; Yi Liu; Qi-Min Wang; Jiao-Jiao Zheng; Lin Xu; Edward C Holmes; Yong-Zhen Zhang
Journal:  Nature       Date:  2020-02-03       Impact factor: 49.962

10.  A SARS-CoV-2 protein interaction map reveals targets for drug repurposing.

Authors:  David E Gordon; Gwendolyn M Jang; Mehdi Bouhaddou; Jiewei Xu; Kirsten Obernier; Kris M White; Matthew J O'Meara; Veronica V Rezelj; Jeffrey Z Guo; Danielle L Swaney; Tia A Tummino; Ruth Hüttenhain; Robyn M Kaake; Alicia L Richards; Beril Tutuncuoglu; Helene Foussard; Jyoti Batra; Kelsey Haas; Maya Modak; Minkyu Kim; Paige Haas; Benjamin J Polacco; Hannes Braberg; Jacqueline M Fabius; Manon Eckhardt; Margaret Soucheray; Melanie J Bennett; Merve Cakir; Michael J McGregor; Qiongyu Li; Bjoern Meyer; Ferdinand Roesch; Thomas Vallet; Alice Mac Kain; Lisa Miorin; Elena Moreno; Zun Zar Chi Naing; Yuan Zhou; Shiming Peng; Ying Shi; Ziyang Zhang; Wenqi Shen; Ilsa T Kirby; James E Melnyk; John S Chorba; Kevin Lou; Shizhong A Dai; Inigo Barrio-Hernandez; Danish Memon; Claudia Hernandez-Armenta; Jiankun Lyu; Christopher J P Mathy; Tina Perica; Kala Bharath Pilla; Sai J Ganesan; Daniel J Saltzberg; Ramachandran Rakesh; Xi Liu; Sara B Rosenthal; Lorenzo Calviello; Srivats Venkataramanan; Jose Liboy-Lugo; Yizhu Lin; Xi-Ping Huang; YongFeng Liu; Stephanie A Wankowicz; Markus Bohn; Maliheh Safari; Fatima S Ugur; Cassandra Koh; Nastaran Sadat Savar; Quang Dinh Tran; Djoshkun Shengjuler; Sabrina J Fletcher; Michael C O'Neal; Yiming Cai; Jason C J Chang; David J Broadhurst; Saker Klippsten; Phillip P Sharp; Nicole A Wenzell; Duygu Kuzuoglu-Ozturk; Hao-Yuan Wang; Raphael Trenker; Janet M Young; Devin A Cavero; Joseph Hiatt; Theodore L Roth; Ujjwal Rathore; Advait Subramanian; Julia Noack; Mathieu Hubert; Robert M Stroud; Alan D Frankel; Oren S Rosenberg; Kliment A Verba; David A Agard; Melanie Ott; Michael Emerman; Natalia Jura; Mark von Zastrow; Eric Verdin; Alan Ashworth; Olivier Schwartz; Christophe d'Enfert; Shaeri Mukherjee; Matt Jacobson; Harmit S Malik; Danica G Fujimori; Trey Ideker; Charles S Craik; Stephen N Floor; James S Fraser; John D Gross; Andrej Sali; Bryan L Roth; Davide Ruggero; Jack Taunton; Tanja Kortemme; Pedro Beltrao; Marco Vignuzzi; Adolfo García-Sastre; Kevan M Shokat; Brian K Shoichet; Nevan J Krogan
Journal:  Nature       Date:  2020-04-30       Impact factor: 69.504

View more
  5 in total

Review 1.  Methodology-Centered Review of Molecular Modeling, Simulation, and Prediction of SARS-CoV-2.

Authors:  Kaifu Gao; Rui Wang; Jiahui Chen; Limei Cheng; Jaclyn Frishcosy; Yuta Huzumi; Yuchi Qiu; Tom Schluckbier; Xiaoqi Wei; Guo-Wei Wei
Journal:  Chem Rev       Date:  2022-05-20       Impact factor: 72.087

Review 2.  Network for network concept offers new insights into host- SARS-CoV-2 protein interactions and potential novel targets for developing antiviral drugs.

Authors:  Neda Eskandarzade; Abozar Ghorbani; Samira Samarfard; Jose Diaz; Pietro H Guzzi; Niloofar Fariborzi; Ahmad Tahmasebi; Keramatollah Izadpanah
Journal:  Comput Biol Med       Date:  2022-04-30       Impact factor: 6.698

3.  Exploiting the molecular basis of age and gender differences in outcomes of SARS-CoV-2 infections.

Authors:  Daniele Mercatelli; Elisabetta Pedace; Pierangelo Veltri; Federico M Giorgi; Pietro Hiram Guzzi
Journal:  Comput Struct Biotechnol J       Date:  2021-07-10       Impact factor: 7.271

4.  Differential Co-Expression Network Analysis Reveals Key Hub-High Traffic Genes as Potential Therapeutic Targets for COVID-19 Pandemic.

Authors:  Aliakbar Hasankhani; Abolfazl Bahrami; Negin Sheybani; Behzad Aria; Behzad Hemati; Farhang Fatehi; Hamid Ghaem Maghami Farahani; Ghazaleh Javanmard; Mahsa Rezaee; John P Kastelic; Herman W Barkema
Journal:  Front Immunol       Date:  2021-12-15       Impact factor: 7.561

5.  Identifying Potential Mitochondrial Proteome Signatures Associated with the Pathogenesis of Pulmonary Arterial Hypertension in the Rat Model.

Authors:  Jie Wang; Md Nazim Uddin; Qian Li; Alidan Aierken; Ming-Yuan Li; Rui Wang; Qian-Zhi Yan; Dilare Adi; Ming-Tao Gai; Yun Wu
Journal:  Oxid Med Cell Longev       Date:  2022-02-21       Impact factor: 6.543

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.