Literature DB >> 28097092

Comparative interactomics for virus-human protein-protein interactions: DNA viruses versus RNA viruses.

Saliha Durmuş1, Kutlu Ö Ülgen2.   

Abstract

Viruses are obligatory intracellular pathogens and completely depend on their hosts for survival and reproduction. The strategies adopted by viruses to exploit host cell processes and to evade host immune systems during infections may differ largely with the type of the viral genetic material. An improved understanding of these viral infection mechanisms is only possible through a better understanding of the pathogen-host interactions (PHIs) that enable viruses to enter into the host cells and manipulate the cellular mechanisms to their own advantage. Experimentally-verified protein-protein interaction (PPI) data of pathogen-host systems only became available at large scale within the last decade. In this study, we comparatively analyzed the current PHI networks belonging to DNA and RNA viruses and their human host, to get insights into the infection strategies used by these viral groups. We investigated the functional properties of human proteins in the PHI networks, to observe and compare the attack strategies of DNA and RNA viruses. We observed that DNA viruses are able to attack both human cellular and metabolic processes simultaneously during infections. On the other hand, RNA viruses preferentially interact with human proteins functioning in specific cellular processes as well as in intracellular transport and localization within the cell. Observing virus-targeted human proteins, we propose heterogeneous nuclear ribonucleoproteins and transporter proteins as potential antiviral therapeutic targets. The observed common and specific infection mechanisms in terms of viral strategies to attack human proteins may provide crucial information for further design of broad and specific next-generation antiviral therapeutics.

Entities:  

Keywords:  DNA virus; RNA virus; comparative interactomics; next‐generation antiviral therapeutics; pathogen–host interaction; viral infection strategy

Year:  2017        PMID: 28097092      PMCID: PMC5221455          DOI: 10.1002/2211-5463.12167

Source DB:  PubMed          Journal:  FEBS Open Bio        ISSN: 2211-5463            Impact factor:   2.693


deoxyribonucleic acid double‐stranded DNA double‐stranded RNA Epstein–Barr virus gene ontology heterogeneous nuclear ribonucleoprotein Kyoto Encyclopedia of Genes and Genomes KEGG Orthology Based Annotation System pathogen–host interaction pathogen–host interaction search tool protein–protein interaction ribonucleic acid single‐stranded DNA negative‐sense single‐stranded RNA positive‐sense single‐stranded RNA single‐stranded RNA Viral infections pose ever‐increasing danger to the human beings because of emerging and reemerging diseases. Their high mutation rates enable viruses to easily develop drug resistance towards the conventional therapeutics, which mainly inhibit essential viral proteins. This makes the problem more serious, since most of the current antiviral drugs are hardly effective to the resistant virus strains. Therefore, the efforts on the next‐generation antiviral drug discovery have been focused on finding host‐oriented drug targets, which act on cellular functions essential in the virus life‐cycle 1, 2. The cellular processes are manipulated and exploited by pathogenic microorganisms through mainly physical interactions between pathogen and host proteins 3. Therefore, viral PHI networks should be investigated thoroughly in terms of the functional properties of virus‐targeted host proteins, in order to identify the cellular proteins and associated functions that are indispensable for viruses to replicate and persist within the host cell during infections. The understanding of the differences and similarities in the infection mechanisms used by different viral groups is crucial for designing broad and virus‐specific antiviral therapeutics 1, 4, 5. Viral families are grouped based on their type of nucleic acid as genetic material, DNA or RNA 6. DNA viruses contain usually double‐stranded DNA (dsDNA) and rarely single‐stranded DNA (ssDNA). These viruses replicate using DNA‐dependent DNA polymerase. RNA viruses have typically ssRNA, but may also contain dsRNA. ssRNA viruses can be further grouped as positive‐sense (ssRNA(+)) or negative‐sense (ssRNA(−)). The genetic material of ssRNA(+) viruses is like mRNA and can be directly translated by the host cell. ssRNA(−) viruses carry RNA that is complementary to mRNA and must be converted to positive‐sense RNA using RNA polymerase before translation. An exception of this group is the Retroviruses, which replicate through DNA intermediates using reverse transcriptase despite having RNA genomes. Owing to their very small genome sizes, viruses have restricted life capabilities and must enter into the host cells to replicate, assemble and propagate. They have evolved to develop strategies to manipulate host cell mechanisms to control their own life cycles and also to disable antiviral responses of host immune systems 4, 7, 8. Compared to DNA virus genomes, which can encode up to hundreds of viral proteins, RNA viruses have smaller genomes that usually encode only a few proteins. Owing to the size and functionality of the resulting proteome, the size and type of viral genetic materials may have great effects on the life styles of viruses within the host cells. DNA viruses have integrated large DNA sequences from the hosts to their genome, throughout the evolution. Consequently, their genomes can encode proteins with eukaryote‐originated complex functional domains and enable DNA viruses to finely exploit the metabolism of infected cells in order to promote their own replication within the cell. On the other hand, RNA virus proteins cannot exhibit such homologies with their eukaryotic counterparts, but still can communicate with host cells through complex networks of PHIs. RNA viruses have probably evolved a different strategy, i.e. they interact with the host proteins using protein‐binding motifs specific to RNA viruses 9. Consequently, it can be stated that, DNA and RNA viruses have developed some distinct infection strategies to cause generally chronic and acute infections, respectively. Despite the availability of detailed models of virus structures, replication machineries, and patho‐physiologies, a more functional analysis of virus–host molecular interactions is required to capture a systems view of viral infection mechanisms in host cells. Considering the preliminary efforts on the high‐throughput experimental studies 4, 10, 11, 12, 13, one can state that the field of virus–human interspecies protein interactions has been developing now. Nevertheless, the aforementioned systems view of viral infection mechanisms through PHIs is still lacking 2, 14. The general focus of computational analysis of PHI data is on the common and specific behaviors of bacterial and viral pathogens during infections, by comparing their protein interactions with human 3, 15. Vidalain and Tangy (2010) reviewed RNA viruses–human protein interaction networks analyzing special infection characteristics of RNA viruses through 830 PHI data. Pichlmair et al. (2012) experimentally found 1681 PHI data between 70 viral proteins and 579 human proteins and then comparatively analyzed this dataset in terms of common and specific infection strategies used by DNA and RNA viruses. Here, we comparatively analyzed the current experimental PHI data belonging to DNA and RNA viruses, covering 19 033 PHIs between 1061 viral proteins and 4943 human proteins. The PHI data were obtained from phisto which is a comprehensive database of pathogen–human PPIs 16. This study presents the first comprehensive comparison between DNA and RNA viruses in terms of their infection strategies, providing an initial systems‐level understanding of viral pathogenesis through PHIs. However the results drawn from this analysis should be interpreted with caution since PHI data for lots of virus families are still scarce.

Materials and methods

phisto: a web‐based tool for retrieval and analysis of PHI networks

Pathogen–host interaction search tool (phisto) was developed by our group due to the lack of a comprehensive PHI database in the Web 16. It stores the up‐to‐date PHI data for all pathogen types for which experimentally‐found PPIs with human are available. phisto also provides integrated bioinformatic tools for visualization of PHI networks and topological/functional analysis of pathogen‐targeted human proteins through its user‐friendly interface (www.phisto.org). Ongoing studies on phisto are for covering experimental PHI data belonging also to other mammalian species as host organism.

Virus–human PHI data

The family‐based virus–human PHIs were downloaded from phisto using the taxonomic filtering functionality of its browse option. Twenty‐eight viral families are covered by the downloaded interspecies interactome data. Twelve families carry DNA and 16 families carry RNA as their genetic material. Retroviruses were excluded from the RNA families since they replicate through reverse transcription. Similarly, Hepadnaviruses were excluded from the DNA viruses. Consequently, PHI data belonging to 11 DNA virus families (Table 1) and 15 RNA virus families (Table 2) were obtained and used throughout the study. Two representative PHI networks, one for a DNA virus and one for an RNA virus, are in Fig. 1. All details of the PHI data for these 26 viral families are given in Data S1 and S2.
Table 1

Contents of DNA viruses–human PHI data

DNA virus family (genetic material)# of species# of strains# of PHIs# of pathogen proteins# of human proteins
Adenoviridae (dsDNA)91330543235
Asfarviridae (dsDNA)12554
Baculoviridae (dsDNA)11111
Circoviridae (ssDNA)12423
Herpesviridae (dsDNA)173548363912288
Myoviridae (dsDNA)23433
Papillomaviridae (dsDNA)10204033911731
Parvoviridae (ssDNA)2260556
Polyomaviridae (dsDNA)5635618261
Poxviridae (dsDNA)101944478343
Siphoviridae (dsDNA)33333
Total6110610 0516403658
Table 2

Contents of RNA viruses–human PHI data

RNA virus family (genetic material)# of species# of strains# of PHIs# of pathogen proteins# of human proteins
Arenaviridae ((−) ssRNA)6617614
Arteriviridae ((+) ssRNA)2268368
Birnaviridae (dsRNA)11111
Bornaviridae ((−) ssRNA)11111
Bunyaviridae ((−) ssRNA)91018211140
Coronaviridae ((+) ssRNA)44341629
Filoviridae ((−) ssRNA)341726151
Flaviviridae ((+) ssRNA)8301704191876
Hepeviridae ((+) ssRNA)12322
Orthomyxoviridae ((−) ssRNA)34956811051623
Paramyxoviridae ((−) ssRNA)142290442650
Picornaviridae ((+) ssRNA)471096
Reoviridae (dsRNA)48631157
Rhabdoviridae ((−) ssRNA)57271122
Togaviridae ((+) ssRNA)451156104
Total6915889824212639
Figure 1

Virus–human PHI networks obtained from phisto. Red nodes are viral proteins and blue nodes are human proteins. (A) Human herpesvirus 4 (Epstein–Barr virus) – human protein–protein interaction network. (B) Influenza A virus strain WSN/1933/TS61 (H1N1) – human protein–protein interaction network.

Contents of DNA viruses–human PHI data Contents of RNA viruses–human PHI data Virus–human PHI networks obtained from phisto. Red nodes are viral proteins and blue nodes are human proteins. (A) Human herpesvirus 4 (Epstein–Barr virus) – human protein–protein interaction network. (B) Influenza A virus strain WSN/1933/TS61 (H1N1) – human protein–protein interaction network.

Human protein sets

A total of eight sets of human proteins interacting with viral pathogens were constructed from the PHI data to analyze the functional properties of virus‐targeted human proteins. Firstly, the sets targeted by DNA viruses (DNA viruses‐targeted set), RNA viruses (RNA viruses‐targeted set), only DNA viruses, i.e. not targeted by any RNA viruses (only DNA viruses‐targeted set), and only RNA viruses, i.e. not targeted by any DNA viruses (only RNA viruses‐targeted set) were constructed to observe the characteristics specific to DNA and RNA viruses with respect to their interactions with human proteins. For a deeper comparison between DNA and RNA viral infections, human proteins interacting with at least four DNA virus families (4‐DNA viruses‐targeted set), as well as human proteins interacting with at least four RNA virus families (4‐RNA viruses‐targeted set) were used. On the other hand, to obtain the common infection strategies of viral pathogens, sets of human proteins targeted by all viruses, i.e. targeted by DNA and/or RNA viruses (viruses‐targeted set) and the ones targeted by both DNA and RNA viruses together (DNA‐RNA viruses‐targeted set) were also analyzed. The number of virus‐targeted human proteins covered by each set is tabulated in Table 3.
Table 3

Virus‐targeted human protein sets

Human protein setTargeting viruses# of human proteins
DNA viruses‐targetedDNA viruses3658
RNA viruses‐targetedRNA viruses2639
Only DNA viruses‐targetedOnly DNA viruses2304
Only RNA viruses‐targetedOnly RNA viruses1285
4‐DNA viruses‐targeted4 or more DNA virus families60
4‐RNA viruses‐targeted4 or more RNA virus families84
DNA‐RNA viruses‐targetedBoth DNA and RNA viruses1354
Viruses‐targetedDNA and/or RNA viruses4943
Virus‐targeted human protein sets

Gene ontology enrichment analysis

Gene ontology (GO) 17 enrichment analysis of the human protein sets was performed using the bingo plug‐in of Cytoscape 18. The significance level was set to 0.05 meaning that only terms enriched with a P‐value of at most 0.05 were considered after an enrichment calculation with hypergeometric test and then Benjamini and Hochberg false discovery rate correction. All three GO terms (biological process, molecular function, and cellular component) were scanned to identify the terms having significant association with each virus‐targeted human protein set studied.

Pathway enrichment analysis

Pathway enrichment analysis of the human protein sets was performed using the Web‐based tool, kobas (ver. 2.0) 19 based on information in KEGG pathway database 20. In the enrichment process, kobas platform uses hypergeometric test and Benjamini and Hochberg correction method. In this study, P‐value was set to 0.05 to obtain enriched human pathways for the virus‐targeted protein sets.

Results

Virus‐targeted human proteins

The visualization of virus–human networks (Fig. 1) provides some insights on the nature of interactions between the viral and human proteins. In a PHI network, few proteins may serve as hub nodes. These are human proteins targeted by lots of pathogen proteins; and pathogen proteins targeting lots of human proteins. This scale‐free behavior is observed in the DNA virus–human PHI network to some extent (Fig. 1A). On the other hand, in RNA viruses, very few viral proteins have roles in PHI networks because of their very small genomes. All of these RNA virus proteins usually have lots of interactions with human proteins (Fig. 1B). Therefore, in most cases, the scale‐free behavior could not be observed in RNA virus–human PHI networks. In fact, the degree distribution of the virus–human protein interaction networks could not be fitted to any model yet, mostly because of their incompleteness 21, despite some preliminary attempts to model the graph properties of PHI networks 10. The distribution of 4943 viruses‐targeted human proteins based on their attacking virus types can be observed from the number of proteins in the human protein sets (Table 3). A considerable amount of human proteins (1354) are targeted by both DNA and RNA viruses, constituting the common viral targets. All of the virus‐targeted human proteins with the number of targeting DNA/RNA virus families can be found in Data S3. Human proteins that are highly targeted by viruses, i.e. targeted by at least total 8 viral families and the targeting virus families, are presented in Table 4. The list includes 21 such human proteins with corresponding targeting viral families.
Table 4

Highly targeted human proteins

Human proteinTargeting DNA virus familiesTargeting RNA virus families
HNRPK – heterogeneous nuclear ribonucleoprotein KAsfarviridae, Herpesviridae, Papillomaviridae, PoxviridaeArteriviridae, Filoviridae, Flaviviridae, Orthomyxoviridae, Paramyxoviridae, Togaviridae
YBOX1 – nuclease‐sensitive element‐binding protein 1Herpesviridae, Papillomaviridae, Polyomaviridae, PoxviridaeArteriviridae, Flaviviridae, Orthomyxoviridae, Paramyxoviridae, Reoviridae, Togaviridae
EF1A1 – elongation factor 1‐alpha 1Herpesviridae, Papillomaviridae, PoxviridaeBunyaviridae, Filoviridae, Flaviviridae, Orthomyxoviridae, Paramyxoviridae, Reoviridae, Togaviridae
IMA1 – Importin subunit alpha‐1Adenoviridae, Herpesviridae, Papillomaviridae, Parvoviridae, Polyomaviridae, PoxviridaeCoronaviridae, Orthomyxoviridae, Paramyxoviridae
ADT2 – ADP/ATP translocase 2Herpesviridae, Papillomaviridae, PoxviridaeFiloviridae, Flaviviridae, Orthomyxoviridae, Paramyxoviridae, Reoviridae,Togaviridae
TBA1C – Tubulin alpha‐1C chainHerpesviridae, Papillomaviridae, PoxviridaeArenaviridae, Bunyaviridae, Filoviridae, Orthomyxoviridae, Paramyxoviridae, Reoviridae
ROA1 – heterogeneous nuclear ribonucleoprotein A1Herpesviridae, Papillomaviridae, PoxviridaeArteriviridae, Coronaviridae, Filoviridae, Flaviviridae, Orthomyxoviridae, Togaviridae
GRP78 – 78 kDa glucose‐regulated proteinHerpesviridae, Papillomaviridae, PoxviridaeBunyaviridae, Flaviviridae, Orthomyxoviridae, Paramyxoviridae, Rhabdoviridae, Togaviridae
TBB5 – Tubulin beta chainHerpesviridae, PoxviridaeBunyaviridae, Filoviridae, Flaviviridae, Orthomyxoviridae, Paramyxoviridae, Reoviridae, Togaviridae
P53 – cellular tumor antigen p53Adenoviridae, Herpesviridae, Papillomaviridae, Parvoviridae, PolyomaviridaeFlaviviridae, Orthomyxoviridae, Paramyxoviridae, Togaviridae
NPM – nucleophosminAdenoviridae, Herpesviridae, Papillomaviridae, PoxviridaeFlaviviridae, Orthomyxoviridae, Paramyxoviridae, Togaviridae
GBLP – guanine nucleotide‐binding protein subunit beta‐2‐like 1Adenoviridae, Herpesviridae, Papillomaviridae, PoxviridaeArteriviridae, Orthomyxoviridae, Paramyxoviridae, Togaviridae
TCPG – T‐complex protein 1 subunit gammaAdenoviridae, Herpesviridae, PapillomaviridaeArenaviridae, Bunyaviridae, Orthomyxoviridae, Paramyxoviridae, Reoviridae
EF1A3 – putative elongation factor 1‐alpha‐like 3Herpesviridae, Papillomaviridae, PoxviridaeBunyaviridae, Filoviridae, Orthomyxoviridae, Paramyxoviridae, Reoviridae
HNRPC – heterogeneous nuclear ribonucleoproteins C1/C2Herpesviridae, Papillomaviridae, PoxviridaeArteriviridae, Orthomyxoviridae, Paramyxoviridae, Rhabdoviridae, Togaviridae
TCPE – T‐complex protein 1 subunit epsilonAdenoviridae, HerpesviridaeArenaviridae, Bunyaviridae, Filoviridae, Orthomyxoviridae, Paramyxoviridae, Reoviridae
HS90B – heat shock protein HSP 90‐betaHerpesviridae, PoxviridaeBunyaviridae, Filoviridae, Flaviviridae, Orthomyxoviridae, Paramyxoviridae, Togaviridae
HNRH1 – heterogeneous nuclear ribonucleoprotein HHerpesviridae, PapillomaviridaeArteriviridae, Bunyaviridae, Filoviridae, Flaviviridae, Orthomyxoviridae, Paramyxoviridae
TBA1A – Tubulin alpha‐1A chainHerpesviridae, PoxviridaeArenaviridae, Bunyaviridae, Filoviridae, Orthomyxoviridae, Paramyxoviridae, Reoviridae
HNRPF – heterogeneous nuclear ribonucleoprotein FHerpesviridae, PapillomaviridaeArteriviridae, Bunyaviridae, Filoviridae, Flaviviridae, Orthomyxoviridae, Paramyxoviridae
ROA2 – heterogeneous nuclear ribonucleoproteins A2/B1Herpesviridae, PoxviridaeArteriviridae, Filoviridae, Flaviviridae, Orthomyxoviridae, Paramyxoviridae, Togaviridae
Highly targeted human proteins

Functional analysis of the virus‐targeted human proteins

GO enrichment analysis results

The enriched GO process terms can be used to point out the human processes that are attacked by DNA/RNA viruses. All enriched GO process, function and component terms for each human protein set are available in Data S4. Special attention should be given to the results of sets of human proteins interacting with only DNA virus proteins and only RNA virus proteins (Table 5) to retrieve specific attack strategies of these two different virus types. Enriched GO processes for the human proteins targeted highly by multiple viral families, 4‐DNA viruses‐targeted set and 4‐RNA viruses‐targeted set (Table 6) may reflect more specificity to infection mechanisms of the corresponding virus types. On the other hand, the results of human proteins interacting with both DNA and RNA viruses (Table 7) are also important to highlight common infection mechanisms shared by the two types of viruses.
Table 5

Top 20 enriched GO process terms in only DNA viruses‐targeted and only RNA viruses‐targeted human protein sets

Enriched GO process terms in only DNA viruses‐targeted set P‐valueEnriched GO process terms in only RNA viruses‐targeted set P‐value
Cellular process4.94E‐26Gene expression2.53E‐15
Cellular macromolecule metabolic process2.86E‐19Cellular process4.75E‐14
Cellular metabolic process4.61E‐19RNA processing9.57E‐11
Positive regulation of cellular process1.47E‐18RNA splicing2.68E‐09
Positive regulation of biological process4.49E‐16RNA metabolic process2.68E‐09
Macromolecule metabolic process1.34E‐15mRNA metabolic process2.81E‐09
Cellular component organization1.99E‐14Nucleobase, nucleoside, nucleotide and nucleic acid metabolic process1.07E‐08
Metabolic process1.26E‐13Cellular metabolic process1.25E‐08
Primary metabolic process1.92E‐13mRNA processing2.63E‐08
Cellular protein metabolic process4.14E‐12RNA localization4.45E‐08
Post‐translational protein modification5.25E‐11Nucleic acid metabolic process7.77E‐08
Organelle organization1.88E‐10Nucleic acid transport8.10E‐08
Protein modification process3.30E‐10RNA transport8.10E‐08
Cell cycle3.30E‐10Establishment of RNA localization8.10E‐08
Interspecies interaction between organisms7.65E‐10Cellular component organization3.25E‐07
Macromolecule modification3.28E‐09Cellular nitrogen compound metabolic process3.25E‐07
Regulation of molecular function8.70E‐09Nucleobase, nucleoside, nucleotide and nucleic acid transport4.02E‐07
Regulation of cell cycle9.08E‐09Metabolic process7.33E‐07
Nucleobase, nucleoside, nucleotide and nucleic acid metabolic process1.05E‐08mRNA transport7.69E‐07
Positive regulation of apoptosis1.06E‐08Macromolecule localization1.39E‐06
Table 6

Top 20 enriched GO process terms in 4‐DNA viruses‐targeted and 4‐RNA viruses‐targeted human protein sets

Enriched GO process terms in 4‐DNA viruses‐targeted set P‐valueEnriched GO process terms in 4‐RNA viruses‐targeted set P‐value
Interspecies interaction between organisms1.53E‐08Cellular macromolecular complex assembly9.83E‐11
Positive regulation of biological process2.25E‐06Cellular macromolecular complex subunit organization3.42E‐10
Multi‐organism process2.50E‐05Nucleosome assembly1.78E‐09
Positive regulation of cellular process3.77E‐05Chromatin assembly2.18E‐09
Modulation by host of viral transcription1.59E‐04Protein‐DNA complex assembly2.89E‐09
Modulation of transcription in other organism involved in symbiotic interaction1.59E‐04Nucleosome organization2.89E‐09
Modulation by host of symbiont transcription1.59E‐04DNA packaging2.78E‐08
Anatomical structure formation involved in morphogenesis2.82E‐04Macromolecular complex assembly3.33E‐08
Protein import into nucleus4.12E‐04Chromatin assembly or disassembly4.52E‐08
Nuclear import4.51E‐04Protein folding4.66E‐08
Nucleocytoplasmic transport5.50E‐04Macromolecular complex subunit organization6.99E‐08
Nuclear transport5.50E‐04Cellular process6.99E‐08
Modification by host of symbiont morphology or physiology6.42E‐04RNA splicing6.99E‐08
Regulation of viral transcription6.42E‐04DNA conformation change7.12E‐08
Protein localization in nucleus7.63E‐04Gene expression1.43E‐07
Cellular process8.59E‐04Cellular component assembly3.93E‐07
Regulation of protein modification process1.04E‐03Cellular component biogenesis5.46E‐07
Blood vessel development1.05E‐03Cellular macromolecule metabolic process9.59E‐07
Positive regulation of viral reproduction1.07E‐03Translational elongation9.59E‐07
Vasculature development1.07E‐03Response to unfolded protein1.25E‐06
Table 7

Top 20 enriched GO process terms in DNA‐RNA viruses‐targeted human protein set

GO process term P‐value
Translational elongation3.15E‐62
Cellular macromolecule metabolic process1.21E‐51
Translation2.28E‐46
Gene expression4.62E‐46
Interspecies interaction between organisms6.88E‐46
Cellular process4.86E‐43
Macromolecule metabolic process6.46E‐42
Cellular metabolic process8.78E‐39
Cellular macromolecule biosynthetic process4.41E‐34
Cellular protein metabolic process4.86E‐34
Macromolecule biosynthetic process7.66E‐33
Cellular component biogenesis4.07E‐32
Macromolecular complex assembly1.95E‐31
Macromolecular complex subunit organization1.64E‐29
Primary metabolic process2.32E‐29
Cellular macromolecular complex assembly7.09E‐29
Cellular component assembly7.17E‐29
Cellular macromolecular complex subunit organization7.64E‐28
Intracellular transport7.41E‐27
Metabolic process3.67E‐26
Top 20 enriched GO process terms in only DNA viruses‐targeted and only RNA viruses‐targeted human protein sets Top 20 enriched GO process terms in 4‐DNA viruses‐targeted and 4‐RNA viruses‐targeted human protein sets Top 20 enriched GO process terms in DNA‐RNA viruses‐targeted human protein set

Pathway enrichment analysis results

Enriched pathway terms for five specific human protein sets are listed in Tables 8, 9, 10, presenting the certain characteristics of DNA and RNA viruses attack strategies. Similar to GO enrichment analysis results, these human protein sets are only DNA viruses‐targeted and only RNA viruses‐targeted (Table 8), 4‐DNA viruses‐targeted and 4‐RNA viruses‐targeted (Table 9), and DNA‐RNA viruses‐targeted (Table 10). Pathway enrichment analysis results are provided in Data S5 for all of eight virus‐targeted human protein sets under investigation.
Table 8

Enriched pathway terms in only DNA viruses‐targeted and only RNA viruses‐targeted human protein sets

Enriched pathway terms in only DNA viruses‐targeted set P‐valueEnriched pathway terms in only RNA viruses‐targeted set P‐value
Cell cycle2.55E‐04RNA transport1.18E‐11
Oxidative phosphorylation1.04E‐02Spliceosome6.43E‐08
Huntington's disease1.09E‐02mRNA surveillance pathway5.36E‐04
Viral carcinogenesis2.23E‐02Pathogenic Escherichia coli infection1.41E‐02
p53 signaling pathway2.59E‐02Aminoacyl‐tRNA biosynthesis2.16E‐02
TNF signaling pathway3.49E‐02Ribosome biogenesis in eukaryotes3.05E‐02
Small cell lung cancer3.80E‐02
Notch signaling pathway3.80E‐02
Table 9

Enriched pathway terms in 4‐DNA viruses‐targeted and 4‐RNA viruses‐targeted human protein sets

Enriched pathway terms in 4‐DNA viruses‐targeted set P‐valueEnriched pathway terms in 4‐RNA viruses‐targeted set P‐value
Epstein–Barr virus infection1.06E‐03Systemic lupus erythematosus1.79E‐06
Viral carcinogenesis4.96E‐03Alcoholism1.74E‐05
Pathogenic Escherichia coli infection1.69E‐04
Ribosome1.14E‐03
Protein processing in endoplasmic reticulum1.94E‐03
Antigen processing and presentation5.20E‐03
Spliceosome7.74E‐03
Legionellosis2.04E‐02
Phagosome2.52E‐02
Table 10

Enriched pathway terms in DNA‐RNAviruses targeted human protein set

Pathway term P‐value
Ribosome1.58E‐22
Epstein–Barr virus infection1.23E‐11
Proteasome1.23E‐11
Protein processing in endoplasmic reticulum4.81E‐08
Spliceosome8.68E‐07
Viral carcinogenesis3.04E‐06
Herpes simplex infection5.69E‐05
RNA transport1.26E‐04
Systemic lupus erythematosus4.82E‐03
Pathogenic Escherichia coli infection9.56E‐03
Influenza A1.23E‐02
Toxoplasmosis1.23E‐02
Small cell lung cancer1.46E‐02
Hepatitis B1.76E‐02
mRNA surveillance pathway1.78E‐02
Hepatitis C1.82E‐02
Measles1.93E‐02
Alcoholism4.13E‐02
Chronic myeloid leukemia4.95E‐02
Enriched pathway terms in only DNA viruses‐targeted and only RNA viruses‐targeted human protein sets Enriched pathway terms in 4‐DNA viruses‐targeted and 4‐RNA viruses‐targeted human protein sets Enriched pathway terms in DNA‐RNAviruses targeted human protein set

Discussion

Most of the current antiviral therapeutics act for inhibiting specific viral proteins, e.g. essential viral enzymes. Unfortunately, this approach has been ineffective because of drug resistance developed by viruses, especially in the case of RNA viruses which can mutate very rapidly. The next‐generation antiviral therapeutics are emerging which target host proteins required by the pathogens, instead of targeting pathogen proteins. If these host factors are indispensable for pathogens, but not essential for host cells, their silencing may effectively inhibit infections without developing drug resistance rapidly 1, 21, 22. Another alternative approach is to inhibit the interactions between these host factors and pathogen proteins, instead of targeting the proteins 23. The development of these novel strategic therapeutic approaches against infectious diseases raises the need for enlightening the infection mechanisms through PHIs, in order to identify putative host‐oriented anti‐infective therapeutic targets. To understand the complex mechanisms of infections, computational analysis of underlying protein interaction networks may serve crucial insights to develop non‐conventional solutions 2, 14, 24. This study of computational analysis of virus–human interactomes aims to provide initial insights on the infection mechanisms of DNA and RNA viruses, comparatively, through the observation of the characteristics of human proteins interacting with viral proteins. The common and special infection strategies of DNA and RNA viruses found here may lead to the development of broad and specific next‐generation antiviral therapeutics.

Highly targeted human proteins

As the main viral infection strategy, all viruses manipulate cellular processes to proliferate within the host. Therefore, viral proteins highly interact with human proteins functioning in cell cycle, human transcription factors to promote viral genetic material transcription, nuclear membrane proteins for transporting viral genetic material across the nuclear membrane, and also regulatory proteins for translation and apoptosis 3, 15, 25, 26. We identified human proteins that are highly interacting with viral proteins, sequentially based on the total number of targeting virus families (Table 4). The list includes the top viral targets which interact with multiple viral families, within the most comprehensive PHI data. Some of these human proteins were previously reported as targets for multiple viruses, i.e. P53, NPM, ROA2, GBLP, and HNRPK 3, 15. Our analyses revealed that there are six heterogeneous nuclear ribonucleoproteins (HNRPs) in the highly targeted human proteins list (HNRPK, ROA1, HNRPC, HNRH1, HNRPF, ROA2). HNRPs are RNA‐binding proteins, which function in processing heterogeneous nuclear RNAs into mature mRNAs and in regulating gene expression. Specifically, they take role in the export of mRNA from the nucleus to the cytoplasm. They also recruit regulatory proteins associated with pathways related to DNA and RNA metabolism 27, 28. Being targeted by multiple viruses, HNRPU was reported as a hotspot of viral infection, and proposed as a potential antiviral human protein 4. In the present study, HNRPU is found to be targeted by five viral families (see Data S3). Our data additionally indicate several other HNRPs, targeted by viral proteins (see Data S1–S3). For all virus‐targeted HNRPs, the number of targeting RNA virus families is found to be higher than that of DNA virus families (see Data S3), revealing that they may play crucial roles in viral RNA processing. The protein family of HNRPs may serve as host‐oriented antiviral drug targets. Moreover, our analyses also reflected that proteins functioning in transport and localization related processes within the cell are targeted highly by both DNA and RNA viruses, i.e. IMA1, ADT2, TCPG, and TCPE. IMA1 (Karyopherin alpha 2, KPNA2) functions mainly in nuclear import as an adapter protein for nuclear receptor KPNB1 (Karyopherin beta 1). Interacting with IMA1 enables viruses to enter the nucleus and consequently to use the host's transcriptional machinery. Besides, viruses may interact with IMA1 in order to inhibit the host antiviral response, since nuclear import factors regulate the transport of innate immune regulatory proteins to the nucleus of cells to activate the antiviral response 3, 29, 30, 31. The transmembrane transporter activity of ADT2 is responsible for the exchange of cytoplasmic ADP with mitochondrial ATP across the mitochondrial membrane, serving crucial roles in metabolic processes 32. Attacking to human metabolic processes was reported as a common infection strategy of bacteria and viruses 15. The proteins, TCPG and TCPE are responsible for RNA localization activity and our results reveal that they are targeted by larger number of RNA families (Table 4). Highly targeted transporter proteins should be investigated further for their potential to be next‐generation antiviral target, because of their crucial roles in viral life cycle within the host organism. EF1A1 and EF1A3 function as translation elongation factors in protein biosynthesis. EF1A proteins promote the GTP‐dependent binding of aminoacyl‐tRNA to the A‐site of ribosomes during protein biosynthesis with a responsibility of achieving accuracy of translation 33. Translation elongation factors were reported as targets for viruses, in early studies 34, 35, 36. Since they are essential components of the cellular translational machinery, viruses interact with them for biosynthesis of viral proteins within the host cell. We found translational elongation as the top biological process, commonly targeted by both DNA and RNA viruses (Table 7). Interacting with human transcription factors was reported as one of the main viral infection strategies 3, 15. Among the highly targeted human proteins, YBOX1 and P53 have transcription factor activity. Both of these proteins are multifunctional. YBOX1 functions in transcription of numerous genes, as a transcription factor. It also contributes to the regulation of translation. On the other hand, P53 is the famous tumor supressor acting as an activator for apoptotic cell death. Apoptosis is a very crucial process during the viral infection progress, and should be strategically controlled by viruses for a successful viral infection. Apoptosis is an innate immune response to viral infection. In the early stage of viral life cycle in the host cell, apoptosis is inhibited by corresponding virus–human protein interactions. After completion of transcription and translation of viral genetic material, viruses try to induce apoptosis to assist virus dissemination 37, 38, 39. Among the highly targeted human proteins in Table 4, EF1A1, ADT2, TBA1C, GRP78, TBB5, P53, TCPG, HS90B, and TBA1A were found as drug targets listed in DrugBank 40. However, only ADT2, GRP78, TBB5, P53, and TBA1A are approved for commercial drugs. Nevertheless, no antiviral therapeutic usage is available for these drug targets yet. Above‐mentioned human proteins; ribonucleoproteins, proteins functioning in intracellular transport and localization, translation elongation factors and transcription factors require further investigation for their potential for serving as antiviral drug targets.

Targeted human mechanisms

Gene ontology and pathway enrichment analyses of pathogen‐targeted host proteins are widely used in bioinformatic analysis of PHI networks to understand the attack strategies of pathogens 3, 4, 15, 41, 42 as well as in verification of computationally predicted PHIs 43. Additionally, GO and pathway terms are widely used as features in computational PHI prediction studies 44, 45. Our observation of the enriched GO process terms for human proteins targeted by only DNA viruses (Table 5) may lead to the conclusion that DNA viruses have specifically evolved to be able to attack human cellular and metabolic processes simultaneously, during infections. Using this PHI mechanism, DNA viruses can finely exploit the cellular and metabolic mechanisms of infected cells to their own advantage, generally resulting in chronic infections in human. On the other hand, GO process terms enriched in human proteins targeted by only RNA viruses are mostly related to RNA processing, intracellular transport and localization within the cell (Table 5). It was reported that RNA viruses extensively target human proteins that are involved in RNA metabolism and also protein and RNA transport to promote viral RNA processing for a successful infection 4. Further investigation of the enriched processes of human proteins attacked by multiple DNA viruses (Table 6) pointed out their high preference to target cellular processes. It was reported that DNA viruses tend to target crosstalking human proteins linking the cell cycle with either transcription or chromosome biology, with a possible aim of promoting viral replication instead of cellular growth 4. For the RNA viruses, we found that the human proteins attacked by multiple RNA virus families are enriched in specific processes within the cellular mechanisms (Table 6). All viruses need host's transcriptional machinery for viral genetic material transcription. In the case of human proteins targeted by both DNA and RNA viruses, the P‐values of the enriched GO process terms are very low, indicating statistically strong results (Table 7). The most highly‐targeted human process is translational elongation. Translational control of viral gene expression in eukaryotic hosts was reported repeatedly 46, 47, 48. Here, we presented translational elongation as the top GO process term enriched in human proteins targeted by both DNA and RNA viruses within the current experimental PHI data. The remaining list includes cellular and metabolic processes, which can be considered as targets of both virus types. Based on these observations, we can state that the common viral infection strategy is to target human proteins functioning within the processes of gene expression and protein synthesis, simply because of the lack of their own such machineries. All viruses depend on the cellular mechanisms for these processes and they recruit host ribosomes for translation of viral proteins. A comparative investigation of the enriched pathway terms for human protein sets targeted by only DNA viruses and by only RNA viruses (Table 8) reveals additional support for the different infection strategies of these viral groups. There is no common term in these two lists of enriched human pathways. Cell cycle pathway targeted by only DNA viruses and RNA‐related pathways targeted by only RNA viruses, provide parallel results with GO enrichment analyses. The enriched pathway terms in 4‐DNA viruses‐targeted human protein set are only Epstein–Barr virus (EBV) infection and viral carcinogenesis (Table 9). EBV is a species of DNA virus family Herpesviridae, which constitute nearly half of the DNA viruses–human PHI data (Table 1). On the other hand, it is estimated that 15% of all human tumors are caused by viruses, mainly DNA viruses, i.e. Herpesviruses and Papillomaviruses 49. The pathway enrichment analysis of 4‐RNA viruses‐targeted set brings the terms of protein processing and immune system related terms forward (Table 9). Finally, for the common targets of two virus types, we obtained ribosome term enriched with a very small P‐value (Table 10). Both viruses use host ribosome for viral protein synthesis.

Conclusions

In this study, an initial system‐level understanding of viral infection mechanisms through PHI networks was pursued by comparing DNA and RNA viruses, aiming to provide a framework for further investigations of infection mechanisms in the light of more precise information on pathogen–host systems in the near future. Ongoing studies and increasing amounts of experimentally‐verified PHI data will further improve our understanding of the interplay between pathogens and human and hopefully identify novel and effective therapeutics for infectious disesases.

Author contributions

SD and KÖÜ conceived the study. SD performed the study. SD and KÜÖ prepared the manuscript. Data S1. DNA viruses–human PHI data. Click here for additional data file. Data S2. RNA viruses–human PHI data. Click here for additional data file. Data S3. Number of targeting DNA/RNA virus families for each targeted human proteins. Click here for additional data file. Data S4. GO enrichment results for the human protein sets. Click here for additional data file. Data S5. Pathway enrichment results for the human protein sets. Click here for additional data file.
  48 in total

Review 1.  Elongation factors in protein biosynthesis.

Authors:  Gregers R Andersen; Poul Nissen; Jens Nyborg
Journal:  Trends Biochem Sci       Date:  2003-08       Impact factor: 13.807

Review 2.  Virus-host protein interactions in RNA viruses.

Authors:  Pierre-Olivier Vidalain; Frédéric Tangy
Journal:  Microbes Infect       Date:  2010-09-09       Impact factor: 2.700

3.  PHISTO: pathogen-host interaction search tool.

Authors:  Saliha Durmuş Tekir; Tunahan Çakır; Emre Ardiç; Ali Semih Sayılırbaş; Gökhan Konuk; Mithat Konuk; Hasret Sarıyer; Azat Uğurlu; İlknur Karadeniz; Arzucan Özgür; Fatih Erdoğan Sevilgen; Kutlu Ö Ülgen
Journal:  Bioinformatics       Date:  2013-03-20       Impact factor: 6.937

4.  Translation elongation factor 1-alpha interacts specifically with the human immunodeficiency virus type 1 Gag polyprotein.

Authors:  A Cimarelli; J Luban
Journal:  J Virol       Date:  1999-07       Impact factor: 5.103

5.  Specific in vitro interaction between papillomavirus E2 proteins and TBP-associated factors.

Authors:  Elba Carrillo; Efrain Garrido; Patricio Gariglio
Journal:  Intervirology       Date:  2004       Impact factor: 1.763

6.  Severe acute respiratory syndrome coronavirus ORF6 antagonizes STAT1 function by sequestering nuclear import factors on the rough endoplasmic reticulum/Golgi membrane.

Authors:  Matthew Frieman; Boyd Yount; Mark Heise; Sarah A Kopecky-Bromberg; Peter Palese; Ralph S Baric
Journal:  J Virol       Date:  2007-06-27       Impact factor: 5.103

7.  Network-based prediction and analysis of HIV dependency factors.

Authors:  T M Murali; Matthew D Dyer; David Badger; Brett M Tyler; Michael G Katze
Journal:  PLoS Comput Biol       Date:  2011-09-22       Impact factor: 4.475

8.  Infection Strategies of Bacterial and Viral Pathogens through Pathogen-Human Protein-Protein Interactions.

Authors:  Saliha Durmuş Tekir; Tunahan Cakir; Kutlu Ö Ulgen
Journal:  Front Microbiol       Date:  2012-02-14       Impact factor: 5.640

9.  DrugBank: a comprehensive resource for in silico drug discovery and exploration.

Authors:  David S Wishart; Craig Knox; An Chi Guo; Savita Shrivastava; Murtaza Hassanali; Paul Stothard; Zhan Chang; Jennifer Woolsey
Journal:  Nucleic Acids Res       Date:  2006-01-01       Impact factor: 16.971

10.  Virus-host interactomics: new insights and opportunities for antiviral drug discovery.

Authors:  Benoît de Chassey; Laurène Meyniel-Schicklin; Jacky Vonderscher; Patrice André; Vincent Lotteau
Journal:  Genome Med       Date:  2014-11-29       Impact factor: 11.117

View more
  18 in total

Review 1.  Virological and Immunological Outcomes of Coinfections.

Authors:  Naveen Kumar; Shalini Sharma; Sanjay Barua; Bhupendra N Tripathi; Barry T Rouse
Journal:  Clin Microbiol Rev       Date:  2018-07-05       Impact factor: 26.132

Review 2.  Towards an ecosystem model of infectious disease.

Authors:  James M Hassell; Tim Newbold; Andrew P Dobson; Yvonne-Marie Linton; Lydia H V Franklinos; Dawn Zimmerman; Katrina M Pagenkopp Lohan
Journal:  Nat Ecol Evol       Date:  2021-05-17       Impact factor: 15.460

Review 3.  Study of combining virtual screening and antiviral treatments of the Sars-CoV-2 (Covid-19).

Authors:  Ehsaneh Khodadadi; Parham Maroufi; Ehsan Khodadadi; Isabella Esposito; Khudaverdi Ganbarov; Silvano Espsoito; Mehdi Yousefi; Elham Zeinalzadeh; Hossein Samadi Kafil
Journal:  Microb Pathog       Date:  2020-05-05       Impact factor: 3.738

Review 4.  COVID-19 and Dentistry: Prevention in Dental Practice, a Literature Review.

Authors:  Federico Alcide Villani; Riccardo Aiuto; Luigi Paglia; Dino Re
Journal:  Int J Environ Res Public Health       Date:  2020-06-26       Impact factor: 3.390

Review 5.  Investigating the Interaction between Negative Strand RNA Viruses and Their Hosts for Enhanced Vaccine Development and Production.

Authors:  Kostlend Mara; Meiling Dai; Aaron M Brice; Marina R Alexander; Leon Tribolet; Daniel S Layton; Andrew G D Bean
Journal:  Vaccines (Basel)       Date:  2021-01-17

Review 6.  Clocks, Viruses, and Immunity: Lessons for the COVID-19 Pandemic.

Authors:  Shaon Sengupta; Louise Ince; Francesca Sartor; Helene Borrmann; Xiaodong Zhuang; Amruta Naik; Annie Curtis; Jane A McKeating
Journal:  J Biol Rhythms       Date:  2021-01-22       Impact factor: 3.182

7.  Integrative gene network analysis identifies key signatures, intrinsic networks and host factors for influenza virus A infections.

Authors:  Christian V Forst; Bin Zhou; Minghui Wang; Tsui-Wen Chou; Guy Mason; Won-Min Song; Eric Schadt; Elodie Ghedin; Bin Zhang
Journal:  NPJ Syst Biol Appl       Date:  2017-12-04

Review 8.  Oxygen Sensing and Viral Replication: Implications for Tropism and Pathogenesis.

Authors:  Peter Jianrui Liu; Peter Balfe; Jane A McKeating; Mirjam Schilling
Journal:  Viruses       Date:  2020-10-25       Impact factor: 5.048

Review 9.  RNA-Dependent RNA Polymerase as a Target for COVID-19 Drug Discovery.

Authors:  Wei Zhu; Catherine Z Chen; Kirill Gorshkov; Miao Xu; Donald C Lo; Wei Zheng
Journal:  SLAS Discov       Date:  2020-07-13       Impact factor: 3.341

10.  Structural Genomics of SARS-CoV-2 Indicates Evolutionary Conserved Functional Regions of Viral Proteins.

Authors:  Suhas Srinivasan; Hongzhu Cui; Ziyang Gao; Ming Liu; Senbao Lu; Winnie Mkandawire; Oleksandr Narykov; Mo Sun; Dmitry Korkin
Journal:  Viruses       Date:  2020-03-25       Impact factor: 5.048

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.