| Literature DB >> 22347880 |
Saliha Durmuş Tekir1, Tunahan Cakir, Kutlu Ö Ulgen.
Abstract
Since ancient times, even in today's modern world, infectious diseases cause lots of people to die. Infectious organisms, pathogens, cause diseases by physical interactions with human proteins. A thorough analysis of these interspecies interactions is required to provide insights about infection strategies of pathogens. Here we analyzed the most comprehensive available pathogen-human protein interaction data including 23,435 interactions, targeting 5,210 human proteins. The data were obtained from the newly developed pathogen-host interaction search tool, PHISTO. This is the first comprehensive attempt to get a comparison between bacterial and viral infections. We investigated human proteins that are targeted by bacteria and viruses to provide an overview of common and special infection strategies used by these pathogen types. We observed that in the human protein interaction network the proteins targeted by pathogens have higher connectivity and betweenness centrality values than those proteins not interacting with pathogens. The preference of interacting with hub and bottleneck proteins is found to be a common infection strategy of all types of pathogens to manipulate essential mechanisms in human. Compared to bacteria, viruses tend to interact with human proteins of much higher connectivity and centrality values in the human network. Gene Ontology enrichment analysis of the human proteins targeted by pathogens indicates crucial clues about the infection mechanisms of bacteria and viruses. As the main infection strategy, bacteria interact with human proteins that function in immune response to disrupt human defense mechanisms. Indispensable viral strategy, on the other hand, is the manipulation of human cellular processes in order to use that transcriptional machinery for their own genetic material transcription. A novel observation about pathogen-human systems is that the human proteins targeted by both pathogens are enriched in the regulation of metabolic processes.Entities:
Keywords: PHISTO; bottleneck; gene ontology; hub; infection strategy; pathogen–human protein–protein interactions
Year: 2012 PMID: 22347880 PMCID: PMC3278985 DOI: 10.3389/fmicb.2012.00046
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
Contents of pathogen–human PHI data.
| Pathogen | Number of strains | Number of PHIs | Number of targeting pathogen proteins | Number of targeted human proteins |
|---|---|---|---|---|
| 1 | 2 | 1 | 2 | |
| 2 | 3,021 | 940 | 1,736 | |
| 1 | 3 | 1 | 3 | |
| 2 | 21 | 3 | 21 | |
| 1 | 1 | 1 | 1 | |
| 3 | 47 | 9 | 10 | |
| 1 | 1 | 1 | 1 | |
| 4 | 30 | 14 | 27 | |
| 1 | 1 | 1 | 1 | |
| 1 | 1,338 | 346 | 986 | |
| 2 | 3 | 3 | 2 | |
| 1 | 1 | 1 | 1 | |
| 1 | 1 | 1 | 1 | |
| 1 | 4 | 4 | 3 | |
| 1 | 1 | 1 | 1 | |
| 1 | 2 | 1 | 2 | |
| 1 | 17 | 1 | 17 | |
| 1 | 12 | 4 | 10 | |
| 1 | 5 | 5 | 5 | |
| 1 | 11 | 9 | 8 | |
| 3 | 12 | 10 | 10 | |
| 5 | 15 | 12 | 9 | |
| 1 | 1 | 1 | 1 | |
| 4 | 3,999 | 1,221 | 2,120 | |
| 1 | 1 | 1 | 1 | |
| 1 | 1 | 1 | 1 | |
| 1 | 2 | 1 | 2 | |
| 3 | 8 | 4 | 8 | |
| 1 | 1 | 1 | 1 | |
| Adenovirus | 13 | 121 | 36 | 80 |
| Anemia virus | 6 | 8 | 6 | 4 |
| ASFV | 1 | 1 | 1 | 1 |
| Bacteriophage | 6 | 6 | 6 | 5 |
| Coxsackie virus | 1 | 1 | 1 | 1 |
| Dengue virus | 3 | 3 | 3 | 2 |
| Ebola virus | 1 | 1 | 1 | 1 |
| Echo virus | 2 | 3 | 3 | 1 |
| Ectromelia virus | 1 | 2 | 2 | 2 |
| Encephalitis virus | 1 | 2 | 1 | 2 |
| Foamy virus | 1 | 1 | 1 | 1 |
| Hantaan virus | 1 | 6 | 1 | 6 |
| Hendra virus | 1 | 1 | 1 | 1 |
| Hepatitis virus | 21 | 1,573 | 179 | 399 |
| Herpesvirus | 28 | 666 | 141 | 388 |
| HIV | 49 | 11,435 | 279 | 1,601 |
| Influenza virus | 9 | 523 | 27 | 182 |
| Leukemia virus | 3 | 10 | 3 | 10 |
| Measles virus | 3 | 10 | 4 | 4 |
| Molluscum virus | 1 | 1 | 1 | 1 |
| MPMV | 1 | 1 | 1 | 1 |
| Nipah virus | 1 | 1 | 1 | 1 |
| Nucleopolyhedrovirus | 1 | 1 | 1 | 1 |
| Orf virus | 2 | 2 | 2 | 1 |
| Papillomavirus | 14 | 290 | 51 | 128 |
| Parainfluenza virus | 1 | 2 | 1 | 2 |
| Parvo virus | 1 | 1 | 1 | 1 |
| Polio virus | 2 | 3 | 2 | 2 |
| Polyomavirus | 4 | 64 | 10 | 45 |
| Puumala virus | 1 | 3 | 1 | 3 |
| Rabies virus | 1 | 1 | 1 | 1 |
| Rhino virus | 1 | 1 | 1 | 1 |
| Rota virus | 4 | 8 | 5 | 6 |
| Sarcoma virus | 5 | 15 | 6 | 11 |
| SARS | 1 | 4 | 3 | 4 |
| Sendai virus | 1 | 1 | 1 | 1 |
| Seoul virus | 1 | 4 | 1 | 4 |
| SIV | 2 | 3 | 2 | 3 |
| Stomatitis virus | 3 | 7 | 3 | 6 |
| T-lymphotropic virus | 3 | 38 | 7 | 35 |
| Tula virus | 1 | 2 | 1 | 2 |
| Vaccinia virus | 5 | 46 | 20 | 33 |
| West Nile virus | 1 | 1 | 1 | 1 |
See Data Sheets 1–4 in Supplementary Material for detailed information.
Figure 1The number of pathogen-targeted human proteins that are grouped based on their interactions with viruses, bacteria, and fungi – protozoa (targeted by fungi and/or protozoa).
Highly targeted human proteins.
| Protein | Degree | Betweenness centrality | Targeting bacterial groups | Targeting viral groups |
|---|---|---|---|---|
| P53 | 347 | 0.01547 | Adenovirus, Hepatitis virus, HIV, Papillomavirus, Polyomavirus, SIV | |
| DRA | 52 | 0.00003 | Herpesvirus, HIV, Influenza virus | |
| SUMO1 | 103 | 0.00366 | Herpesvirus, HIV, Papillomavirus, Puumala virus, SARS, Tula virus, Vaccinia virus | |
| JUN | 122 | 0.00335 | Hepatitis virus, HIV, Papillomavirus, Vaccinia virus | |
| NPM | 137 | 0.00166 | Adenovirus, Hepatitis virus, Herpesvirus, HIV | |
| ROA1 | 246 | 0.00262 | Herpesvirus, HIV, Influenza virus, SARS | |
| UBC9 | 134 | 0.00410 | Hantaan virus, Herpesvirus, HIV, Influenza virus, Papillomavirus, Seoul virus | |
| IGHG1 | 57 | 0.00219 | Herpesvirus | |
| RAC1 | 239 | 0.00279 | HIV | |
| CDC42 | 232 | 0.00405 | HIV, T-lymphotropic virus | |
| DRB5 | – | – | Herpesvirus, HIV | |
| LCK | 147 | 0.00202 | Hepatitis virus, Herpesvirus, HIV | |
| XRCC6 | 131 | 0.00445 | Herpesvirus, HIV, Polyomavirus | |
| KPYM | 76 | 0.00041 | Hepatitis virus, Herpesvirus, Papillomavirus | |
| ROA2 | 189 | 0.00069 | Herpesvirus, Influenza virus, Vaccinia virus | |
| P85A | 402 | 0.00914 | Anemia virus, HIV, Influenza virus | |
| STAT3 | 77 | 0.00133 | Hepatitis virus, Herpesvirus, HIV | |
| STAT1 | 71 | 0.00104 | Adenovirus, Herpesvirus, HIV | |
| GBLP | 93 | 0.00265 | Adenovirus, Herpesvirus, HIV | |
| PARP4 | 1 | 0.00000 | Hepatitis virus, Herpesvirus, HIV | |
| RB | 149 | 0.00282 | Adenovirus, Herpesvirus, HIV, Papillomavirus, Polyomavirus | |
| SP1 | 103 | 0.00268 | Adenovirus, Herpesvirus, HIV, Polyomavirus, T-lymphotropic virus | |
| TAF1 | 58 | 0.00025 | Adenovirus, Hepatitis virus, HIV, Papillomavirus, Polyomavirus | |
| CDK2 | 151 | 0.00220 | Herpesvirus, HIV, Papillomavirus, Polyomavirus, T-lymphotropic virus | |
| TF2B | 69 | 0.00020 | Hepatitis virus, Herpesvirus, HIV, Papillomavirus, Polyomavirus | |
| EP300 | 123 | 0.00245 | Adenovirus, Hepatitis virus, HIV, Papillomavirus, Polyomavirus | |
| CBP | 147 | 0.00304 | Adenovirus, Hepatitis virus, HIV, Papillomavirus, Polyomavirus | |
| TBP | 147 | 0.00241 | – | Adenovirus, Hepatitis virus, Herpesvirus, HIV, Papillomavirus, Polyomavirus |
The targeting pathogenic proteins for each human protein can be obtained from Data Sheets 1–4 in Supplementary Material.
Figure 2The cumulative degree distributions of human protein sets. The distribution of all proteins in the PPI network is given in comparison with (A) the bacteria-targeted sets, and (B) the virus-targeted sets. The number of proteins in each set is given in the parentheses. The fraction of proteins at a particular value of degree is the number of proteins having that value and greater divided by the number of proteins in the set.
Figure 3The cumulative betweenness centrality distributions of human protein sets. The distribution of all proteins in the PPI network is given in comparison with (A) the bacteria-targeted sets, and (B) the virus-targeted sets. The number of proteins in each set is given in the parentheses. The fraction of proteins at a particular value of betweenness centrality is the number of proteins having that value and greater divided by the number of proteins in the set.
Figure 4The cumulative distributions of degree and betweenness centrality of human proteins excluding . The number of proteins in each set is given in the parentheses. (A) The degree distributions (B) the betweenness centrality distributions. The fraction of proteins at a particular value of degree is the number of proteins having that value and greater divided by the number of proteins in the set.
First 20 enriched GO process terms in human proteins targeted by at least three bacterial groups (three-bacteria-targeted set).
| GO process term | |
|---|---|
| I-kappaB kinase/NF-kappaB cascade | 9.64E−13 |
| Regulation of biological process | 9.69E−10 |
| Biological regulation | 2.59E−09 |
| Negative regulation of biological process | 4.89E−09 |
| Positive regulation by organism of immune response of other organism involved in symbiotic interaction | 6.64E−09 |
| Modulation by organism of immune response of other organism involved in symbiotic interaction | 6.64E−09 |
| Modulation by symbiont of host immune response | 6.64E−09 |
| Positive regulation by symbiont of host immune response | 6.64E−09 |
| Response to immune response of other organism involved in symbiotic interaction | 6.64E−09 |
| Response to host immune response | 6.64E−09 |
| Positive regulation by organism of defense response of other organism involved in symbiotic interaction | 6.64E−09 |
| Positive regulation by symbiont of host defense response | 6.64E−09 |
| Positive regulation by organism of innate immunity in other organism involved in symbiotic interaction | 6.64E−09 |
| Modulation by organism of innate immunity in other organism involved in symbiotic interaction | 6.64E−09 |
| Pathogen-associated molecular pattern dependent modulation by organism of innate immunity in other organism involved in symbiotic interaction | 6.64E−09 |
| Modulation by organism of defense response of other organism involved in symbiotic interaction | 6.64E−09 |
| Pathogen-associated molecular pattern dependent induction by organism of innate immunity of other organism involved in symbiotic interaction | 6.64E−09 |
| Modulation by symbiont of host defense response | 6.64E−09 |
| Pathogen-associated molecular pattern dependent induction by symbiont of host innate immunity | 6.64E−09 |
| Modulation by symbiont of host innate immunity | 6.64E−09 |
See Data Sheet 10 in Supplementary Material for the whole list and the human proteins corresponding to each GO term.
First 20 enriched GO process terms in human proteins targeted by both bacterial and viral groups (bacteria–virus-targeted set).
| GO process term | |
|---|---|
| Interspecies interaction between organisms | 1.64E−52 |
| Multi-organism process | 1.01E−47 |
| Positive regulation of biological process | 5.62E−47 |
| Regulation of biological process | 1.32E−42 |
| Biological regulation | 2.66E−40 |
| Positive regulation of cellular process | 4.59E−40 |
| Negative regulation of biological process | 6.32E−37 |
| Regulation of cellular process | 2.00E−36 |
| Negative regulation of cellular process | 4.84E−32 |
| Regulation of protein metabolic process | 7.68E−30 |
| Regulation of macromolecule metabolic process | 3.21E−29 |
| Regulation of cellular protein metabolic process | 1.39E−28 |
| Regulation of cell death | 1.74E−28 |
| Positive regulation of macromolecule metabolic process | 1.91E−28 |
| Positive regulation of cellular metabolic process | 7.04E−28 |
| Regulation of programmed cell death | 1.13E−27 |
| Cellular macromolecule metabolic process | 1.94E−27 |
| Positive regulation of metabolic process | 2.92E−27 |
| Negative regulation of macromolecule metabolic process | 5.31E−27 |
| Regulation of apoptosis | 6.54E−27 |
See Data Sheet 10 in Supplementary Material for the whole list and the human proteins corresponding to each GO term.
First 20 enriched GO process terms in human proteins targeted by at least three viral groups (three-viruses-targeted set).
| Go process term | |
|---|---|
| Interspecies interaction between organisms | 1.89E−40 |
| Multi-organism process | 1.19E−27 |
| Positive regulation of cellular process | 1.12E−17 |
| Positive regulation of biological process | 1.06E−16 |
| Cellular macromolecule metabolic process | 1.12E−15 |
| Nucleic acid metabolic process | 4.49E−14 |
| Positive regulation of macromolecule metabolic process | 4.60E−14 |
| Cell cycle process | 6.72E−14 |
| Positive regulation of gene expression | 1.49E−13 |
| Cell cycle | 2.06E−13 |
| Positive regulation of metabolic process | 3.79E−13 |
| Positive regulation of transcription | 4.37E−13 |
| Macromolecule metabolic process | 8.51E−13 |
| Positive regulation of cellular metabolic process | 3.89E−12 |
| Cellular response to stimulus | 6.61E−12 |
| Positive regulation of nucleobase, nucleoside, nucleotide, and nucleic acid metabolic process | 7.26E−12 |
| Positive regulation of macromolecule biosynthetic process | 1.32E−11 |
| Positive regulation of nitrogen compound metabolic process | 1.41E−11 |
| Positive regulation of transcription, DNA-dependent | 1.44E−11 |
| Regulation of cell cycle | 1.47E−11 |
See Data Sheet 10 in Supplementary Material for the whole list and the human proteins corresponding to each GO term.