| Literature DB >> 24955349 |
Ning Zhang1, Min Jiang2, Tao Huang3, Yu-Dong Cai4.
Abstract
The recently emerging Influenza A/H7N9 virus is reported to be able to infect humans and cause mortality. However, viral and host factors associated with the infection are poorly understood. It is suggested by the "guilt by association" rule that interacting proteins share the same or similar functions and hence may be involved in the same pathway. In this study, we developed a computational method to identify Influenza A/H7N9 virus infection-related human genes based on this rule from the shortest paths in a virus-human protein interaction network. Finally, we screened out the most significant 20 human genes, which could be the potential infection related genes, providing guidelines for further experimental validation. Analysis of the 20 genes showed that they were enriched in protein binding, saccharide or polysaccharide metabolism related pathways and oxidative phosphorylation pathways. We also compared the results with those from human rhinovirus (HRV) and respiratory syncytial virus (RSV) by the same method. It was indicated that saccharide or polysaccharide metabolism related pathways might be especially associated with the H7N9 infection. These results could shed some light on the understanding of the virus infection mechanism, providing basis for future experimental biology studies and for the development of effective strategies for H7N9 clinical therapies.Entities:
Mesh:
Year: 2014 PMID: 24955349 PMCID: PMC4052153 DOI: 10.1155/2014/239462
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Figure 1The flowchart of the method developed in this study to identify the Influenza A/H7N9 infection-related human genes. Target human proteins interacting with the Influenza A/H7N9 virus were obtained based on sharing GO terms. Shortest path proteins were calculated from the shortest paths between every pair of the target human proteins, by searching by the Dijkstra algorithm in the network constructed from STRING. Finally 20 shortest path proteins were screened out with betweenness >10,000, the related genes of which were considered as infection-related human genes.
Number of proteins/genes in our datasets of the three species of viruses: Influenza A/H7N9 virus, human rhinovirus (HRV), and respiratory syncytial virus (RSV).
| Virus | Virus proteins (sequence identity <40%) | Virus-human protein pairs | Target human proteins (coding genes) | Shortest path proteins | Betweenness threshold | Potential infection-related proteins (coding genes) |
|---|---|---|---|---|---|---|
| H7N9 | 11 | 9,313 | 3,212 (1,023) | 962 | >10,000 | 20 (20) |
| HRV | 4 | 20,955 | 6,985 (2,028) | 1,904 | >47,299 | 11 (11) |
| RSV | 22 | 40,499 | 36,273 (11,036) | 9,846 | >1,275,672 | 44 (44) |
Figure 2Statistical analysis of sharing GO terms between virus proteins and human proteins. All sharing GO terms and their descriptions between virus and human proteins were listed on the Y-axis. Histogram of sharing numbers showed the instances of each term used as a sharing term. The horizontal axis was truncated from 125 to 400.
The infection-related human proteins and their related coding genes for the three species of viruses calculated from shortest paths in a PPI network.
| Infected by virus | Protein | Gene | Chromosome | Betweenness |
|---|---|---|---|---|
|
| ENSP00000361626 | YBX1 | 1 | 58376 |
| ENSP00000363676 | RPL11 | 1 | 49155 | |
| ENSP00000396127 | RAN | 12 | 49036 | |
| ENSP00000362413 | PGK1 | X | 26743 | |
| ENSP00000229239 | GAPDH | 12 | 25345 | |
| ENSP00000294172 | NXF1 | 11 | 23550 | |
| ENSP00000352400 | NUP214 | 9 | 21849 | |
| ENSP00000280892 | EIF4E | 4 | 20883 | |
| ENSP00000379933 | TPI1 | 12 | 19217 | |
| ENSP00000348877 | GPI | 19 | 18885 | |
| ENSP00000317904 | GYS1 | 19 | 18349 | |
| ENSP00000350283 | BRCA1 | 17 | 16827 | |
| ENSP00000283195 | RANBP2 | 2 | 16823 | |
| ENSP00000400591 | SNRPE | 1 | 15796 | |
| ENSP00000265686 | TCIRG1 | 11 | 14465 | |
| ENSP00000358563 | DKC1 | X | 13535 | |
| ENSP00000234396 | ATP6V1B1 | 2 | 13432 | |
| ENSP00000218516 | GLA | X | 12471 | |
| ENSP00000262030 | ATP5B | 12 | 11891 | |
| ENSP00000260947 | BARD1 | 2 | 11297 | |
|
| ||||
| Human Rhinovirus (HRV) | ENSP00000344818 | UBC | 12 | 330154 |
| ENSP00000363676 | RPL11 | 1 | 154993 | |
| ENSP00000361626 | YBX1 | 1 | 136548 | |
| ENSP00000357879 | PSMD4 | 1 | 121991 | |
| ENSP00000337825 | LCK | 1 | 117195 | |
| ENSP00000396127 | RAN | 12 | 116059 | |
| ENSP00000348461 | RAC1 | 7 | 111632 | |
| ENSP00000230354 | TBP | 6 | 100485 | |
| ENSP00000350283 | BRCA1 | 17 | 65076 | |
| ENSP00000314949 | POLR2A | 17 | 54470 | |
| ENSP00000280892 | EIF4E | 4 | 50359 | |
|
| ||||
| Respiratory syncytial virus (RSV) | ENSP00000269305 | TP53 | 17 | 15809765 |
| ENSP00000344456 | CTNNB1 | 3 | 5756301 | |
| ENSP00000263253 | EP300 | 22 | 5694027 | |
| ENSP00000339007 | GRB2 | 17 | 5591895 | |
| ENSP00000275493 | EGFR | 7 | 5245421 | |
| ENSP00000270202 | AKT1 | 14 | 4663263 | |
| ENSP00000264657 | STAT3 | 17 | 4180564 | |
| ENSP00000350941 | SRC | 20 | 3180369 | |
| ENSP00000348461 | RAC1 | 7 | 3066312 | |
| ENSP00000221494 | SF3A2 | 19 | 2994393 | |
| ENSP00000417281 | MDM2 | 12 | 2686189 | |
| ENSP00000338345 | SNCA | 4 | 2647616 | |
| ENSP00000206249 | ESR1 | 6 | 2643164 | |
| ENSP00000296271 | RHO | 3 | 2573058 | |
| ENSP00000329623 | BCL2 | 18 | 2541856 | |
| ENSP00000376609 | GRK5 | 10 | 2364221 | |
| ENSP00000337825 | LCK | 1 | 2306232 | |
| ENSP00000314458 | CDC42 | 1 | 2174421 | |
| ENSP00000262613 | SLC9A3R1 | 17 | 2097178 | |
| ENSP00000355865 | PARK2 | 6 | 2033100 | |
| ENSP00000264033 | CBL | 11 | 1930392 | |
| ENSP00000269571 | ERBB2 | 17 | 1922027 | |
| ENSP00000338018 | HIF1A | 14 | 1915325 | |
| ENSP00000324806 | GSK3B | 3 | 1910676 | |
| ENSP00000215832 | MAPK1 | 22 | 1831541 | |
| ENSP00000358490 | CD2 | 1 | 1751073 | |
| ENSP00000262160 | SMAD2 | 18 | 1727787 | |
| ENSP00000304903 | CD2BP2 | 16 | 1714523 | |
| ENSP00000362649 | HDAC1 | 1 | 1703720 | |
| ENSP00000353483 | MAPK8 | 10 | 1702626 | |
| ENSP00000261799 | PDGFRB | 5 | 1679113 | |
| ENSP00000003084 | CFTR | 7 | 1662248 | |
| ENSP00000401303 | SHC1 | 1 | 1548773 | |
| ENSP00000321656 | CDC25C | 5 | 1521621 | |
| ENSP00000357656 | FYN | 6 | 1503978 | |
| ENSP00000326366 | PSEN1 | 14 | 1498004 | |
| ENSP00000230354 | TBP | 6 | 1458835 | |
| ENSP00000300093 | PLK1 | 16 | 1444680 | |
| ENSP00000350283 | BRCA1 | 17 | 1389799 | |
| ENSP00000228307 | PXN | 12 | 1358706 | |
| ENSP00000329357 | SP1 | 12 | 1347630 | |
| ENSP00000361626 | YBX1 | 1 | 1342956 | |
| ENSP00000387662 | GCG | 2 | 1321174 | |
| ENSP00000367207 | MYC | 8 | 1284185 | |
Figure 3The potential virus infection-related human genes found based on our method for the three species of viruses. 20, 11, and 44 potential infection-related human genes were found for H7N9, HRV, and RSV, respectively. There were 5 sharing genes between those for H7N9 and HRV, and 2 sharing genes between H7N9 and RSV. Other human genes related were not all the same, indicating specific properties or particular characteristics between the infections of the three species of viruses.
GO term enrichment analysis of the 20 potential H7N9 infection-related human proteins (data not shown of GO terms with number of related proteins below 3) and comparisons with HRV and RSV for these terms.
| GO terms | H7N9 | HRV | RSV | |||
|---|---|---|---|---|---|---|
| Number of proteins | Percentage accounting for the 20 proteins (%) | Number of proteins* | Percentage accounting for the 11 proteins (%)* | Number of proteins* | Percentage accounting for the 44 proteins (%)* | |
| GO:0005515 protein binding | 15 | 75.00 | 11 | 100.00 | 42 | 95.45 |
| GO:0005829 cytosol | 13 | 65.00 | 7 | 63.64 | 23 | 52.27 |
| GO:0005737 cytoplasm | 11 | 55.00 | 6 | 54.55 | 30 | 68.18 |
| GO:0005634 nucleus | 9 | 45.00 | 4 | 36.36 | 33 | 75.00 |
| GO:0044281 small molecule metabolic process | 9 | 45.00 | 1 | 9.09 | 2 | 4.55 |
| GO:0003723 RNA binding | 8 | 40.00 | 4 | 36.36 | 2 | 4.55 |
| GO:0005975 carbohydrate metabolic process | 8 | 40.00 | — | — | — | — |
| GO:0005654 nucleoplasm | 7 | 35.00 | 7 | 63.64 | 19 | 43.18 |
| GO:0010467 gene expression | 7 | 35.00 | 8 | 72.73 | 6 | 13.64 |
| GO:0006006 glucose metabolic process | 5 | 25.00 | — | — | 2 | 4.55 |
| GO:0005886 plasma membrane | 4 | 20.00 | 4 | 36.36 | 27 | 61.36 |
| GO:0005622 intracellular | 4 | 20.00 | 3 | 27.27 | 7 | 15.91 |
| GO:0005643 nuclear pore | 4 | 20.00 | 1 | 9.09 | — | — |
| GO:0016032 viral reproduction | 4 | 20.00 | 8 | 72.73 | 4 | 9.09 |
| GO:0016070 RNA metabolic process | 4 | 20.00 | 4 | 36.36 | 1 | 2.27 |
| GO:0006094 gluconeogenesis | 4 | 20.00 | — | — | — | — |
| GO:0006096 glycolysis | 4 | 20.00 | — | — | — | — |
| GO:0055085 transmembrane transport | 4 | 20.00 | — | — | 1 | 2.27 |
| GO:0005625 soluble fraction | 3 | 15.00 | — | — | 5 | 11.36 |
| GO:0008270 zinc ion binding | 3 | 15.00 | 2 | 18.18 | 11 | 25.00 |
| GO:0016071 mRNA metabolic process | 3 | 15.00 | 4 | 36.36 | 1 | 2.27 |
| GO:0006606 protein import into nucleus | 3 | 15.00 | 1 | 9.09 | 1 | 2.27 |
| GO:0005524 ATP binding | 3 | 15.00 | 1 | 9.09 | 14 | 31.82 |
| GO:0006406 mRNA export from nucleus | 3 | 15.00 | 1 | 9.09 | — | — |
| GO:0008286 insulin receptor signaling pathway | 3 | 15.00 | 1 | 9.09 | 4 | 9.09 |
| GO:0005215 transporter activity | 3 | 15.00 | — | — | — | — |
| GO:0015991 ATP hydrolysis coupled proton transport | 3 | 15.00 | — | — | — | — |
| GO:0015992 proton transport | 3 | 15.00 | — | — | — | — |
| GO:0019221 cytokine-mediated signaling pathway | 3 | 15.00 | 2 | 18.18 | 1 | 2.27 |
*—: no proteins having the GO term was picked out as potential infection-related proteins for the virus.
KEGG pathway enrichment analysis of the 20 potential H7N9 infection-related human genes.
| Terms | Genes | Number of genes belonging to the pathway | Percentage accounting for the 20 genes (%) | Adjusted
|
|---|---|---|---|---|
| Glycolysis/Gluconeogenesis | TPI1, GPI, GAPDH, and PGK1 | 4 | 20.00 | 7.0 |
| Oxidative phosphorylation | ATP5B, ATP6V1B1, and TCIRG1 | 3 | 15.00 | 3.3 |
| Starch and sucrose metabolism | GPI, GYS1 | 2 | 10.00 | 5.2 |