| Literature DB >> 19478882 |
Frederic D Bushman1, Nirav Malani, Jason Fernandes, Iván D'Orso, Gerard Cagney, Tracy L Diamond, Honglin Zhou, Daria J Hazuda, Amy S Espeseth, Renate König, Sourav Bandyopadhyay, Trey Ideker, Stephen P Goff, Nevan J Krogan, Alan D Frankel, John A T Young, Sumit K Chanda.
Abstract
We have analyzed host cell genes linked to HIV replication that were identified in nine genome-wide studies, including three independent siRNA screens. Overlaps among the siRNA screens were very modest (<7% for any pairwise combination), and similarly, only modest overlaps were seen in pairwise comparisons with other types of genome-wide studies. Combining all genes from the genome-wide studies together with genes reported in the literature to affect HIV yields 2,410 protein-coding genes, or fully 9.5% of all human genes (though of course some of these are false positive calls). Here we report an "encyclopedia" of all overlaps between studies (available at http://www.hostpathogen.org), which yielded a more extensively corroborated set of host factors assisting HIV replication. We used these genes to calculate refined networks that specify cellular subsystems recruited by HIV to assist in replication, and present additional analysis specifying host cell genes that are attractive as potential therapeutic targets.Entities:
Mesh:
Year: 2009 PMID: 19478882 PMCID: PMC2682202 DOI: 10.1371/journal.ppat.1000437
Source DB: PubMed Journal: PLoS Pathog ISSN: 1553-7366 Impact factor: 6.823
Gene Sets Analyzed.
| List | Name | Number of Genes | Description | Reference |
| 1 | siRNA HIV König | 293 | siRNA screen for host factors promoting HIV replication |
|
| 2 | siRNA HIV Brass | 283 | siRNA screen for host factors promoting HIV replication |
|
| 3 | siRNA HIV Zhou | 303 | siRNA screen for host factors promoting HIV replication |
|
| 4 | SNP HIV Fellay | 63 | GWA study of HIV set point in infected individuals |
|
| 5 | Particle Associated HIV | 248 | Proteins in HIV particles identified by mass spec |
|
| 6 | HARC Nef | 6 | Cellular proteins that interact with HIV Nef protein (mass spec) | This work |
| 7 | HARC Tat | 69 | Cellular proteins that interact with HIV Tat protein (mass spec) | This work |
| 8 | HARC Rev | 56 | Cellular proteins that interact with HIV Rev protein (mass spec) | This work |
| 9 | BIND INT HIV | 23 | Cellular proteins that interact with HIV IN protein (yeast two hybrid) |
|
| 10 | NCBI Interactions | 1434 | Published interactions between an HIV protein and a cellular protien |
|
| 11 | siRNA Flu Fly | 98 | Human homologs of fly gene products important for influenza virus infection |
|
| 12 | siRNA WNV | 305 | Gene products important for West Nile virus infection |
|
Figure 1Overlap Analysis to Generate an Empirical p-Value.
In each panel the downward arrow indicates the number of genes overlapping between the experimental data sets, and the bars show the frequencies of overlaps in comparisons to random distributions. (A) Simulation of overlap between all of the genes called in genome-wide screens (lists 1–9) and the NCBI database of factors reported to be involved in HIV replication (list 10). One thousand pairs of gene sets were drawn randomly from the set of all human genes, 1,254 to simulate the set of all genes from genome-side screens and 1,434 to simulate NCBI genes, and the overlap in each pair plotted. The experimental overlap was 257 genes. (B) Simulation of overlapping genes between the König et al. and Zhou et al. siRNA screens. The experimental overlap was nine genes. The p-value calculated using the hypergeometric distribution was slightly lower (p = 0.014). (C) Simulation of expected overlap between screens given the measured error between replicates. The standard deviation of infectivity measurements were calculated from the König et al. siRNA screens, and then simulated datasets were generated containing the measured error. For simulations, either two replicates (pink) or ten replicates (yellow) were generated and the overlap quantified. The y-axis: number of top-scoring genes considered in overlap analysis; x-axis: actual number of overlapping genes seen comparing simulated data sets. (D) Choices for toxicity threshold strongly influence the recovery of genes affecting HIV infection. The genes tested in the König et al. siRNA screen were ranked according to toxicity of knockdown, then sets containing 100% of genes, the least toxic 50%, or the least toxic 20% were extracted (top). From each of these, the 300 genes that when knocked down showed the strongest reduction in HIV infection were then selected, and the overlap between gene sets calculated (bottom).
Statistical Analysis of Genes in Common between All Pairs of Genome-Wide Studies.
| Table Name (Size) | siRNA HIV König (293) | siRNA HIV Brass (283) | siRNA HIV Zhou (303) | siRNA HIV Fellay (63) | Particle Associated HIV (248) | HARC Nef (6) | HARC Tat (69) | HARC Rev (56) | BIND INT HIV (23) | NCBI Interactions (1,434) | siRNA Flu Fly (98) |
| siRNA HIV König (293) | |||||||||||
| siRNA HIV Brass (283) | <0.001 (13) | ||||||||||
| siRNA HIV Zhou (303) | 0.024 (9) | <0.001 (18) | |||||||||
| siRNA HIV Fellay (63) | 1 (0) | 0.511 (1) | 0.541 (1) | ||||||||
| Particle Associated HIV (248) | 0.154 (5) | 0.035 (6) | 0.07 (6) | 0.108 (2) | |||||||
| HARC Nef (6) | 1 (0) | 1 (0) | 1 (0) | 1 (0) | <0.001 (2) | ||||||
| HARC Tat (69) | 1 (0) | 0.004 (4) | 0.052 (3) | 1 (0) | 0.027 (3) | 1 (0) | |||||
| HARC Rev (56) | 0.125 (2) | 0.44 (1) | 0.469 (1) | 1 (0) | <0.001 (10) | 1 (0) | 1 (0) | ||||
| BIND INT HIV (23) | <0.001 (3) | 1 (0) | 0.232 (1) | 1 (0) | 0.191 (1) | 1 (0) | 1 (0) | 0.07 (1) | |||
| NCBI Interactions (1,434) | <0.001 (53) | <0.001 (39) | <0.001 (40) | 0.234 (5) | <0.001 (94) | 1 (0) | <0.001 (21) | <0.001 (21) | 0.009 (5) | ||
| siRNA Flu Fly (98) | <0.001 (13) | 0.125 (3) | 0.738 (1) | 1 (0) | <0.001 (9) | 1 (0) | 1 (0) | 0.002 (3) | 1 (0) | <0.001 (20) | |
| siRNA WNV (305) | 0.02 (8) | 0.004 (9) | 0.693 (3) | 0.14 (2) | 0.013 (8) | 1 (0) | 0.061 (3) | 0.481 (1) | 1 (0) | 0.006 (29) | 0.337 (2) |
Each entry in the table shows the p-values (determined by comparison to random simulation) and the number of overlapping genes in parenthesis. Set names are as in Table 1.
Genes Called in at Least Two siRNA Screens.
| Symbol | Frequency (siRNA) | siRNA HIV König | siRNA HIV Brass | siRNA HIV Zhou | SNP HIV Fellay | Particle Associated HIV | HARC Nef | HARC Tat | HARC Rev | BIND INT HIV | NCBI Interactions | siRNA Flu Fly | siRNA WNV | Druggable? | Exp in CD4 | Description |
| ADRBK1 | 2 | * | * | yes | yes | Adrenergic, beta, receptor kinase 1 | ||||||||||
| AKT1 | 2 | * | * | * | yes | yes | v-akt murine thymoma viral oncogene homolog 1 | |||||||||
| CAV2 | 2 | * | * | yes | Caveolin 2 | |||||||||||
| CCNT1 | 2 | * | * | * | * | Cyclin T1 | ||||||||||
| CD4 | 2 | * | * | * | yes | CD4 molecule | ||||||||||
| DDX3X | 2 | * | * | * | * | * | yes | DEAD (Asp-Glu-Ala-Asp) box polypeptide 3, X-linked | ||||||||
| DMXL1 | 2 | * | * | yes | Dmx-like 1 | |||||||||||
| IDH1 | 2 | * | * | yes | Isocitrate dehydrogenase 1 (NADP+), soluble | |||||||||||
| JAK1 | 2 | * | * | * | yes | yes | Janus kinase 1 (a protein tyrosine kinase) | |||||||||
| MAP4 | 2 | * | * | yes | Microtubule-associated protein 4 | |||||||||||
| MRE11A | 2 | * | * | yes | MRE11 meiotic recombination 11 homolog A (S. cerevisiae) | |||||||||||
| RANBP2 | 2 | * | * | yes | yes | RAN binding protein 2 | ||||||||||
| TCEB3 | 2 | * | * | * | yes | Transcription elongation factor B (110 kDa, elongin A) | ||||||||||
| CXCR4 | 2 | * | * | * | yes | yes | Chemokine (C-X-C motif) receptor 4 | |||||||||
| CHST1 | 2 | * | * | * | yes | Carbohydrate (keratan sulfate Gal-6) sulfotransferase 1 | ||||||||||
| CTDP1 | 2 | * | * | * | yes | CTD (carboxy-terminal domain, RNA polymerase II | ||||||||||
| MED14 | 2 | * | * | yes | Mediator complex subunit 14 | |||||||||||
| RAB28 | 2 | * | * | yes | RAB28, member RAS oncogene family | |||||||||||
| NUP153 | 2 | * | * | * | * | yes | yes | Nucleoporin 153 kDa | ||||||||
| TNPO3 | 2 | * | * | yes | Transportin 3 | |||||||||||
| MED4 | 2 | * | * | yes | Mediator complex subunit 4 | |||||||||||
| ANAPC2 | 2 | * | * | yes | Anaphase promoting complex subunit 2 | |||||||||||
| MID1IP1 | 2 | * | * | * | yes | MID1 interacting protein 1 (gastrulation specific G12 homolog) | ||||||||||
| WNK1 | 2 | * | * | yes | WNK lysine deficient protein kinase 1 | |||||||||||
| RNF26 | 2 | * | * | Ring finger protein 26 | ||||||||||||
| MED28 | 2 | * | * | yes | Mediator complex subunit 28 | |||||||||||
| TRIM55 | 2 | * | * | Tripartite motif-containing 55 | ||||||||||||
| ANKRD30A | 2 | * | * | yes | Ankyrin repeat domain 30A | |||||||||||
| MED19 | 2 | * | * | yes | Mediator complex subunit 19 | |||||||||||
| HMCN2 | 2 | * | * | Hemicentin 2 | ||||||||||||
| RGPD8 | 2 | * | * | yes | RANBP2-like and GRIP domain containing 8 | |||||||||||
| RELA | 3 | * | * | * | * | yes | yes | v-rel reticuloendotheliosis viral oncogene homolog A | ||||||||
| MED7 | 3 | * | * | * | yes | Mediator complex subunit 7 | ||||||||||
| MED6 | 3 | * | * | * | yes | Mediator complex subunit 6 |
The asterisks indicate positive calls. Set names are as in Table 1. “Druggable” is as determined in Hopkins and Groom and Orth et al. [56],[57].
Figure 2Gene Ontology Analysis of GO Groups Enriched among Genes Called in Two or More siRNA Screens.
The color code indicates the number of genes in each functional group from each screen derived using DAVID Functional Annotation Clustering. Annotations for each function group were based on the assessment of GO categories that comprised each group, which can be found in Table S1.
Figure 3Gene Clusters, Generated Using PPI and MCODE Analysis, Derived from the Full Set of Genes Implicated in HIV Infection (Lists 1–10).
The size of each node is proportional to the number of screens in which the host cell gene was called. Gene identifiers are in Table S1. Diamonds indicate genes from the NCBI HIV interactions database. Color code: red = König et al., green = Brass et al., blue = Zhou et al., cyan = Fellay et al., magenta = Frankel interaction screens (unpublished data), yellow = HIV particle associated, and grey = Studamire and Goff integrase interactions. For genes that were called in multiple screens (larger symbols), a color was chosen arbitrarily from among the screens positive for that gene. Default parameters were used, specifically— Degree Cutoff: 2. Node Score Cutoff: 0.0. Haircut: true. Fluff: false. K-Core: 2. Maximum Depth From Seed: 100. (A) Proteasome; (B) Transcription/RNA Pol; (C) Mediator Complex; (D) Tat activation/Transcriptional elongation; (E) Immune response; (F) RNA Binding/Splicing; (G) H5PA5/BiP Chaperone; (H) CCT Chaperone; (I) t-RNA Synthase; (J) Transport; (K) Unknown.