| Literature DB >> 19627600 |
William Dampier1, Perry Evans, Lyle Ungar, Aydin Tozeren.
Abstract
BACKGROUND: The HIV viral genome mutates at a high rate and poses a significant long term health risk even in the presence of combination antiretroviral therapy. Current methods for predicting a patient's response to therapy rely on site-directed mutagenesis experiments and in vitro resistance assays. In this bioinformatics study we treat response to antiretroviral therapy as a two-body problem: response to therapy is considered to be a function of both the host and pathogen proteomes. We set out to identify potential responders based on the presence or absence of host protein and DNA motifs on the HIV proteome.Entities:
Year: 2009 PMID: 19627600 PMCID: PMC2723131 DOI: 10.1186/1755-8794-2-47
Source DB: PubMed Journal: BMC Med Genomics ISSN: 1755-8794 Impact factor: 3.063
Therapy Classification
| R | NR | Mean AUC | R | NR | Mean AUC | R | NR | Mean AUC | |
| AZT | 526 | 390 | 0.7750 | 581 | 335 | 0.8550 | 395 | 521 | 0.7802 |
| AZT, IDV | 182 | 148 | 0.7803 | 189 | 141 | 0.9281 | 144 | 186 | 0.9107 |
| DDI | 466 | 273 | 0.7572 | 503 | 236 | 0.8363 | 272 | 467 | 0.7648 |
| DDI, NFV | 249 | 130 | 0.7352 | 264 | 115 | 0.8004 | 175 | 204 | 0.6814 |
| D4T | 450 | 307 | 0.7654 | 482 | 275 | 0.8081 | 274 | 483 | 0.7683 |
| D4T, NFV | 266 | 153 | 0.7499 | 280 | 139 | 0.6664 | 181 | 238 | 0.6713 |
| D4T, NFV | 372 | 200 | 0.7377 | 391 | 181 | 0.8455 | 260 | 312 | 0.7613 |
| D4T, DDI, NFV | 234 | 115 | 0.7518 | 242 | 107 | 0.7764 | 173 | 176 | 0.6817 |
| 3TC | 582 | 466 | 0.7721 | 654 | 394 | 0.9280 | 408 | 640 | 0.7788 |
| 3TC, IDV | 187 | 151 | 0.7748 | 196 | 142 | 0.9030 | 144 | 194 | 0.8763 |
| 3TC, NFV | 202 | 159 | 0.7535 | 242 | 119 | 0.8810 | 175 | 186 | 0.8606 |
| 3TC, AZT | 509 | 379 | 0.7731 | 560 | 328 | 0.8439 | 391 | 497 | 0.7845 |
| 3TC, AZT, IDV | 177 | 145 | 0.7849 | 184 | 138 | 0.8858 | 144 | 178 | 0.8815 |
| DDI, EFV | 248 | 121 | 0.7389 | 208 | 89 | 0.9312 | 192 | 177 | 0.6711 |
| D4T, EFV | 260 | 125 | 0.7406 | 285 | 100 | 0.8479 | 194 | 191 | 0.9887 |
| D4T, DDI, EFV | 233 | 107 | 0.7516 | 254 | 86 | 0.9446 | 188 | 152 | 0.7499 |
| 3TC, EFV | 207 | 130 | 0.7313 | 245 | 100 | 0.9731 | 179 | 166 | 0.9497 |
| All Therapies | 1115 | 904 | 0.7644 | 1188 | 831 | 0.8351 | 700 | 1319 | 0.8402 |
The overall statistics of the clinically annotated reverse transcriptase sequences from the Stanford HIV-1 Drug Resistance Database. The table shows breakdown of patients in each therapy regimen using the three different classification rules: Standard Datenum (SD), Incremental Reduction (IR), and Bimodal Classification (BM). R; responders, NR; non responders. The average AUC over 500 training/testing iterations indicate the success in differentiating responders from non responders using short linear sequence motifs as features in machine learning.
Figure 1Responder Classifications. A graphical representation of the three phenotype classification methods: Standard Datenum (SD), Incremental Reduction (IR) and Bimodal classification (BM). Figure 1A: SD, A histogram showing the log10 change in viral load of all patients in the database. Patients labelled as "responders" are marked in pink and non-responders in "blue". Figure 1B: IR, Three scatter plots representing the viral load vs. CD4 counts for all patients in the database after 8, 12, and 24 weeks of therapy. Patients which decreased in viral load in 75% of their visits are labelled as "responders" and marked in pink; those that did not are labelled as "non-responders" and marked in blue. Figure 1C: BM, A histogram of the change in viral load after 24 weeks of therapy. Those patients that decreased by more than 2000 copies/ml were labelled as "responders" and are marked in pink; those that did not were labelled as "non-responders" and are marked in blue.
Figure 2Venn Diagram. Venn diagram showing the intersection between responder sets corresponding to SD, IR, and BM classification.
Figure 3Feature Annotation. Annotation of a short linear motifs (Eukareotytic Linear Motifs, miRNAs binding sites, human transcription factor binding sites) along the viral sequence for 100 subtype C and 500 subtype B sequences. The colour code is as follows: homology Islands (green), human miRNA binding-sites (blue), human TF sites (silver), cleavage ELMs (red), ligation ELMs (purple), modification ELMs (brown), and export ELMs (pink). The clinically annotated sequence region is shown in the black box.
Figure 4ROC Curves. Receiver Operator Characteristic (ROC) curves determined by the stepwise-logistic regression (SWLR) for the therapy regimens presented in Table 1 using the IR classification. The BOLD blue shows the average ROC curve over 500 iterations. The solid black line indicates the prediction ability with 20% shuffling of the responder v non-responder categories. The dashed line indicates the corresponding averages of completely shuffled responder vs. non-responder categories.
Figure 5SWLR Feature Regression Coefficients. Heatmaps indicating the average of the SWLR regression coefficient for the motifs used in the classification. Blue colour in the ruler bar indicates that presence of an ELM motif creates greater likelihood of being in the responder category (R ELM) whereas red indicates greater likelihood of being in the non-responder category (NR ELM). Top Panel: SD; Middle Panel: IR, Bottom Panel: BM.
Interacting Proteins
| 59 | ACTA2 | actin, alpha 2, smooth muscle, aorta | The localization of the HIV-1 reverse transcription complex to actin microfilaments is mediated by the interaction of a reverse transcription complex component (HIV-1 Matrix) with actin, but not vimentin (intermediate filaments) or tubulin (microtubules) |
| 60 | ACTB | Actin, beta | Eukaryotic beta-actin binds to either the large subunit (p66) of HIV-1 reverse transcriptase or to the HIV-1 Pol precursor polyprotein in vitro; this interaction is believed to be important for the secretion of HIV-1 virions |
| 70 | ACTC1 | actin, alpha, cardiac muscle 1 | The localization of the HIV-1 reverse transcription complex to actin microfilaments is mediated by the interaction of a reverse transcription complex component (HIV-1 Matrix) with actin, but not vimentin (intermediate filaments) or tubulin (microtubules) |
| 1457 | CSNK2A1 | casein kinase 2, alpha 1 | Casein kinase II phosphorylates HIV-1 RT p66 and p51 in human cells |
| 3439, 3440, 3449 | IFNA1, IFNA2, IFNA16 | IFN-alpha interferes with the initiation of HIV-1 reverse transcription resulting in a significant reduction in the relative levels of HIV-1 proviral DNA | |
| 3458 | IFNG | Interferon, gamma | Up-regulation of LMP7 by IFN-gamma enhances proteasomal degradation of HIV-1 RT and presentation of the VIYQYMDDL epitope derived from HIV-1 RT |
| 4772, 4773 | NFACT1, NFACT2 | nuclear factor of activated T-cells | NFATc facilitates HIV-1 RT reverse transcription activity and enhances HIV-1 infectivity in human T cells |
| 5286 | PIK3C2A | phosphoinositide-3-kinase, class 2, alpha polypeptide | HIV-1 RT heterodimer expressed in bacteria can be phosphorylated in vitro by several purified mammalian protein kinases including auto-activated protein kinase (PK), CKII, cytosolic protamine kinase (CPK), myelin basic protein kinase 1 (MBPK1), and PRKC |
| 5578, 5579, 5580, 5581, 5584, 5588, 5590 | PRKCA, PRKCB1, | HIV-1 RT heterodimer expressed in bacteria can be phosphorylated in vitro by several purified mammalian protein kinases including auto-activated protein kinase (PK), CKII, cytosolic protamine kinase (CPK), myelin basic protein kinase 1 (MBPK1), and PRKC | |
| 5594, 5604, 6300 | MAPK1, MAP2K1, MAPK12 | mitogen-activated protein kinase 1 | MEK1 in HIV-1 producer cells is able to activate virion-associated MAPK in trans, and the activated MAPK facilitates efficient disengagement of the HIV-1 reverse transcription complex from the cell membrane and subsequent nuclear translocation |
| 5696 | PSMB8 | proteasome subunit, beta type, 8 | Up-regulation of LMP7 by IFN-gamma enhances proteasomal degradation of HIV-1 RT and presentation of the VIYQYMDDL epitope derived from HIV-1 RT |
| 6117, 6118, 6119 | RPA1, RPA2, RPA3 | Replication protein A and HIV-1 nucleocapsid protein interfere with the strand displacement DNA synthesis of HIV-1 reverse transcriptase by binding to the displaced strand and keeping it away from the newly synthesized strand | |
| 7150 | TOP1 | topoisomerase (DNA) I | Topoisomerase I (topo I) enhances HIV-1 reverse transcriptase activity in vitro and this effect can be inhibited by the topo I-specific inhibitor camptothecin |
| 7157 | TP53 | tumor protein p53 (Li-Fraumeni syndrome) | Tumor suppressor protein p53 displays 3' -> 5' exonuclease activity, and interaction of p53 with HIV-1 reverse transcriptase (RT) can provide a proofreading function for HIV-1 RT |
| 10527 | IPO7 | importin 7 | Importin 7, an import receptor for ribosomal proteins and histone H1, is involved in the active nuclear import of the intracellular HIV-1 reverse transcription complex (RTC) containing HIV-1 RT, IN, NC, MA, and Vpr |
| 29935 | RPA4 | replication protein A4, 34 kDa | Replication protein A and HIV-1 nucleocapsid protein interfere with the strand displacement DNA synthesis of HIV-1 reverse transcriptase by binding to the displaced strand and keeping it away from the newly synthesized strand |
| 50810 | hepatoma-derived growth factor, related protein 3 | Hepatoma-derived growth factor 2 (HRP2) restores salt-stripped HIV-1 preintegration complex (PIC) activity in vitro | |
| 60489 | apolipoprotein B mRNA editing enzyme, catalytic polypeptide-like 3G | Vif-negative HIV-1 produced from 293T cells transiently expressing hA3G are impaired in early and late viral DNA production, and in viral infectivity, which are correlated with an inability of tRNA(3)(Lys) to prime reverse transcription | |
A table of human proteins from the NIAID HIV-1 interaction database which are known to interact with HIV-1 RT and expressing the drug response predicting ELMs
Biological Context
| GO BP Level 5 | GO:0016310~phosphorylation | 13 | 39.39% | 5.21E-8 |
| GO BP Level 5 | GO:0008219~cell death | 10 | 30.30% | 7.86E-5 |
| GO BP Level 5 | GO:0006260~DNA replication | 6 | 18.18% | 9.30E-5 |
| GO BP Level 5 | GO:0006915~apoptosis | 9 | 27.27% | 3.16E-4 |
| GO BP Level 5 | GO:0006935~chemotaxis | 5 | 15.15% | 4.02E-4 |
| GO MF Level 5 | GO:0004697~protein kinase C activity | 7 | 21.21% | 1.22E-12 |
| GO MF Level 5 | GO:0004672~protein kinase activity | 11 | 33.33% | 1.75E-6 |
| GO MF Level 5 | GO:0032559~adenyl ribonucleotide binding | 15 | 45.45% | 1.97E-6 |
| GO MF Level 5 | GO:0003697~single-stranded DNA binding | 5 | 15.15% | 6.65E-6 |
| GO MF Level 5 | GO:0004707~MAP kinase activity | 2 | 6.06% | 0.0395 |
| KEGG PATHWAY | hsa04650:Natural killer cell mediated cytotoxicity | 10 | 30.30% | 8.77E-9 |
| KEGG PATHWAY | hsa04664:Fc epsilon RI signaling pathway | 8 | 24.24% | 1.30E-7 |
| KEGG PATHWAY | hsa04530:Tight junction | 9 | 27.27% | 3.36E-7 |
| KEGG PATHWAY | hsa04370:VEGF signaling pathway | 7 | 21.21% | 1.05E-6 |
| KEGG PATHWAY | hsa04912:GnRH signaling pathway | 6 | 18.18% | 1.24E-4 |
| KEGG PATHWAY | hsa05223:Non-small cell lung cancer | 5 | 15.15% | 1.34E-4 |
Gene Ontology categories (level 5) and KEGG pathways associated with the host proteins listed in Table 2. Count refers to the number of proteins from Table 2 which have the associated term. P-values were determined using the DAVID enrichment tool using the set of all human proteins with the ELMs in Table 2 as a background set.