| Literature DB >> 29044435 |
Patricia Rojas Sánchez1,2, Alberto Cobos3, Marisa Navaro4, José Tomas Ramos5, Israel Pagán3, África Holguín1.
Abstract
Determining the factors modulating the genetic diversity of HIV-1 populations is essential to understand viral evolution. This study analyzes the relative importance of clinical factors in the intrahost HIV-1 subtype B (HIV-1B) evolution and in the fixation of drug resistance mutations (DRM) during longitudinal pediatric HIV-1 infection. We recovered 162 partial HIV-1B pol sequences (from 3 to 24 per patient) from 24 perinatally infected patients from the Madrid Cohort of HIV-1 infected children and adolescents in a time interval ranging from 2.2 to 20.3 years. We applied machine learning classification methods to analyze the relative importance of 28 clinical/epidemiological/virological factors in the HIV-1B evolution to predict HIV-1B genetic diversity (d), nonsynonymous and synonymous mutations (dN, dS) and DRM presence. Most of the 24 HIV-1B infected pediatric patients were Spanish (91.7%), diagnosed before 2000 (83.3%), and all were antiretroviral therapy experienced. They had from 0.3 to 18.8 years of HIV-1 exposure at sampling time. Most sequences presented DRM. The best-predictor variables for HIV-1B evolutionary parameters were the age of HIV-1 diagnosis for d, the age at first antiretroviral treatment for dN and the year of HIV-1 diagnosis for ds. The year of infection (birth year) and year of sampling seemed to be relevant for fixation of both DRM at large and, considering drug families, to protease inhibitors (PI). This study identifies, for the first time using machine learning, the factors affecting more HIV-1B pol evolution and those affecting DRM fixation in HIV-1B infected pediatric patients.Entities:
Keywords: HIV-1; intrahost evolution; machine learning; pediatric patients
Mesh:
Year: 2017 PMID: 29044435 PMCID: PMC5647794 DOI: 10.1093/gbe/evx193
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
Main Features and Available HIV-1 Sequences from the Study Cohort
| Features | Patients |
|---|---|
| HIV-1 vertical transmission | 24 |
| HIV-1 subtype B | 24 |
| Gender | |
| Male | 15 |
| Female | 9 |
| Origin country | |
| Spain | 22 |
| Guatemala | 1 |
| Peru | 1 |
| Year of infection (birth year) | |
| 1984–1989 | 7 |
| 1990–1999 | 15 |
| 2000–2002 | 2 |
| Year of HIV-1 diagnosis | |
| 1984–1989 | 1 |
| 1990–1999 | 19 |
| 2000–2004 | 4 |
| Mean age in years (range) | |
| At HIV-1 diagnosisa§ | 1.7 (0.1–8.6) |
| At first ARTb | 3 (0.1–8.5) |
| At baseline sampled sequencec | 8.8 (0.3–18.8) |
| At last sampled sequenced | 16.8 (5–23.9) |
| Mean number of HIV-1 sequences per patient (range) | 7 (3–21) |
| Years between the first and the last sequence (range) | 7.7 (2.2–20.3) |
Unknown data in 6a, 1b, 5c, and 4d patients; §Thirteen (68%) children were diagnosed before the first year of life, three (15.8%) children were diagnosed between the first and the fourth year of life, and two children were diagnosed between the fourth and the ninth year of life. ART, antiretroviral treatment. HIV-1 sequences, partial pol including the complete PR and the first 334 amino acid residues of RT.
Clinical Features of the 24 Patients Included in This Study
| Features | Number of Patients | |
|---|---|---|
| At First Sampling Time | At Last Clinical Report | |
| Clinical follow-up | ||
| Pediatric Unit | 24 | 11 |
| Adult Unit | 0 | 9 |
| Lost to follow-up | 0 | 1 |
| Exitus | 0 | 1 |
| Unknown | 0 | 2 |
| ART status | ||
| Naive | 2 | 0 |
| Treated | 22 | 24 |
| Regimen switches, mean (range) | 3.4 (0–12) | 5.9 (2–12) |
| 1–2 | 9 | 1 |
| 3–6 | 10 | 14 |
| 7–12 | 2 | 6 |
| Unknown | 3 | 3 |
| Previous ARV drugs, mean (range) | 5.9 (0–18) | 9.7 (5–18) |
| <3 | 3 | 0 |
| 3–6 | 12 | 4 |
| 7–13 | 5 | 15 |
| >13 | 1 | 4 |
| Unknown | 3 | 1 |
| Number of DRM | ||
| Total | 126 | 147 |
| To NRTI | 71 | 79 |
| To NNRTI | 21 | 34 |
| To PI (major) | 34 | 34 |
| Viral load (HIV-1 RNA-copies/ml) | ||
| ≤50 | 2 | 4 |
| >50–500 | 2 | 2 |
| >500–1,000 | 2 | 2 |
| >1,000–10,000 | 7 | 8 |
| >10,000–100,000 | 9 | 6 |
| >100,000 | 2 | 1 |
| Unknown | 0 | 1 |
| CD4+ T rate | ||
| <25% | 12 | 16 |
| 25–50% | 10 | 7 |
| >50% | 0 | 0 |
| Unknown | 2 | 1 |
| CD4+ T cells/mm3 | ||
| <350 | 3 | 10 |
| 350–500 | 4 | 4 |
| 501–1,000 | 11 | 8 |
| 1,001–1,500 | 1 | 1 |
| >1,500 | 2 | 0 |
| Unknown | 3 | 1 |
| CD8+ T cells/mm3 | ||
| <350 | 1 | 1 |
| 350–500 | 0 | 0 |
| 501–1,000 | 3 | 5 |
| 1, 001–1,500 | 7 | 5 |
| >1,500 | 5 | 6 |
| Unknown | 8 | 7 |
| CD4/CD8 mean ratio (range) | 0.8 (0.1–3)a | 0.4 (0.06–0.9)b |
Unknown data in 8a and 7b patients. ART, antiretroviral treatment; ARV, antiretroviral; IQR, interquartile range; DRMs, drug resistance mutations to NRTI, NNRTI and PI (major) according to IAS2015 (Wensing etal., 2014).
. 1.—Correlations between patient HIV-1B exposure time and T-CD4+ and T-CD8+ rates (A), CD4/CD8 ratio (B), viral load or VL (C) and evolutionary parameters, namely HIV-1B genetic diversity (D), mean number of nonsynonymous (d) and synonymous (d) nucleotide substitutions (D). For each correlation the R2 and the P values are shown. Dashed line in panel C refers to undetectable viral load < 2.7 log (<500 HIV-1-RNA copies per milliliter of plasma). P values: 0.038 (d), 0.058 (d), 0.450 (d), 0.001 (%CD8), 0.006 (CD4/CD8), and 0.002 (VL).
. 2.—Predicted susceptibility according to the Stanford HIVdb Interpretation Algorithm among those virus carrying DRM to PI (n = 15), to NRTI (n = 19) or to NNRTI (n = 14) in the first available partial pol sequence (A) and to PI (n = 10), to NRTI (n = 18) or to NNRTI (n = 16) in last sequence (B) collected after a mean spanning time of 7.7 years (range 2.2–20.3 years). DRM, drug resistance mutations; NRTI, nucleoside reverse transcriptase inhibitors; NNRTI, non-NRTI; r, ritonavir used for boosting; ATV/r, boosted-atazanavir; DRV/r, boosted-darunavir; FPV/r, boosted-fosamprenavir; IDV/r, boosted-indinavir; LPV/r, boosted-lopinavir; NFV, nelfinavir; SQV/r, boosted-saquinavir; TPV/r, boosted-tipranavir; 3TC, lamivudine; ABC, abacavir; AZT, zidovudine; d4T, estavudine; ddI, didanosine; FTC, emtricitabine; TDF, tenofovir; EFV, efavirenz; ETR, etravirine; NVP, nevirapine; RPV, rilpivirine. The approval year for each drug in Spain was: 1988 (AZT), 1993 (ddI), 1996 (d4T, 3TC, IDV/r, SQV/r), 1998 (NVP, NFV), 1999 (ABC, EFV), 2001 (LPV/r), 2002 (TDF), 2003 (FTC), 2004 (ATV/r, FPV/r), 2005 (TPV/r), 2007 (DRV/r), and 2008 (ETR); data available at Agencia Española de Medicamentos y Productos Sanitarios (https://www.aemps.gob.es/).
Evolutionary Parameters of the Partial pol Sequences Obtained from 24 HIV-1B Population Under Study
| Code | HIV Exposure Time (years) | No Sequences | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Mean | SE | Mean | SE | Mean | SE | Mean | SE | |||
| P18 | 2.3 | 3 | 0.004 | 0.001 | 0.003 | 0.001 | 0.008 | 0.003 | 0.565 | 0.386 |
| P11 | 6 | 6 | 0.028 | 0.008 | 0.018 | 0.004 | 0.050 | 0.019 | 1.736 | 0.508 |
| P14 | 6.8 | 5 | 0.015 | 0.007 | 0.008 | 0.004 | 0.024 | 0.012 | 0.222 | 0.111 |
| P24 | 7.3 | 3 | 0.028 | 0.009 | 0.020 | 0.005 | 0.040 | 0.013 | 0.628 | 0.092 |
| P1 | 9 | 20 | 0.017 | 0.001 | 0.020 | 0.002 | 0.006 | 0.001 | 1.919 | 0.262 |
| P17 | 10.7 | 3 | 0.020 | 0.004 | 0.010 | 0.002 | 0.044 | 0.009 | 0.254 | 0.022 |
| P16 | 10.7 | 5 | 0.006 | 0.001 | 0.004 | 0.001 | 0.010 | 0.003 | 0.670 | 0.268 |
| P5 | 11.7 | 6 | 0.006 | 0.003 | 0.007 | 0.004 | 0.002 | 0.001 | 2.444 | 1.222 |
| P4 | 12.5 | 11 | 0.022 | 0.004 | 0.024 | 0.004 | 0.013 | 0.004 | 1.665 | 0.416 |
| P2 | 12.8 | 20 | 0.040 | 0.001 | 0.030 | 0.001 | 0.058 | 0.003 | 0.790 | 0.011 |
| P10 | 13 | 4 | 0.009 | 0.002 | 0.007 | 0.002 | 0.018 | 0.003 | 0.350 | 0.117 |
| P19 | 14.3 | 3 | 0.018 | 0.006 | 0.015 | 0.006 | 0.020 | 0.004 | 0.657 | 0.205 |
| P22 | 14.3 | 4 | 0.004 | 0.001 | 0.002 | 0.000 | 0.007 | 0.001 | 0.287 | 0.038 |
| P3 | 14.6 | 18 | 0.015 | 0.005 | 0.008 | 0.002 | 0.030 | 0.009 | 0.199 | 0.035 |
| P20 | 14.7 | 3 | 0.008 | 0.001 | 0.006 | 0.003 | 0.009 | 0.003 | 1.215 | 0.764 |
| P21 | 14.7 | 4 | 0.012 | 0.005 | 0.014 | 0.005 | 0.005 | 0.002 | 1.850 | 0.513 |
| P9 | 15.6 | 5 | 0.029 | 0.001 | 0.026 | 0.001 | 0.031 | 0.002 | 1.004 | 0.058 |
| P23 | 15.7 | 3 | 0.007 | 0.002 | 0.004 | 0.001 | 0.011 | 0.004 | 0.235 | 0.074 |
| P12 | 15.9 | 9 | 0.003 | 0.000 | 0.001 | 0.000 | 0.006 | 0.001 | 0.152 | 0.054 |
| P13 | 16 | 5 | 0.037 | 0.011 | 0.027 | 0.006 | 0.053 | 0.020 | 2.220 | 0.818 |
| P12 | 16 | 5 | 0.026 | 0.001 | 0.032 | 0.001 | 0.008 | 0.002 | 1.724 | 0.145 |
| P8 | 18.6 | 5 | 0.007 | 0.000 | 0.006 | 0.000 | 0.007 | 0.000 | 0.511 | 0.043 |
| P15 | 19.8 | 5 | 0.020 | 0.002 | 0.014 | 0.001 | 0.035 | 0.005 | 0.359 | 0.039 |
| P7 | 25.6 | 7 | 0.008 | 0.002 | 0.009 | 0.002 | 0.007 | 0.001 | 1.185 | 0.168 |
| Total/Average | 12 | 162 | 0.004 | 0.001 | 0.003 | 0.001 | 0.008 | 0.003 | 0.565 | 0.386 |
Patients ordered according to HIV exposure time. d, genetic diversity; dN, frequency of nonsynonymous mutations; dS, frequency of synonymous mutations; dN/dS, selection pressure; SE, standard error.
Consensus Best-Predictor Variables for Each Evolutionary Parameter and for Drug Resistance Mutations Presence in the Study Cohort
| Evolutionary parameters | ARV resistance | ||||||
|---|---|---|---|---|---|---|---|
| Major DRMs to PI | DRMs to NRTI | DRMs to NNRTI | DRMs presence | ||||
| Consensusbest-predictor variables | Age of HIV-1 diagnosis | Age at first ART | Year of infection | Year of infection | Year of infection | NNRTI experience | Sampling year |
| Year of infection | NNRTI experience | Year of HIV-1 diagnosis | Patient’s origin | Coinfection with HCV or HBV | No. of ARVs | No. of previous ART regimen switches | |
| No. of previous ART regimen switches | No. of ARVs | Coinfection with HCV or HBV | Coinfection with HCV or HBV | No. of ARVs | CD4/CD8 ratio | CD4 cel/mm3switches | |
| Year at first ART | Year of HIV-1 diagnosis | Year of first ART | Age at first ART | %CD8 | %CD4 | Year of infection | |
| Year of HIV-1 diagnosis | CD8 cell counts/mm3 | Age at first ART | No. of previous ART regimen switches | %CD4 | CD8 cell counts/mm3 | Year of HIV-1 diagnosis | |
| NNRTI experience | CD4/CD8 ratio | PI experience | Year of HIV-1 diagnosis | Year of first ART | %CD8 | ||
| CD8 cell/m3 | No. of previous ART regimen switches | No. of ARVs | Sampling year | Age at sequencing | |||
| Iint experience | Sampling year | IP experience | |||||
| CD4 cell/mm3 | |||||||
| Age at first ART | |||||||
| %CD8 | |||||||
d, genetic diversity; dN, frequency of nonsynonymous mutations; dS, frequency of synonymous mutations; DRMs, drug resistance mutations; PI, protease inhibitor; NRTI, nucleoside reverse transcriptase inhibitor; NNRTI, nonnucleoside reverse transcriptase inhibitor; ART, Antiretroviral treatment; HBV, Hepatitis B viruses; HCV, hepatitis C virus; ARVs, antiretroviral drugs; Iint, Integrase inhibitor. In each column, the variables are sorted by relative importance.
Percentage of Correctly Classified Instances by Each Model Using Wrapper, Gain Ratio, and CFS Data Sets in Our Study Cohort
| Model | Data set | Parameter | |||||||
|---|---|---|---|---|---|---|---|---|---|
| Major DRM to PI | DRM to NRTI | DRM to NNRTI | DRM presence | ||||||
| J48 | Wrapper | Correctly classified instances | 47.4% | 79.3% | 76.4% | 91.5% | 86.7% | 80.6% | 93% |
| Gain ratio | 86.1% | 66.8% | 72.8% | 77.9% | 80.6% | 79.6% | 89.3% | ||
| CFS | 68% | 68.9% | 63.6% | 81.2% | 85.1% | 80.1% | 90.6% | ||
| IB1 | Wrapper | 37.1% | 69.4% | 72.8% | 87.7% | 82.6% | 80.1% | 86.8% | |
| Gain ratio | 78.4% | 73.6% | 75.9% | 92.5% | 83.7% | 86.6% | 92.5% | ||
| CFS | 76.3% | 68.4% | 67.7% | 85.5% | 82.4% | 83.3% | 93.1% | ||
| IB3 | Wrapper | 47% | 68% | 67% | 88.7% | 85% | 79% | 91.8% | |
| Gain ratio | 74.2% | 70.9% | 75.4% | 85.9% | 83.2% | 87.1% | 93.1% | ||
| CFS | 73.7% | 68.4% | 64.1% | 73.7% | 89.2% | 85.5% | 94.9% | ||
Predictive models: J48, Classification tree; IB1 and IB3, Nearest-neighbor; CFS, Correlation Feature Selection; d, genetic diversity; dN, frequency of nonsynonymous mutations; dS, frequency of synonymous mutations; PI, protease inhibitor; NRTI, nucleoside reverse transcriptase inhibitor; NNRTI, non nucleoside reverse transcriptase inhibitor; DRM, drug resistance mutations.