| Literature DB >> 21220770 |
Roger D Kouyos1, Viktor von Wyl, Sabine Yerly, Jürg Böni, Philip Rieder, Beda Joos, Patrick Taffé, Cyril Shah, Philippe Bürgisser, Thomas Klimkait, Rainer Weber, Bernard Hirschel, Matthias Cavassini, Andri Rauch, Manuel Battegay, Pietro L Vernazza, Enos Bernasconi, Bruno Ledergerber, Sebastian Bonhoeffer, Huldrych F Günthard.
Abstract
BACKGROUND: The time passed since the infection of a human immunodeficiency virus (HIV)-infected individual (the age of infection) is an important but often only poorly known quantity. We assessed whether the fraction of ambiguous nucleotides obtained from bulk sequencing as done for genotypic resistance testing can serve as a proxy of this parameter.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21220770 PMCID: PMC3060900 DOI: 10.1093/cid/ciq164
Source DB: PubMed Journal: Clin Infect Dis ISSN: 1058-4838 Impact factor: 9.079
Summary of the Adjusted Least Squares Regression for the Full Data Set
| All patients | Patients infected ≤8 years | |||
| Measure | No. (%) of patients | Median (95% CI) | No. (%) of patients | Coefficient (95% CI) |
| Total no. of patients | 3,307 (100) | – | 2,729 (100) | – |
| Time since infection (per year) | – | .195 (.170–.221) | – | .158 (.141–.175) |
| Time since infection (quadratic term) | – | −.007 (−.008 to −.005) | – | NA |
| Female sex | 714 (21.6) | .034 (−.053 to .121) | 542 (19.9) | .054 (−.047 to .159) |
| Age by quartile, years | ||||
| 25–30 | 949 (28.7) | Referent | 862 (31.6) | Referent |
| 33–35 | 789 (23.9) | .146 (.060–.233) | 605 (22.2) | .140 (.047–.233) |
| 38–41 | 781 (23.6) | .232 (.144–.319) | 613 (22.5) | .247 (.145–.338) |
| 45–54 | 788 (23.8) | .237 (.146–.328) | 649 (23.8) | .252 (.153–.348) |
| Ethnicity | ||||
| White | 2,933 (88.7) | Referent | 2,455 (90.0) | Referent |
| Black | 63 (1.9) | .169 (−.060 to .399) | 58 (2.1) | .080 (−.122 to .290) |
| Hispano-American | 97 (2.9) | .325 (.141–.509) | 89 (3.3) | .343 (.140–.546) |
| Asian | 57 (1.7) | .090 (−.147 to .328) | 55 (2.0) | .095 (−.109 to .290) |
| Unknown ethnicity | 157 (4.7) | −.039 (−.200 to .122) | 72 (2.6) | .001 (−.186 to .211) |
| Mode of HIV acquisition | ||||
| Heterosexual risks | 857 (25.9) | Referent | 722 (26.5) | Referent |
| Intravenous drug use | 862 (26.1) | .233 (.142–.323) | 595 (21.8) | .247 (.140–.357) |
| Homosexual risks | 1,482 (44.8) | −.066 (−.154 to .021) | 1326 (48.6) | −.063 (−.154 to .031) |
| Unknown risk | 106 (3.2) | .121 (−.062 to .305) | 86 (3.2) | .186 (−.006 to .416) |
| Laboratory | ||||
| Laboratory A | 213 (6.4) | Referent | 178 (6.5) | Referent |
| Laboratory B | 1,450 (43.8) | .407 (.274–.540) | 1176 (43.1) | .357 (.246–.456) |
| Laboratory C | 319 (9.6) | .404 (.247–.562) | 264 (9.7) | .312 (.180–.430) |
| Laboratory D | 1,325 (40.1) | .787 (.653–.921) | 1111 (40.7) | .705 (.594–.803) |
| Calendar year of sequencing, median (IQR) | 2007 (2006–2008) | .119 (.101–.137) | 2007 (2006–2008) | .111 (.093–.127) |
| HIV RNA load at time of GRT by 33 percentiles, log10 copies/mL | ||||
| 3.3–4.0 | 1,013 (30.6) | Referent | 829 (30.4) | Referent |
| 4.5–4.8 | 1,013 (30.6) | .097 (.017–.176) | 859 (31.5) | .043 (−.050 to .128) |
| 5.2–5.7 | 1,011 (30.6) | .136 (.056–.217) | 853 (31.3) | .063 (−.020 to .151) |
| HIV RNA load missing | 270 (8.2) | −.106 (−.235 to .023) | 188 (6.9) | −.130 (−.273 to .026) |
| Constant term (1 unit = 1%) | – | −1.046 (−1.231 to −.860) | – | −.885 (−1.042 to −.732) |
NOTE. The 95% confidence intervals (CIs) were estimated with bootstrapping over 1,000 replicates. Note that for the quadratic model (all patients) both the linear and the quadratic term significantly improve the fit, but not when only the first 8 years of infection were considered. GRT, genotypic resistance test; HIV, human immunodeficiency virus; IQR, interquartile range; NA, not applicable.
Statistically significant by the Wald test.
Figure 1.Relationship between the year of infection and the fraction of ambiguous nucleotides (f). A, Mean of f as a function of the age of infection, where data points have been binned according to the age of infection in years (n = 3,307 patients). The shaded area corresponds to the 95% confidence interval of the means of f. These confidence intervals have been determined by bootstrap (with 1,000 samples). The red point corresponds to the mean of f for the Zurich Primary HIV Infection Study data set (n = 130), for which all sequences stem from the first few months after the infection. The associated red line gives the 95% confidence interval of this mean. B, Quadratic and linear fit of the full data set. Note that the linear fit is restricted to ages of infection of ≤8 years. C, Linear fit of the different data sets. Only sequences obtained within the first 8 years after infection were considered. D, Distribution of the age of infection (in years) for different fractions of ambiguous nucleotides. The left plot depicts the density plot of the age of infection for the 5 quintiles of f. The right plot depicts, for each of the 5 quintiles of f, the 25%–75% percentiles (green lines) and 5%–95% percentiles (black lines) of the year of infection.
Summary of the Two Cutoff Methods: A Priori Defined Threshold of .5% and Cutoffs Based on Quintiles
| Stratum number | Stratification cutoff, % | No. per stratum | No. (%) of patients infected ≤1 year | Comparison of strata | Sensitivity, % | Specificity, % | Correctly Classified, % |
| Predefined | |||||||
| 1 | ≤.5 | 1,117 | 184 (16.5) | 1 vs 2 | 86.8 | 69.9 | 70.9 |
| 2 | >.5 | 2,190 | 28 (1.3) | – | – | – | – |
| By quintiles | |||||||
| 1 | 0–.15 | 679 | 146 (21.5) | 1 vs 2, 3, 4, 5 | 68.8 | 82.8 | 81.9 |
| 2 | .16–.67 | 649 | 44 (6.8) | 1, 2 vs 3, 4, 5 | 89.6 | 63.2 | 64.9 |
| 3 | .68–1.35 | 691 | 14 (2.0) | 1, 2, 3 vs 4, 5 | 96.2 | 41.4 | 44.9 |
| 4 | 1.36–2.07 | 633 | 7 (1.1) | 1, 2, 3, 4 vs 5 | 99.5 | 21.1 | 26.2 |
| 5 | 2.08–6.65 | 655 | 1 (.2) | – | – | – | – |
Indicates how strata are collapsed for comparison; for example, strata 1 versus all remaining strata.
Proportion of patients infected ≤1 year whose human immunodeficiency virus (HIV) sequence had ≤.5% ambiguous positions.
Proportion of patients infected >1 year whose HIV sequence had >.5% ambiguous positions.
Figure 2.Temporal increase of the fraction of ambiguous nucleotides in the Wright-Fisher model (WFM) for a population size of 500 and a mutation rate of 3 × 10−5 mutations per generation (solid green line) and at 4-fold degenerate third-codon positions in the full data set (dashed black line). The curve for the WFM has been obtained by averaging over 104 runs of the model. N and m denote the effective population size and the mutation rate, respectively. The WFM describes discrete and nonoverlapping generations in a population with fixed size N. Every generation, each of the N genomes undergoes mutation with probability m. Then the N genomes for the next generation are determined from the gene pool by drawing every offspring genome with uniform probability from the N parental genomes. Note that the WFM assumes selective neutrality.
Unadjusted and Adjusted Estimates for the Growth Rates Over the First Eight Years of Infection and Summary of the Cutoff Method Based on the .5% Threshold for the Four Largest Non-B Subtype Groups in Switzerland
| Cutoff of <.5% | |||||||
| Subtype | No. (%) of recent infections | Univariable model, Coefficient (95% CI) | Multivariable model, Coefficient (95% CI) | Sensitivity, % | Specificity, % | ROC, % | NPP, % |
| A ( | 11 (7.1) | .197 (.121 | .193 (.105 | 63.6 | 63.6 | 63.6 | 95.6 |
| CRF01_AE ( | 10 (5.8) | .210 (.142 | .141 (.064 | 100.0 | 59.5 | 79.8 | 100.0 |
| CRF02_AG ( | 15 (8.1) | .159 (.081 | .110 (.098 | 80.0 | 51.8 | 65.9 | 96.7 |
| C ( | 6 (4.6) | .150 (.069 | .104 (.024 | 100.0 | 66.4 | 83.2 | 100.0 |
NOTE. The unadjusted and adjusted estimates for the growth rates over the first 8 years of infection are analogous to Table 1. The summary of the cutoff method based on the .5% threshold is analogous to Table 2. CI, confidence interval; NPP, negative predictive value; ROC, receiver operating characteristic.