| Literature DB >> 29976219 |
Art F Y Poon1, Jessica L Prodger2,3, Briana A Lynch4, Jun Lai3, Steven J Reynolds3,5, Jingo Kasule5, Adam A Capoferri3, Susanna L Lamers6, Christopher W Rodriguez6, Daniel Bruno7, Stephen F Porcella7, Craig Martens7, Thomas C Quinn4,3, Andrew D Redd4,3.
Abstract
BACKGROUND: The ability of HIV-1 to integrate into the genomes of quiescent host immune cells, establishing a long-lived latent viral reservoir (LVR), is the primary obstacle to curing these infections. Quantitative viral outgrowth assays (QVOAs) are the gold standard for estimating the size of the replication-competent HIV-1 LVR, measured by the number of infectious units per million (IUPM) cells. QVOAs are time-consuming because they rely on culturing replicate wells to amplify the production of virus antigen or nucleic acid to reproducibly detectable levels. Sequence analysis can reduce the required number of culture wells because the virus genetic diversity within the LVR provides an internal replication and dilution series. Here we develop a Bayesian method to jointly estimate the IUPM and variant frequencies (a measure of clonality) from the sequence diversity of QVOAs.Entities:
Keywords: Bayesian inference; HIV-1; Latent viral reservoir; Limiting dilution assay; Next-generation sequencing; Viral outgrowth assay
Mesh:
Year: 2018 PMID: 29976219 PMCID: PMC6034329 DOI: 10.1186/s12977-018-0426-1
Source DB: PubMed Journal: Retrovirology ISSN: 1742-4690 Impact factor: 4.602
Fig. 1Schematic diagrams of experimental and data analysis procedures. (Left panel) Resting CD4+ T cells sampled from an HIV+ patient are serially diluted in replicate culture wells. Uninfected cells are added to the culture wells to amplify viral outgrowth (red). IUPMStats estimates the rate parameter of the single-hit Poisson model from the numbers of positive wells at varying dilutions. (Right panel) HIV-1 RNA is extracted from each positive well and amplified for library construction and sequencing. The presence/absence of different sequence variants are tabulated and used to fit a multi-target Poisson model by Markov chain Monte Carlo (MCMC) sampling, in which a prior distribution on the IUPM (lower right, grey dashed curve) is updated by the data to estimate the posterior distribution (solid curve)
Fig. 2Relative error in estimating IUPM for data simulated under the standard experimental design. We calculated the relative error of an estimate given true value x as . Each set of box-and-whisker plots summarizes the relative errors for estimates obtained by IUPMStats (red) and IUPMBayes under three different sets of variant frequencies (see inset legend) for a given true value of IUPM (0.2, 1, 5 and 25 per million cells). We used a log-transformation of relative errors and rescaled the y-axis to clarify differences between methods and simulation conditions; 25 outliers with relative errors below .03 were excluded from this plot region
Fig. 3Distribution of estimates from IUPMStats and IUPMBayes for simulations given IUPM = 1. We used a barplot to summarize the distribution of IUPMStats estimates, which makes clear that these estimates are limited to a relatively small number of values given the dilution series and number of replicate wells in our simulation experiments. The leftmost bar (red) corresponds to the median Bayesian posterior estimate employed by IUPMStats when all wells are negative. The distributions of estimates from IUPMBayes under two sets of variant frequencies (1:1 and 1:1:1:1:1) are summarized with Gaussian kernel densities (curves). Unlike the IUPMStats estimates, these distributions were unimodal and centred near the true value. We obtained similar results under varying conditions and IUPM values
Fig. 4Relative errors in IUPM estimates for simulated data with uniform cell counts. Each box plot summarizes the distribution of relative error for 100 replicate simulations for the respective methods: IUPMStats, QVOA-NGSA, and IUPMBayes (three sets of variant frequencies, see legend). The plots are drawn on a log-transformed y-axis to facilitate comparison between methods under varying conditions. When the maximum likelihood estimators employed by IUPMStats or QVOA-NGSA were unable to produce a finite estimate of IUPM, we assigned an arbitrary value of 100 cells per million. The respective box plots are directly labeled with the numbers of non-finite estimates at the outliers or median bands
Comparison of estimates and 95% confidence/credibility intervals for IUPMStats and IUPMBayes
| Positives |
| IUPMStats | IUPMBayes | ||||
|---|---|---|---|---|---|---|---|
| Estimate | Lower 95% | Upper 95% | Median | Lower 95% | Upper 95% | ||
| 0 | 3 | 0.17* | 0 | 0.75 | 0.34 | 0.05 | 1.10 |
| 1 | 16 | 0.29 | 0.04 | 2.06 | 0.61 | 0.15 | 1.68 |
| 2 | 28 | 0.69 | 0.17 | 2.85 | 1.04 | 0.36 | 2.41 |
| 3 | 37 | 1.39 | 0.41 | 4.72 | 1.36 | 0.52 | 2.87 |
| 4 | 16 | ∞ | Undefined | 1.85 | 0.80 | 3.63 | |
We summarized the results of each method on 100 simulations of 4 replicate wells with cells each, where the true IUPM was set to 1.0 with five sequence variants of equal frequency (1:1:1:1:1). We obtained results for the IUPMStats method directly from the online calculator. Entries for IUPMBayes were averaged across replicate simulations for each number of positives. *IUPMStats uses a median posterior estimate instead of the maximum likelihood estimate (0) when none of the wells are positive
Summary of outgrowth assay results for two patients
| Patient | Positive wells (total wells) | IUPMStats | # variants | |||||
|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
| (95% CI) | ( | ||
| 106 | 8 (8) | 2 (2) | 0 (2) | 0 (2) | 0 (2) | 0 (2) | 8.148 | 10, 4 |
| (1.863, 35.635) | ||||||||
| 111 | 13 (16) | 0 (2) | 0 (2) | 0 (2) | 0 (2) | 0 (2) | 1.551 | 13, 20 |
| (0.851, 2.825) | ||||||||
For each dilution (number of cells per well), we report the number of positive wells (total number of wells). IUPM estimates and 95% confidence intervals (CIs) under the single-hit Poisson model were generated using the JavaScript calculator at http://silicianolab.johnshopkins.edu (last access date: June 23, 2017). The final column reports the number of sequence variants that were observed in regions within HIV gp41 and pol, respectively, when HIV RNA was extracted, amplified and sequenced from the positive wells
Fig. 5Summary of IUPM estimates obtained by a Bayesian analysis of experimental data from subjects 106 (high clonality) and 111 (low clonality). IUPM was estimated separately using sequence data obtained for HIV regions pol (red) and gp41 (blue, hatched). Three replicate chains were combined for each patient and gene after assessing that the chains had converged to the posterior distribution over the IUPM parameter. Median estimates are indicated by vertical line segments within each density plot. Grey bars represent the maximum likelihood estimate and 95% confidence interval obtained by IUPMStats [17]. In the case of patient 106, the upper confidence limit from IUPMStats extends to 35.6 cells per million