| Literature DB >> 30978183 |
Daniel I S Rosenbloom1,2, Peter Bacchetti3, Mars Stone4, Xutao Deng4, Ronald J Bosch5, Douglas D Richman6,7, Janet D Siliciano8, John W Mellors9, Steven G Deeks10, Roger G Ptak11, Rebecca Hoh10, Sheila M Keating4,10, Melanie Dimapasoc4, Marta Massanella6, Jun Lai8, Michele D Sobolewski9, Deanna A Kulpa11,12, Michael P Busch4,13.
Abstract
Quantitative viral outgrowth assays (QVOA) use limiting dilutions of CD4+ T cells to measure the size of the latent HIV-1 reservoir, a major obstacle to curing HIV-1. Efforts to reduce the reservoir require assays that can reliably quantify its size in blood and tissues. Although QVOA is regarded as a "gold standard" for reservoir measurement, little is known about its accuracy and precision or about how cell storage conditions or laboratory-specific practices affect results. Owing to this lack of knowledge, confidence intervals around reservoir size estimates-as well as judgments of the ability of therapeutic interventions to alter the size of the replication-competent but transcriptionally inactive latent reservoir-rely on theoretical statistical assumptions about dilution assays. To address this gap, we have carried out a Bayesian statistical analysis of QVOA reliability on 75 split samples of peripheral blood mononuclear cells (PBMC) from 5 antiretroviral therapy (ART)-suppressed participants, measured using four different QVOAs at separate labs, estimating assay precision and the effect of frozen cell storage on estimated reservoir size. We found that typical assay results are expected to differ from the true value by a factor of 1.6 to 1.9 up or down. Systematic assay differences comprised a 24-fold range between the assays with highest and lowest scales, likely reflecting differences in viral outgrowth readout and input cell stimulation protocols. We also found that controlled-rate freezing and storage of samples did not cause substantial differences in QVOA compared to use of fresh cells (95% probability of < 2-fold change), supporting continued use of frozen storage to allow transport and batched analysis of samples. Finally, we simulated an early-phase clinical trial to demonstrate that batched analysis of pre- and post-therapy samples may increase power to detect a three-fold reservoir reduction by 15 to 24 percentage points.Entities:
Mesh:
Substances:
Year: 2019 PMID: 30978183 PMCID: PMC6481870 DOI: 10.1371/journal.pcbi.1006849
Source DB: PubMed Journal: PLoS Comput Biol ISSN: 1553-734X Impact factor: 4.475
Fig 1Experimental and analytical design of the study.
Panel a: Three frozen aliquots were provided by each HIV+ participant to each lab; an additional fresh aliquot was provided to each lab except SR. Panel b: Experimental design at one of the four labs (U. Pitt.). Fresh panel (five batches): One aliquot from each HIV+ participant was studied fresh and was not batched with any other aliquots. Frozen panel (nine batches): Three aliquots from each participant were cryopreserved and batched together with one other aliquot. Five batches contain two aliquots from the same HIV+ participant. Two HIV+ participants are chosen to supply one aliquot to the same batch (here, participants 2 and 3). The remaining three batches contain an aliquot from one of the remaining HIV+ participants and an aliquot from the negative control. In each lab, different HIV+ participants are chosen for the mixed batch (see S1 Table for complete experimental design). Panel c: Sketch of statistical model used to estimate IUPM for each participant (v); cryopreservation effect (β); systematic effect for each lab (β, set to zero for U. Pitt., which was arbitrarily chosen as reference); and random variation at the level of aliquot, batch, and lab (a, b, c, respectively). These fixed and random effects combine to determine the likelihood that a given well is positive, and the likelihood of the data equals the product of likelihoods of all wells (see Eq (3)).
Fig 2Infection frequency and 95% CI estimated separately for each aliquot by maximum likelihood (i.e., not using the mixed effects statistical model, see Methods: “Analyzing aliquots separately”).
Cryopreserved aliquots are indicated by shaded symbols, fresh aliquots by open symbols. “Index i” is used in model output, and “Cohort ID” represents the identifier used in the SCOPE/OPTIONS cohort.
Fixed and random effects in the model of outgrowth, Eq (2).
| Symbol | Effect type | Description |
|---|---|---|
| Fixed | Log of infection frequency from study participant | |
| Fixed | Scaling parameter for lab | |
| Random | Effect of aliquot-level random variation in aliquot | |
| Random | Effect of batch-level random variation in batch | |
| Random | Effect of lab-level random variation for participant | |
| Fixed | Cryopreservation effect |
Performance of MCMC estimation using the ensemble model and prior, in simulation of multi-lab experimental design in S3 Table.
The experiment has a total of 194 million to 289 million cells from each of five participants, distributed among four labs, encompassing 474 to 569 wells per participant. Bias and absolute error represent over- or underestimation of effect sizes, as difference in log10 of fold-change. For example, typical batch effects are estimated to be 10−0.202 = 63% of the simulated truth. See Methods (“Validation by simulation: Multiple labs”) for simulation details.
| Parameter | Median [2.5, 97.5%ile] bias of estimate (log10) | Median absolute error of estimate (log10) | 95% CI coverage |
|---|---|---|---|
| Total variation | +0.008 [−0.097, +0.132] | 0.040 | 95.6% |
| Aliquot & batch variation | −0.024 [−0.254, +0.126] | 0.058 | 96.6% |
| Aliquot & lab variation | +0.060 [−0.046, +0.169] | 0.061 | 78.7% |
| Batch & lab variation | −0.025 [−0.235, +0.130] | 0.063 | 95.3% |
| +0.043 [−0.175, +0.168] | 0.082 | 88.4% | |
| −0.202 [−0.248, +0.093] | 0.202 | 89.5% | |
| +0.009 [−0.189, +0.191] | 0.114 | 93.2% | |
| −0.000 [−0.278, +0.268] | 0.095 | 95.0% | |
| +0.012 [−0.333, +0.369] | 0.126 | 93.4% | |
| +0.000 [−0.390, +0.409] | 0.135 | 93.2% | |
| −0.013 [−0.461, +0.417] | 0.142 | 94.0% |
Systematic lab effects, fold-change from U. Pitt.
Posterior medians and 95% CIs shown.
| Lab | Ensemble model (accounting for excess variation) | Enforcing model without excess variation |
|---|---|---|
| UCSD | 9.2 (3.8—24) | 7.4 (5.7—9.7) |
| JHU | 0.81 (0.30—2.4) | 0.75 (0.56—1.0) |
| SR | 0.39 (0.13—1.12) | 0.51 (0.35—0.75) |
Estimated excess variation in QVOA.
| Level | Posterior probability of excess variation | Estimated variation, fold-change (posterior median and 95% CI) | |
|---|---|---|---|
| Between aliquot | 0.980 | 1.5 (1.1–2.1) | |
| Between batch | 0.904 | 1.8 (1.0–2.4) | |
| Between lab | 0.963 | 1.5 (1.0–2.5) | |
| Between aliquot & batch, combined | 1−10−18 (var. at either level) | 2.0 (1.6–2.7) | |
| Total excess variation between aliquots in two different labs | At least one level | 1−10−22 | 2.3 (1.8–3.5) |
| At least two levels | 0.9948 | ||
| All three levels | 0.85 | ||
Effect of cryopreservation, fold-change (posterior median and 95% CI).
| Lab(s) | Ensemble model (accounting for excess variation) | Enforcing model without excess variation |
|---|---|---|
| All four, using JHU day 14 readout | 1.04 (0.56–1.97) | 1.0 (0.78–1.26) |
| U. Pitt. | 0.61 (0.18–1.65) | 0.71 (0.42–1.18) |
| UCSD | 0.79 (0.27–2.17) | 0.75 (0.56–1.05) |
| JHU (day 14 readout) | 2.89 (0.62–15.7) | 2.09 (1.30–3.44) |
Fig 3Accuracy of assays used in the experimental study.
Each assay is measured against a consensus standard, appropriately scaled by β for that assay. “All-negative” represents infinite error on the fold-change scale, which occurs when the maximum likelihood estimate of IUPM is zero. Median estimate and 95% credible intervals shown for 0.1, 0.2, 0.5, 1, 2, and 4 IUPM on the U. Pitt. scale. At IUPMs of 1 or more, measured values in these assays are expected to be within 1.6- to 1.9-fold of the truth.
Fig 4Difference in accuracy between UCSD and JHU assays, assuming that the JHU assay is a gold standard (not subject to lab-based random effect).
Batch variation-free ensemble estimates of parameters were used in simulations. Median estimate and 95% credible intervals shown for 0.1, 0.2, 0.5, 1, and 2 IUPM on the U. Pitt. scale. All values plotted are also provided in S14 and S15 Tables.
Simulated results of latency reduction trials, using a t-test to compare pre- and post-treatment data.
| JHU protocol | ||||
| Weaker therapy (3x), | Stronger therapy (10x), | |||
| Unbatched | Batched | Unbatched | Batched | |
| Median estimated fold-reduction | 3.04 | 2.83 | 8.77 | 6.62 |
| Median bias (%) | +1% | −6% | −12% | −34% |
| Median log10 absolute error | 0.14 | 0.12 | 0.19 | 0.20 |
| Accuracy improvement from batching (% reduction in abs. error) | 14% | −5% | ||
| Coverage of 95% CI | 95.4% | 93.9% | 94.7% | 90.5% |
| Power ( | 57.8% | 72.4% | 75.1% | 77.1% |
| Power improvement from batching (percentage point increase) | 15% | 2% | ||
| UCSD protocol | ||||
| Weaker therapy (3x), | Stronger therapy (10x), | |||
| Unbatched | Batched | Unbatched | Batched | |
| Median estimated fold-reduction | 3.03 | 2.95 | 9.29 | 8.57 |
| Median bias | +1% | −2% | −7% | −14% |
| Median log10 absolute error | 0.14 | 0.10 | 0.19 | 0.17 |
| Accuracy improvement from batching (% reduction in abs. error) | 29% | 11% | ||
| Coverage of 95% CI | 94.6% | 94.1% | 94.5% | 94.7% |
| Power ( | 58.6% | 82.6% | 78.5% | 86.4% |
| Power improvement from batching (percentage point increase) | 24% | 8% | ||