| Literature DB >> 22807657 |
Marie-Claude Boily1, Benoît Mâsse, Ramzi Alsallaq, Nancy S Padian, Jeffrey W Eaton, Juan F Vesga, Timothy B Hallett.
Abstract
The rigorous evaluation of the impact of combination HIV prevention packages at the population level will be critical for the future of HIV prevention. In this review, we discuss important considerations for the design and interpretation of cluster randomized controlled trials (C-RCTs) of combination prevention interventions. We focus on three large C-RCTs that will start soon and are designed to test the hypothesis that combination prevention packages, including expanded access to antiretroviral therapy, can substantially reduce HIV incidence. Using a general framework to integrate mathematical modelling analysis into the design, conduct, and analysis of C-RCTs will complement traditional statistical analyses and strengthen the evaluation of the interventions. Importantly, even with combination interventions, it may be challenging to substantially reduce HIV incidence over the 2- to 3-y duration of a C-RCT, unless interventions are scaled up rapidly and key populations are reached. Thus, we propose the innovative use of mathematical modelling to conduct interim analyses, when interim HIV incidence data are not available, to allow the ongoing trials to be modified or adapted to reduce the likelihood of inconclusive outcomes. The preplanned, interactive use of mathematical models during C-RCTs will also provide a valuable opportunity to validate and refine model projections.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22807657 PMCID: PMC3393676 DOI: 10.1371/journal.pmed.1001250
Source DB: PubMed Journal: PLoS Med ISSN: 1549-1277 Impact factor: 11.069
Main characteristics of cluster randomized controlled trials for combination prevention of HIV transmission commissioned by PEPFAR.
| Study | CDC/HSPH | JHU/USAID | PopART (HPTN 071) |
|
| Botswana | Iringa, Tanzania | Zambia+South Africa (Western Cape) |
|
| 2 | 2 | 3 |
|
| A: Enhanced HIV testing (including mobile and home-based testing), active linkage to care and treatment; enhanced MC; ART for all HIV-infected persons with CD4<350 cells/µl or with HIV-1 RNA>10,000 copies/ml; and point-of-care CD4 testing in antenatal clinics with universal HAART in pregnancy started by 28 wk gestation, as well as HIV retesting at delivery among women HIV-negative in second trimester or earlier | A: Treatment by CD4<350 cells/µl; active scale-up and linkage to MC; cash transfer for young women; targeted outreach to the most at-risk populations (including female sex workers); social and behaviour change communication | A: Universal community home-based testing; active linkage of HIV-positive individuals to care and immediate ART according to national guidelines and/or MC. B: Same as A but ART at CD4<350 cells/µl |
|
| B: Standard of care | B: Standard of care | C: Enhanced standard of care |
|
| Pair matched | Stratified | Triplet matched |
|
| |||
| Total | 30 | 24 | 24 (South Africa: 9, Zambia: 15) |
| Per arm | 15 | 12 | 8 |
|
| 5,800 | 8,000–10,000 (∼55%>15 y) | 50,000 (25,000>18 y) |
|
| |||
| Age eligibility | 16–64 y | 15–39 y | 18–44 y |
| Size per cluster | ∼500 adults per cluster | ∼500 adults per cluster | ∼2,500 adults per cluster |
| Total size | 15,000 | 12,000 | 60,000 |
|
| HIV incidence | HIV incidence | HIV incidence |
|
| 3–4 y | 2 y | 2 y |
|
| ∼1.5 per 100 person-years | 1 .0–1.5 per 100 person-years | 1.0–1.5 per 100 person-years |
|
| 25% | 10%–15% | 15% |
|
| In arm A versus B: ∼50% | In arm A versus B: ∼40% (35%–50%) | In arm A versus C: −50% to 60%; in arm B versus C: −25% to 30% |
|
| Start | Start, interim, final | Start, final |
|
| Planning | Pre-trial | Pre-trial |
Data as of 15 March 2012.
The design of the intervention and plan of analysis for this trial are still being finalised.
Standard of care is ART for HIV-positive individuals with CD4<350 cells/µl or AIDS.
Standard of care is standard referral to MC and ART according to Tanzania guidelines (this will soon change from CD4<200 cells/µl to CD4<350 cells/µl, initially focusing on HIV-positive people with tuberculosis and pregnant women).
Standard of care is no home-based testing or home-based visit to facilitate linkage to ART. ART given according to country guidelines; standard referral to MC.
Cumulative HIV incidence measured over the trial duration.
CDC/HSPH, US Centers for Disease Control and Prevention/Harvard School of Public Health; HAART, highly active ART; JHU/USAID, Johns Hopkins University/United States Agency of International Development; PopART (HPTN 071), HIV Prevention Trials Network.
Summary of important considerations for the design and interpretation of cluster randomized controlled trials (of combination interventions.
| Important Considerations | Implications for Trials |
|
| |
| Increase in intervention impact following the start of trial can be slow due to a number of delays before the full impact develops | Short-term impact will underestimate the long-term impact; substantially reducing HIV incidence over a trial of short duration will be challenging even with an ambitious combination intervention and rapid scale-up; it is important to set realistic expectations about the achievable magnitude of impact over the trial duration; this slow growth in impact can undermine the utility of stepped-wedge designs (with staggered randomized time of delivery of the intervention in each community) to measure a difference in HIV incidence between different interventions or components because the time interval between steps may need to be unfeasibly long |
| The maximum impact of different intervention components is achieved at different times | The trial duration will influence which type of intervention seems to be the most effective; the overall impact of a combination intervention will be most strongly determined by different components at different times |
| The epidemiological context influences the intervention impact | The impact of the same intervention may not be the same across trials conducted in different epidemiological contexts; the results of the trial may not be generalisable to other settings |
| HIV prevalence and HIV incidence do not exhaustively describe the epidemiological context | This may introduce imbalance between the intervention and control arms, even after matching for HIV prevalence or even HIV incidence |
| The drivers of short-term and long-term impact can be different | Sufficient information on the epidemic drivers should be collected during the trial to help interpret trial results and to predict longer term impact |
| Distribution of coverage matters even at high coverage | Intervention impact can be substantially reduced if the intervention does not reach high-risk individuals; intervention impact can be substantially improved if the intervention does reach high transmitters; to understand trial results, detailed information on programmatic (e.g., coverage, uptake) and intermediate outcomes (e.g., change in behaviour, CD4 levels, viral load) by risk groups, age, and clinical status in both the intervention and control communities will be essential |
|
| |
| Measurement of HIV incidence in a cohort over the whole trial duration, before the intervention has reached its full effect, underestimates the change in incidence that is achieved at the end of the trial | It would be better to measure incidence at the start and end of the trial using two independent cohorts with shorter follow-up |
| Evolving standard of care in control arm, as the coverage or scale-up of standard of care may improve over time | Reduces the contrast between intervention and control communities over time; our ability to measure a difference between trial arms will depend on the rapid scale-up of the intervention, having a large number of clusters to enable detection of smaller effects, or having trial duration longer than 2–3 y, to allow the intervention impact to be seen |
| Imbalance in key epidemiological characteristics between trial arms can occur, as HIV incidence and prevalence do not determine all key epidemiological characteristics that influence intervention impact | Could lead to a spurious indication that the intervention is working better or worse than it really did—matching clusters may be desirable; matching on HIV prevalence alone may not be sufficient, as trajectories in incidence and underlying patterns of risk behaviour across trial communities would not be captured |
| Dilution and contamination of the intervention impact may occur due to movement and sexual partnerships across multiple communities | The influence of the different sources of contamination on trial results will depend on the type of intervention; when there is extensive sexual contact between individuals from the trial arms, the measurable impact may be more strongly determined by acquisition-reducing than infectiousness-reducing interventions, such as ART; choosing distinct, independent communities will be important, especially to evaluate ART interventions |
Stepped-wedge design can still be useful for programme and intermediate outcomes, as changes in these outcomes can occur more rapidly than for HIV incidence or prevalence.
Figure 1Predicted short-term impact of three intervention components linked to HIV testing in KwaZulu-Natal, South Africa.
The model is based on a high-transmission setting under conditions of the current standard of care versus a high-coverage combination intervention (see [26]). The instantaneous HIV incidence rate ratio in the y-axis is intervention versus control. Impact estimates include an initial 6-mo period of preparation for the study. Assumptions for the combination intervention: 90% of adults in the intervention community are tested in the first year and thereafter every 4 y; those who test positive reduce risk behaviour for 3 y (on average) (25.0%/12.5% of men/women increase condom use; 25%/25% reduce partner acquisition); 70% of uncircumcised men are circumcised in the first year (efficacy = 60%); and all those in need of treatment (CD4 cell count <350 cells/µl) are immediately treated with ART (efficacy = 92%) with an annual dropout rate from treatment of 5%. The efficacy of MC in reducing susceptibility is assumed to be immediate (i.e., the wound healing period is negligible). Viral suppression for infected individuals once on treatment is immediate (i.e., no delay between treatment initiation and viral suppression). Assumptions for the standard of care: 20% of individuals test annually; 12.5%/6.5% of men/women who test positive increase condom use, and 12.5%/12.5% reduce partner acquisition, for one year; HIV-positive individuals are treated if CD4<200 cells/µl (dropout rate of 15%); and 27% of men are circumcised at baseline and 10% more over 4 y since the start of the intervention.
Figure 2Consequence of measuring HIV incidence over the whole trial duration.
Comparison of the instantaneous reduction in HIV incidence measured at one time point with the cumulative incidence rate ratio (IRR) measured over the whole trial duration (i.e., in a cohort that was initiated at the start of the trial) in a simulated population in Zimbabwe [16]. The grey dotted line shows the IRR if the full impact were achieved at the start of the intervention rather than after 10 y. The instantaneous IRR is 0.65 compared with only 0.77 for the cumulative IRR at year 10. From [16].
Figure 3Logical flow of interim modelling analyses.
This approach uses available data from the baseline surveys in each trial cluster and information on process indicators of coverage and intensity available for each cluster within each trial arm gathered after the start of the trial. These data would not include observed HIV incidence. The interim modelling analysis may come to one of four conclusions. (i) The targeted effect size on HIV is likely to be achieved at the end of the study without having to modify the intervention targets/implementation strategy. (ii) The targeted effect size is unlikely to be achieved, and therefore the intervention targets/implementation strategy need to be revised. (iii) The targeted effect size is unlikely to be achieved, even if the intervention targets are improved to their realistic maximum, unless there is a change in the study design (such as an increase in sample size or study duration). (iv) There is little chance of being able to detect an impact at the end of the trial even if the study duration is increased. The number of interim analyses should be predetermined at the start of the trial and take into account trial characteristics, logistical considerations (such as the time and cost required to regularly update programmatic data during the trial and to perform the modelling analyses), and the statistical effect of the interim analysis and proposed changes on the overall type I error.
Figure 4Logical flow of modelling stages for the final impact analyses.