| Literature DB >> 33079927 |
Eric W Djimeu1,2, Anna Heard3.
Abstract
Replication is an important tool to promote high quality research and ensure policy makers can rely on studies in making guidelines or funding programs. By ensuring influential studies are replicable we provide assurance that the policies based on these studies are well-founded and the conclusions and recommendations are robust-to different estimation models or different choices. In this paper, we argue that replication is not only useful but necessary to ensure that an author's choice in how to analyse data is not the only factor that determines whether an intervention is effective or not. We also show that while most research is done well and provides robust results, small differences can lead to different interpretations and these differences need to be acknowledged. This special issue highlights 5 such replication studies, which are replications of influential studies on biomedical, social, behavioural and structural interventions for HIV prevention and treatment. We reflect on their findings. Four out of five studies, which conduct push button replication and pure replication, were able to reproduce the results of the original studies with minor differences, mainly due to minor typographical errors or rounding differences. The analysis of the measurement and estimation analyses conducted in these five studies reveals that the original results are not very robust to alternative analytical approaches, especially when these results rely on a small number of observations. In these cases, the original results are weakened. Furthermore, in contrast to the original papers, two of the five included replication studies conducted a theory of change analysis-to explore how or why the interventions work (or do not) not just whether the intervention works or not. These two analyses indicate that the estimated impacts of the interventions are drawn from few mediators. In addition, they demonstrate that, in some cases, a lack of effect may be related to lack of adequate exposure to the intervention rather than inefficacy of the intervention per se. However, overall, the included replication studies show that the results presented in the original papers are trustworthy and robust, especially when based on larger sample sizes. Replication studies can not only verify the results of a study, they can also provide additional insights on the published results, such as how and why an intervention was effective or less effective than expected. They can thus be a tool to inform the research community and/ or policymakers about whether and how interventions could be adopted, which need to be tested further, and which should be discontinued because of their ineffectiveness. Thus, publishing these replication studies in peer-reviewed journals makes the work public and publicized. The work advances knowledge, and publication should be encouraged, as it is for other types of research.Entities:
Mesh:
Substances:
Year: 2020 PMID: 33079927 PMCID: PMC7575085 DOI: 10.1371/journal.pone.0240159
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Presents a summary of findings and methods used in the original paper and the replication study.
| Study | Methods used in the original paper | Methods used in MEA of the replication | Main Findings from the original study | Main Findings from the replication study |
|---|---|---|---|---|
| Task shifting of antiretroviral treatment from doctors to primary-care nurses in South Africa (STRETCH): a pragmatic parallel, cluster-randomised trial by Fairall and colleagues (2012) [ | 1. Cox proportional hazards (PH) models and Huber-White robust adjustment of errors for intracluster correlation of outcomes | 1. The Schoenfeld residuals test and cumulative sums of martingale-based residuals methods | The task-shifting program was not inferior to standard care: overall, no outcomes were worse in the task shifting intervention groups than in the standard care groups. | 1. Pure replication validates the original findings. |
| 2. The MEA also validates the original findings: | ||||
| A. Overall, time to death did not differ between intervention and control patients; | ||||
| 2. The generalized estimating equation (GEE) approach | ||||
| 3. The frailty model | ||||
| 4. The generalized linear mixed-effects model (GLMM) | ||||
| 2. Binomial regression | B. Rates of viral suppression, a year after enrolment, were equivalent in the intervention and control groups. | |||
| Effect of a cash transfer programme for schooling on prevalence of HIV and herpes simplex type 2 in Malawi: a cluster randomised trial by Sarah Baird and others (2012) [ | 1. An intent to treat analysis, using unadjusted and adjusted ORs with logistic regression models, with robust standard errors, which allows for intraclass correlation. | 1. GLMMs (also known as hierarchical or multilevel models) | The intervention lowered the odds of HIV and HSV-2 prevalence in baseline schoolgirls but did not have a significant effect for baseline dropouts. | 1. Pure replication validates the original findings. |
| 2. Permutation test | 2. In the MEA, it was found that the intervention effect on HIV prevalence was somewhat sensitive to model choice. | |||
| 3. Bivariate outcome estimation | ||||
| 3. In the permutation analysis and in a GLMM, the intervention effect on HIV prevalence was no longer significant, but the results for HSV-2 prevalence were retained. | ||||
| 2. Adjusted ORs were calculated, including age group and geographical stratum as fixed effects, as well as the baseline values for any behavioural outcomes. | ||||
| HIV development assistance and adult mortality in Africa [ | 1. A logistic regression and a difference-in-difference analysis to evaluate the effects of PEPFAR implementation. Specifically, the original authors compared the odds of adult (defined as men and women aged 15 to 59 years) all-cause mortality in focus and non-focus countries pre- and post-PEPFAR implementation. | 1. The use of a logistic regression with the difference-in-difference model but limited to only observations between 2000 and 2006 (the Duber and colleagues (2010) time frame). | Between 2004 and 2008, all-cause adult mortality declined more in countries that implemented PEPFAR than countries that did not implement PEPFAR. | 1. Pure replication validates the original findings. |
| 2. Bendavid and colleagues’ findings were robust to the time period used by Duber and colleagues (2010). | ||||
| 2. A sensitivity analysis: a logistic regression on the unadjusted and adjusted model, leaving out any one country at a time, including only countries that had all data for 2000 to 2006, and using a linear time trend, as opposed to a dichotomous indicator, for PEPFAR implementation. | ||||
| 3. These findings did not match the findings from Duber and colleagues (2010) who find that PEPFAR may have little or no impact on health outcomes not explicitly targeted. | ||||
| Timing of antiretroviral therapy for HIV-1 infection and tuberculosis by Havlir and colleagues (2011)[ | The Pearson chi-square test to compare rates and the Kaplan–Meier method to produce unadjusted survival curves between the two study arms. | 1. Adjustment for loss to follow-up | Earlier ART initiation (within two weeks of the initiation of treatment for TB) reduces the rate of new AIDS-defining illness and death exclusively in persons with CD4 counts lower than 50, as compared with later ART initiation. | 1. Pure replication validates the original findings. |
| 2. An analysis of covariance (ANCOVA) specification, which consists of including the lagged outcome variable in the model specification to estimate the impact of the intervention. | ||||
| 2. Adjusting for loss to follow-up does not affect the main results of the paper. | ||||
| 3. The use of an ANCOVA specification and instrumental variables weakened the main results. Specifically, the estimates from instrumental variables show that earlier ART initiation has no effect on the rate of new AIDS-defining illness and death for HIV positive TB patients with a CD4 count lower than 50. Thus, the MEA shows that the primary result of the paper may not be robust. | ||||
| 3. The estimates in the original paper are from intention to treat, we use an instrumental variables approach to estimate the treatment effect on the treated | ||||
| The Regai Dzive Shiri Project: results of a randomised trial of an HIV prevention intervention for Zimbabwean youth by Cowan and colleagues (2011) [ | 1. The original paper conducted separate intent-to-treat analysis for males and females. | 1. Evaluate the representativeness of the participants based on their characteristics | Despite an impact on knowledge, on some attitudes and on reported pregnancy, there was no impact of this intervention on HIV or HSV-2 prevalence, further evidence that behavioural interventions alone are unlikely to be sufficient to reverse the HIV epidemic. | 1. Pure replication validates the original findings. |
| 2. Apply multilevel modelling to account for the hierarchical structure of the data | 2. The amount of exposure to the intervention an individual received affected knowledge and attitude outcomes and a few risky sexual behaviours. However, increased knowledge and attitudes was not associated with decreased HIV or HSV-2 prevalence. | |||
| 3. Evaluate the impact of the intervention on multilevel attitude or knowledge outcomes | ||||
| 4. Evaluate the impact of the intervention among participants based on the level of the intervention they actually received | ||||
| 2. The unadjusted odds ratios (UORs) and AORs were computed using generalized estimating equations (GEEs) with exchangeable correlation and robust standard errors, which allowed for intraclass correlation among clusters. | ||||
| 5. Evaluate heterogeneous impacts of the intervention on HIV or HSV-2 among different age or history of risky sexual behaviour groups | ||||
| 3. When calculating the AORs, the GEE model included age, strata, marriage and highest level of education as fixed effects. |
Source: Authors ‘construction.