| Literature DB >> 28236806 |
Lavinia Ferrante di Ruffano1, Jacqueline Dinnes1, Alice J Sitch1, Chris Hyde2, Jonathan J Deeks3.
Abstract
BACKGROUND: There is a growing recognition for the need to expand our evidence base for the clinical effectiveness of diagnostic tests. Many international bodies are calling for diagnostic randomized controlled trials to provide the most rigorous evidence of impact to patient health. Although these so-called test-treatment RCTs are very challenging to undertake due to their methodological complexity, they have not been subjected to a systematic appraisal of their methodological quality. The extent to which these trials may be producing biased results therefore remains unknown. We set out to address this issue by conducting a methodological review of published test-treatment trials to determine how often they implement adequate methods to limit bias and safeguard the validity of results.Entities:
Keywords: Bias; Diagnostic accuracy; Methodological quality; Patient outcomes; RCT; Test evaluation; Test-treatment
Mesh:
Year: 2017 PMID: 28236806 PMCID: PMC5326492 DOI: 10.1186/s12874-016-0287-z
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.615
Fig. 1Example of a replacement test-treatment RCT. Patients randomized to the experimental arm receive a test for the presence of Helicobacter pylori, which is eradicated if found, while patients without bacterial infection are given proton pump inhibitors (acid suppression). Patients randomized to the control arm receive no test and are all given proton pump inhibitors (acid suppression), reflecting standard care. The outcome of the trial is eradication of dyspeptic symptoms at 12 months [16]
Search strategy for test-treatment RCTs conducted in CENTRAL Issue 2 2009
| Search strategy | Hits | |
|---|---|---|
| #1 | sensitiva or diagnose or diagnosis or diagnostica in Clinical Trials | 70,052 |
| #2 | randoma in Clinical Trials | 335,175 |
| #3 | “study design” next “rct” in Clinical Trials | 150,275 |
| #4 | (#2 OR #3) | 449,453 |
| #5 | (#1 AND #4) | 50,419 |
| #6 | (#5), from 2004 to 2007 | 12,892 |
adenotes truncation of search term
Definitions and criteria used to appraise the quality of trial methods and conduct
|
| |
|
| |
|
| |
|
| |
|
| |
|
| |
|
|
Types of outcomes measured as primary endpoints in test-treatment RCTs
| Outcome Type | Trials, | (%) |
aOutcome measurements, | (%) |
|---|---|---|---|---|
| Patient | ||||
| Symptom score | 13 | (25) | 14 | (9) |
| Adverse events | 8 | (15) | 15 | (10) |
| Function | 8 | (15) | 11 | (7) |
| Quality of life | 5 | (9) | 17 | (11) |
| Mortality | 4 | (8) | 4 | (3) |
| Health perception | 2 | (4) | 5 | (3) |
| Psychological morbidity | 2 | (4) | 6 | (4) |
| Absenteeism | 1 | (2) | 1 | (1) |
| Clinical status | 9 | (17) | 9 | (6) |
| Residual disease rate | 7 | (13) | 7 | (5) |
| Recurrent disease rate | 6 | (11) | 7 | (5) |
| Patient outcome total | 53 | (54) | 96 | (64) |
| Process | ||||
| Therapeutic yield | 17 | (45) | 20 | (13) |
| Timing of care | 8 | (21) | 8 | (5) |
| Cost | 7 | (18) | 7 | (5) |
| Appropriateness of treatment decision | 5 | (13) | 6 | (4) |
| Diagnostic yield | 4 | (11) | 5 | (3) |
| Process outcome total | 38 | (39) | 46 | (31) |
| Composite outcome | ||||
| Adverse patient and process event rate | 7 | (7) | 7 | (5) |
| Primary outcome not defined | 6 | (6) | 0 | (0) |
| Total | 103 | (100) | 149 | (100) |
aMany trials included more than one primary outcome
Methodological characteristics of test-treatment trials
| Trial Quality Item | Trials, | (%) |
|---|---|---|
| Randomized sequence allocation | ||
| Adequate | 59 | (57) |
| Inadequate | 2 | (2) |
| Unclear | 42 | (41) |
| Allocation concealment | ||
| Adequate | 38 | (37) |
| Inadequate | 3 | (3) |
| Unclear | 62 | (60) |
| Blinding | ||
| Patients | 5 | (5) |
| Care-providers | 4 | (4) |
| Outcome assessors | 22 | (21) |
| aSingle-blind | 20 | (19) |
| bDouble-blind | 5 | (5) |
| No blinding | 78 | (76) |
| Reporting of primary outcome assessment | ||
| Complete | 53 | (51) |
| Partial | 33 | (32) |
| Absent | 17 | (17) |
| Reporting of participant flow | ||
| Complete | 44 | (43) |
| Partial | 58 | (56) |
| Absent | 1 | (1) |
| Missing data | ||
| Complete | 41 | (40) |
| Attrition ≤10% | 30 | (29) |
| Attrition >10% | 25 | (24) |
| Incomplete, cannot calculate | 4 | (4) |
| Unclear if complete | 3 | (3) |
| Differential attrition | ||
| ≥ 5% between arms | 16 | (16) |
| ≥ 20% between arms | 1 | (1) |
| Intention-to-treat (ITT) | ||
| Patients analyzed as randomized | 72 | (70) |
| Complete or imputed data and analyzed as randomized | 30 | (29) |
| Not conducted | 31 | (30) |
| Inconsistent outcome assessment | 21 | (20) |
| Inappropriate subgroup analysis | 9 | (9) |
| Sample size | ||
| Power calculation reported | 81 | (79) |
| cMedian trial sample size [IQR] | 309 | [153–731] |
| cMedian study arm sample size [IQR] | 166 | [72–297] |
ablinding either patients or care-providers or outcome assessors; bblinding at least two of: patients, care-providers or outcome assessors; cbased on numbers randomized in all 103 trials
Fig. 2Example of an inappropriate subgroup comparison, leading to differential attrition of >20%. This triage comparison trial compared a strategy of only undertaking diagnostic laparoscopy in women who had failed first-line intrauterine insemination (IUI) rather than undertaking laparoscopy in all women prior to fertility treatment [59]. The primary outcome was the proportion of women experiencing a change in fertility treatment from IUI. The published analysis used the proportion of participants undergoing diagnostic laparoscopy as the denominator rather than the number randomized in each arm. The authors analysis reported a non-significant small increase (experimental 13/23 (56%), control 31/64 (48%); OR = 1.4 [95%CI: 0.5–3.6]). However when the full study population is used a significant decrease in the proportion of women receiving a change in treatment is observed (experimental 13/77 (17%), control 31/77 (40%); OR = 0.3 [95%CI: 0.14–0.64]). Excluding participants who did not receive a laparoscopy (70% of experimental group participants, and 17% of comparator arm participants) all experimental group patients who became pregnant during intrauterine insemination treatment were excluded from the effectiveness measurement introducing selection bias
Fig. 3Example of inconsistent measurement of the primary outcome between study arms. The primary outcome of the number of days unnecessarily immobilized is not comparable as the assessment that the plaster is unnecessary and can be removed is determined using MRI findings in the experimental arm and by routine testing (commonly X-ray) in the comparator arm [38]