| Literature DB >> 34190996 |
N Bryce Robinson1, Stephen Fremes2, Irbaz Hameed1,3, Mohamed Rahouma1, Viola Weidenmann1, Michelle Demetres4, Mahmoud Morsi1, Giovanni Soletti1, Antonino Di Franco1, Marco A Zenati5, Shahzad G Raja6, David Moher7, Faisal Bakaeen8,9, Joanna Chikwe10, Deepak L Bhatt11, Paul Kurlansky12, Leonard N Girardi1, Mario Gaudino1.
Abstract
Importance: Randomized clinical trials (RCTs) provide the highest level of evidence to evaluate 2 or more surgical interventions. Surgical RCTs, however, face unique challenges in design and implementation. Objective: To evaluate the design, conduct, and reporting of contemporary surgical RCTs. Evidence Review: A literature search performed in the 2 journals with the highest impact factor in general medicine as well as 6 key surgical specialties was conducted to identify RCTs published between 2008 and 2020. All RCTs describing a surgical intervention in both experimental and control arms were included. The quality of included data was assessed by establishing an a priori protocol containing all the details to extract. Trial characteristics, fragility index, risk of bias (Cochrane Risk of Bias 2 Tool), pragmatism (Pragmatic Explanatory Continuum Indicator Summary 2 [PRECIS-2]), and reporting bias were assessed. Findings: A total of 388 trials were identified. Of them, 242 (62.4%) were registered; discrepancies with the published protocol were identified in 81 (33.5%). Most trials used superiority design (329 [84.8%]), and intention-to-treat as primary analysis (221 [56.9%]) and were designed to detect a large treatment effect (50.0%; interquartile range [IQR], 24.7%-63.3%). Only 123 trials (31.7%) used major clinical events as the primary outcome. Most trials (303 [78.1%]) did not control for surgeon experience; only 17 trials (4.4%) assessed the quality of the intervention. The median sample size was 122 patients (IQR, 70-245 patients). The median follow-up was 24 months (IQR, 12.0-32.0 months). Most trials (211 [54.4%]) had some concern of bias and 91 (23.5%) had high risk of bias. The mean (SD) PRECIS-2 score was 3.52 (0.65) and increased significantly over the study period. Most trials (212 [54.6%]) reported a neutral result; reporting bias was identified in 109 of 211 (51.7%). The median fragility index was 3.0 (IQR, 1.0-6.0). Multiplicity was detected in 175 trials (45.1%), and only 35 (20.0%) adjusted for multiple comparisons. Conclusions and Relevance: In this systematic review, the size of contemporary surgical trials was small and the focus was on minor clinical events. Trial registration remained suboptimal and discrepancies with the published protocol and reporting bias were frequent. Few trials controlled for surgeon experience or assessed the quality of the intervention.Entities:
Mesh:
Year: 2021 PMID: 34190996 PMCID: PMC8246313 DOI: 10.1001/jamanetworkopen.2021.14494
Source DB: PubMed Journal: JAMA Netw Open ISSN: 2574-3805
Figure 1. Randomized Clinical Trials During the Study Period
A-C, By year, outcome, and study design.
Trial Characteristics
| Characteristic | No. (%) (N = 388) |
|---|---|
| Journal of publication | |
| 1 (0.3) | |
| 103 (26.5) | |
| 47 (12.1) | |
| 10 (2.6) | |
|
| 9 (2.3) |
| 63 (16.3) | |
| 10 (2.6) | |
| 9 (2.3) | |
| 10 (2.6) | |
| 32 (8.2) | |
| 44 (11.3) | |
| 29 (7.5) | |
| 21 (5.4) | |
| 0 | |
| Location | |
| Africa | 4 (1.0) |
| Asia | 86 (22.2) |
| Australia | 12 (3.1) |
| Europe | 195 (50.3) |
| North America | 65 (16.8) |
| South America | 6 (1.5) |
| Multiple continents | 20 (5.2) |
| Specialty | |
| General surgery | 125 (32.2) |
| Orthopedic surgery | 116 (29.9) |
| Cardiothoracic Surgery | 93 (23.9) |
| Neurosurgery | 21 (5.4) |
| Vascular surgery | 19 (4.9) |
| Obstetrics and gynecology | 10 (2.6) |
| Urology | 2 (0.5) |
| Otolaryngology | 1 (0.3) |
| Transplant | 1 (0.3) |
| Multicenter trial | 167 (43.0) |
Trial Design of 388 Included Trials
| Variable | Frequency, No. (%) |
|---|---|
| Registration in trials registry | 242 (62.4) |
| Discrepancy between registered and primary outcome | 81 (33.5) |
| Superiority design | 329 (84.8) |
| Power, median (IQR), % | 80.0 (80.0-90.0) |
| Estimated relative treatment effect, median (IQR), % | 50.0 (24.7-63.3) |
| Estimated treatment effect of trials with a major clinical end point as primary outcome | 50.0 (24.5-67.9) |
| Estimated treatment effect of trials with a minor clinical end point as primary outcome | 46.6 (25.0-57.1) |
| Intention-to-treat as the primary analysis | 221 (56.9) |
| Noninferiority design | 55 (14.2) |
| Both noninferiority and superiority design | 3 (0.8) |
| Use of composite primary outcome | 82 (21.1) |
| Major clinical event as primary end point | 123 (31.7) |
| No. of patients screened, median (IQR) | 204 (105-465) |
| Sample size, median (IQR) | |
| Projected | 144 (86-299) |
| Final | 122 (70-245) |
| Duration of follow-up, median (IQR), mo | 24.0 (12.0, 32.0) |
| Type of primary outcome | |
| Time to event | 181 (46.7) |
| Quality of life | 50 (12.8) |
| Other scales | 157 (40.5) |
| Randomization | |
| Computer generated | 213 (54.9) |
| Envelope | 90 (23.2) |
| Random number table | 36 (9.3) |
| Telephone call to randomization center | 10 (2.6) |
| Drawing of lots | 2 (0.5) |
| Date of birth | 2 (0.5) |
| Flip of a coin | 1 (0.3) |
| No details given | 34 (8.7) |
| Blinding | |
| None | 74 (19.1) |
| Outcome assessor only | 61 (15.7) |
| Patient and outcome assessor | 60 (15.4) |
| Patient only | 32 (8.3) |
| Patient, outcome assessor, data analyst | 18 (4.6) |
| Outcome assessor and data analyst | 8 (2.1) |
| Data analyst only | 6 (1.5) |
| Patient, surgeon, outcome assessor, data analyst | 1 (0.3) |
| No details given | 128 (33.0) |
| Control for surgeons’ experience | |
| None | 303 (78.1) |
| Surgeons’ experience cut-off | 60 (15.5) |
| Pretrial training | 25 (6.4) |
| Monitoring of the intervention | |
| None | 371 (95.6) |
| Photo | 4 (1.0) |
| Video | 9 (2.3) |
| Site visit | 3 (0.8) |
| Data monitoring of outcomes | 1 (0.3) |
| Details of the experimental procedure | |
| None | 41 (10.6) |
| Limited | 226 (58.2) |
| Detailed | 121 (31.2) |
| Risk of bias assessment | |
| Low risk | 86 (22.2) |
| Some concerns | 211 (54.4) |
| High risk | 91 (23.5) |
| Funding | |
| External | 288 (74.2) |
| Industry | 96 (33.3) |
| Industry sponsor involved in the analysis | 51 (53.1) |
| Conflicts of interest | |
| First author with study sponsor | 34 (35.4) |
| Last author with study sponsor | 29 (30.2) |
| PRECIS-2 score, mean (SD)a | 3.52 (0.65) |
Abbreviations: IQR, interquartile range; PRECIS-2, Pragmatic Explanatory Continuum Indicator Summary 2.
PRECIS-2 uses a 5-point ordinal scale (ranging from very pragmatic to very explanatory) across 9 domains of trial design, including eligibility, recruitment, setting, organization, intervention delivery, intervention adherence, follow-up, primary outcome, and analysis.
Figure 2. Evaluation of Randomized Clinical Trials
A, Evaluation using the Pragmatic Explanatory Continuum Index Summary 2 (PRECIS-2) Tool. B, Evaluation using the Fragility Index. C, Evaluation with reporting bias.
Trial Implementation and Reporting
| Variable | No. (%) |
|---|---|
| Screened patients included, median (IQR). % | 76.8 (45.1-95.2) |
| Patients lost to follow up, median (IQR) | 4.0 (0.0-17.0) |
| Sample size lost to follow up, median (IQR), % | 3.3 (0.0-10.7) |
| Fragility index, median (IQR) | 3.0 (1.0-6.0) |
| Fragility Index minus patients lost to follow up, median (IQR) | 0.0 (0.0-3.0) |
| Crossovers, median (IQR), No. | 1.0 (0.0-6.5) |
| Crossover, median (IQR), % | 0.5 (0.0-3.0) |
| Trials | |
| With a favorable outcome | 166 (42.7) |
| With a neutral outcome | 212 (54.6) |
| Multiplicity | 175 (45.1) |
| Multiple treatment groups | 13 (7.4) |
| Multiple outcomes | 66 (37.7) |
| Multiple analyses of the same outcome | 66 (37.7) |
| Multiple outcomes + multiple analyses of the same outcome | 23 (13.2) |
| Multiple treatment groups + multiple outcomes | 4 (2.3) |
| Multiple treatment groups + multiple analyses of the same outcome | 3 (1.7) |
| Adjusted for multiple comparisons | 35 (20.0) |
| Bonferroni correction | 25 (71.4) |
| Tukey test | 7 (20.0) |
| Dunn test | 1 (2.9) |
| Gatekeeping or hierarchical testing | 1 (2.9) |
| Modified α value | 1 (2.9) |
| Reporting bias present | 109/211 (51.7) |
| Extent of reporting bias | |
| None | 102 (48.3) |
| In 1 section other than conclusion | 11 (5.2) |
| In conclusion only | 34 (16.1) |
| In 2 sections | 33 (15.6) |
| In all sections | 31 (14.7) |
| Citations, median (IQR), No. | 36 (15-91) |
Abbreviation: IQR, interquartile range.