| Literature DB >> 31744489 |
Anthony Chauvin1,2,3, Philippe Ravaud4,5, David Moher6, David Schriger7, Sally Hopewell8, Daniel Shanahan9, Sabina Alam10, Gabriel Baron4,5, Jean-Philippe Regnaux4,5, Perrine Crequit4,5, Valeria Martinez11, Carolina Riveros4,5, Laurence Le Cleach12, Alessandro Recchioni13, Douglas G Altman8, Isabelle Boutron4,5.
Abstract
BACKGROUND: The peer review process has been questioned as it may fail to allow the publication of high-quality articles. This study aimed to evaluate the accuracy in identifying inadequate reporting in RCT reports by early career researchers (ECRs) using an online CONSORT-based peer-review tool (COBPeer) versus the usual peer-review process.Entities:
Keywords: CONSORT statement; Peer reviewers; Randomized controlled trials; Reporting
Mesh:
Year: 2019 PMID: 31744489 PMCID: PMC6864983 DOI: 10.1186/s12916-019-1436-0
Source DB: PubMed Journal: BMC Med ISSN: 1741-7015 Impact factor: 8.775
Fig. 1Example of the CONSORT-based peer-review tool (COBPeer)
Fig. 2Example of the CONSORT-based peer-review tool (COBPeer)
Assessment of domains by usual peer reviewer, reference standard, and early career peer reviewer
| Usual peer reviewer | ➢ For CONSORT domains: For each manuscript included, determine whether the peer reviewers and/or editors raised some concern on the completeness of reporting of the following CONSORT items. The assessment of all peer-review reports and editors’ comments for each manuscript need to be combined. - Yes, some concern was raised - No, some concern was not raised ➢ For switched outcomes: For each manuscript included, check whether peer reviewers and/or editors identified inconsistency between data registered and reported for the primary outcome(s). - Yes, inconsistency was detected - No, inconsistency was not detected - Not available, because the study was not registered or the protocol was not available Comments: For blinding domains researchers could quote “unblinded study.” Moreover, if the domain was considered partially reported or not reported, it was quoted as not reported. |
| Reference standard | ➢ For CONSORT domains: Now you will have to evaluate in each RCT if authors correctly reported all key elements of selected CONSORT items. Please evaluate whether authors correctly reported all key elements of the domain considered. Rate items as inadequately reported only if the reporting is a real barrier to the conduct of a systematic review. - Completely reported - Partially reported - Not reported ➢ For switched outcomes: Did authors register their protocol after the beginning of the study? - Yes - No - Not available Inconsistency between data registered and reported for the primary outcome(s) (i.e., at least one primary outcome added, deleted, or changed)? - Yes - No - Not available, because the study was not registered or the protocol is not available - Unable to assess (i.e., outcomes insufficiently described in the register) Comments: For blinding domains researchers could quote “not available because blinding was impossible.” Moreover, domains partially reported or not reported were quoted as not reported. |
| Early career peer reviewer | See Additional file |
General characteristics of early career peer reviewers (ECRs) participating in the study (n = 119)
| Sex | |
| Male | 72 (60.5) |
| Country | |
| France | 52 (43.7) |
| USA | 20 (16.8) |
| Canada | 18 (15.1) |
| UK | 14 (11.8) |
| Other European country | 12 (10.1) |
| South America | 2 (1.7) |
| Africa | 1 (0.8) |
| Professional background | |
| Physician | 106 (89.1) |
| Student | 11 (9.2) |
| Other | 2 (1.7) |
| Academic background | |
| Master of Science | 49 (41.2) |
| Doctor of Medicine (MD) | 36 (30.2) |
| PhD | 32 (26.9) |
| Other | 2 (1.7) |
| How did you hear about this training program? | |
| Faculty | 84 (70.6) |
| Social network | 11 (9.2) |
| Network of international students | 7 (5.9) |
| Editors of biomedical journals | 2 (1.7) |
| Learned societies | 2 (1.7) |
| Other | 13 (10.9) |
| Previously trained to perform a peer review | 19 (16.0) |
| Previously trained to appraise an RCT report | 77 (64.7) |
Data are n (%)
RCT randomized controlled trial
Results for ECRs when performing the training module. The module contained two extracts per domains that the ECR had to evaluate. The answer was considered appropriate when all bullet points were correctly assessed
| Items assessed | Percentage of correct answers |
|---|---|
| - Item 6a (Outcomes) | 83/238 (34.9) |
| - Item 8a (Randomization/sequence generation)α | 221/237 (92.3) |
| - Item 9 (Allocation concealment mechanism) | 186/238 (78.2) |
| - Item 11a/b (Blinding) | 107/238 (45.0) |
| - Item 13a/b (Participant flow) | 77/238 (32.4) |
| - Item 17a (Outcomes and estimation) | 152/238 (63.9) |
| - Item 19 (Harms)β | 97/238 (41.1) |
| - Item 23 (Trial registration) | 238/238 (100.0) |
| Switch in primary outcomesδ | |
| - Outcome(s) reported by the authors as primary outcome(s) while not registered as such | 116/198 (58.6) |
| - Outcome(s) registered as primary outcome but not reported as such in the manuscript | 103/198 (52.0) |
Data are n (%)
αOne missing value, βTwo missing values, δ40 missing values. All missing values due because of a technical issue
Completeness of reporting and a switch in primary outcome as rated by the reference standard (two systematic reviewers) for the sample of 119 manuscripts
| CONSORT items incompletely reported | |
|---|---|
| - Item 6a (Outcomes) | 58 (48.7) |
| - Item 8a (Randomization/sequence generation) | 38 (31.9) |
| - Item 9 (Allocation concealment mechanism) | 62 (52.1) |
| - Items 11a/11b (Blinding) | 51 (42.9) |
| - Items 13a/13b (Participant flow) | 39 (32.8) |
| - Item 17a (Outcomes and estimation) | 48 (40.3) |
| - Item 19 (Harms) | 71 (59.7) |
| - Item 23 (Trial registration) | 18 (15.1) |
| Switch in primary outcomes | |
| - Yes | 36 (30.3) |
Data are n (%)
Accuracy of ECRs using COBPeer and usual peer review in detecting inadequate reporting (i.e., items completely reported or a switch in primary outcome) by the reference standard (two systematic reviewers). ECR had to evaluate 9 domains in 119 manuscripts, therefore 1071 items
| Inadequate reporting identified | Reference standard | |
|---|---|---|
| Yes | No | |
| ECRs | ||
| Yes | 352 | 241 |
| No | 69 | 409 |
| Usual peer review | ||
| Yes | 80 | 131 |
| No | 341 | 519 |
Sensitivity, specificity, positive likelihood ratio, and negative likelihood ratio for usual peer review and ECRs using COBPeer for each item
| Sensitivity (%) | Specificity (%) | Positive likelihood ratio | Negative likelihood ratio | |||||||
|---|---|---|---|---|---|---|---|---|---|---|
| ECR | Usual peer review | ECR | Usual peer review | ECR | Usual peer review | ECR | Usual peer review | |||
| All domains | 83.6 [79.7–87.0] | 19.0 [15.4–23.1] | < 0.001 | 62.9 [59.1–66.6] | 79.9 [76.6–82.9] | < 0.001 | 2.26 [2.02–2.51] | 0.94 [0.73–1.21] | 0.26 [0.21–0.33] | 1.01 [0.96–1.08] |
| All CONSORT domains | 85.7 [81.8–89.1] | 19.7 [15.9–24.1] | < 0.001 | 60.9 [56.7–64.9] | 77.2 [73.6–80.6] | < 0.001 | 2.19 [1.96–2.44] | 0.87 [0.67–1.12] | 0.23 [0.18–0.30] | 1.04 [0.97–1.11] |
| - Item 6a (Outcomes) | 94.8 [85.6–98.9] | 27.6 [16.7–40.9] | < 0.001 | 42.6 [30.0–55.9] | 57.4 [44.1–70.0] | 0.16 | 1.65 [1.32–2.07] | 0.65 [0.39–1.08] | 0.12 [0.04–0.38] | 1.26 [0.97–1.65] |
| - Item 8a (Randomization/sequence generation) | 76.3 [59.8–88.6] | 34.2 [19.6–51.4] | < 0.001 | 85.2 [75.6–92.1] | 76.5 [65.8–85.2] | 0.21 | 5.15 [2.97–8.94] | 1.46 [0.81–2.63] | 0.28 [0.16–0.50] | 0.86 [0.66–1.11] |
| - Item 9 (Allocation concealment mechanism) | 82.3 [70.5–90.8] | 8.1 [2.7–17.8] | < 0.001 | 71.9 [58.5–83.0] | 87.7 [76.3–94.9] | 0.05 | 2.93 [1.90–4.51] | 0.66 [0.22–1.95] | 0.25 [0.14–0.43] | 1.04 [0.93–1.18] |
| - Items 11/1b (Blinding) | 56.9 [42.2–70.7] | 19.6 [9.8–33.1] | < 0.001 | 76.5 [64.6–85.9] | 83.8 [72.9–91.6] | 0.38 | 2.42 [1.48–3.95] | 1.21 [0.56–2.63] | 0.56 [0.40–0.79] | 0.96 [0.81–1.14] |
| - Items 13a/13b (Participant flow) | 92.3 [79.1–98.3] | 17.9 [7.5–33.5] | < 0.001 | 21.3 [12.9–31.8] | 78.8 [68.2–87.1] | < 0.001 | 1.17 [1.01–1.36] | 0.84 [0.38–1.87] | 0.36 [0.11–1.16] | 1.04 [0.87–1.25] |
| - Item 17a (Outcomes and estimation) | 93.8 [82.8–98.7] | 33.3 [20.4–48.4] | < 0.001 | 56.3 [44.0–68.1] | 64.8 [52.5–75.8] | 0.42 | 2.15 [1.63–2.82] | 0.95 [0.57–1.58] | 0.11 [0.04–0.34] | 1.03 [0.79–1.34] |
| - Item 19 (Harms) | 97.2 [90.2–99.7] | 9.9 [4.1–19.3] | < 0.001 | 8.3 [2.3–20.0] | 75.0 [60.4–86.3] | < 0.001 | 1.06 [0.97–1.16] | 0.39 [0.17–0.93] | 0.34 [0.06–1.77] | 1.20 [1.00–1.44] |
| - Item 23 (Trial registration) | 88.9 [65.3–98.6] | 11.1 [1.4–34.7] | < 0.001 | 95.1 [88.8–98.4] | 88.1 [80.2–93.7] | 0.09 | 17.96 [7.52–42.86] | 0.94 [0.23–3.83] | 0.12 [0.03–0.43] | 1.01 [0.84–1.21] |
| Switched outcomes | 61.1 [43.4–76.9] | 11.1 [3.1–26.1] | < 0.001 | 77.1 [66.6–85.6] | 97.6 [91.6–99.7] | < 0.001 | 2.67 [1.66–4.28] | 4.61 [0.88–24.05] | 0.50 [0.33–0.77] | 0.91 [0.81–1.03] |
Fig. 3Proportions of items evaluated by early career reviewers and usual peer review classified as true positive (TP), false negative (FN), true negative (TN), and false positive (FP)