| Literature DB >> 33076975 |
Shivani M Reddy1, Sheila Patel2, Meghan Weyrich3, Joshua Fenton3, Meera Viswanathan2.
Abstract
BACKGROUND: The exponential growth of the biomedical literature necessitates investigating strategies to reduce systematic reviewer burden while maintaining the high standards of systematic review validity and comprehensiveness.Entities:
Mesh:
Year: 2020 PMID: 33076975 PMCID: PMC7574591 DOI: 10.1186/s13643-020-01450-2
Source DB: PubMed Journal: Syst Rev ISSN: 2046-4053
Fig. 1Semi-automation screening tests with RobotAnalyst
Fig. 2Semi-automation screening tests with AbstrackR
Fig. 3Outcome definitions and confusion matrix
Fig. 4Traditional approach PRISMA
Fig. 5Review of reviews PRISMA
Sensitivity and specificity of review of reviews approach compared with traditional review approach
| Traditional review approach | ||||
|---|---|---|---|---|
| Include | Exclude | Total | ||
| Include | 33 | 0 | 33 | |
| Exclude | 26 | 3122 | 3148 | |
| Total | 59 | 3122 | 3181 | |
*Of the 3122 excluded studies, 92 were excluded after full-text review and 3030 were citations not retrieved through the review of reviews approach
Comparison of inclusion criteria for the systematic review update and systematic reviews in review of reviews (ROR) approach
| Systematic review update Inclusion criteria | SRs with inclusion criteria matching SR update eligibility criteria* | ||
|---|---|---|---|
| % | |||
| Population | Localized prostate cancer | 27 | 90 |
| Interventions studied in one or more arm | Active therapy (surgery and/or radiation) | 17 | 57 |
| Ablative therapy only | 9 | 30 | |
| Comparators studied in one or more arm | Active treatment comparator | 17 | 57 |
| Conservative treatment comparator | 11 | 37 | |
| No comparator | 8 | 27 | |
| Study design | Randomized and non-randomized trials | 20 | 67 |
| Observational studies | 14 | 47 | |
| Single arm studies | 8 | 27 | |
| Total | 30 | 100 | |
*Studies may include multiple arms or multiple study types; percentages do not sum to 100
Characteristics of studies missed by the review of reviews approach
| Sample size | Subtype | Studies missed by the ROR search* | |
|---|---|---|---|
| % | |||
| Interventions studied in one or more arm | Surgery | 15 | 60 |
| Radiation therapy | 15 | 60 | |
| Ablative therapy | 10 | 40 | |
| Comparators | Active treatment comparator | 15 | 60 |
| Conservative treatment comparator | 8 | 32 | |
| No comparator | 8 | 32 | |
| Study design | Randomized and non-randomized trials | 1 | 4 |
| Observational studies | 24 | 88 | |
| Single arm studies or within-treatment comparisons† | 2 | 8 | |
| Year of publication | 2014 | 3 | 12 |
| 2015 | 6 | 24 | |
| 2016 | 7 | 28 | |
| 2017 | 8 | 32 | |
| 2018 | 1 | 4 | |
| Total | 25‖ | 100.0 | |
*Studies may have multiple arms; percentages do not sum to 100
†Single-arm studies (or within-treatment comparisons) were eligible only for ablative therapies
‖The ROR approach missed a total of 26 eligible citations, which one was a companion study and one was the Fenton et al. review. The analysis is limited to the 25 primary studies
Abbreviations: N number, NA not applicable, RCT randomized controlled trial, ROR review of reviews
Semi-automation test with RobotAnalyst using a training set of dually reviewed randomly selected citations with labels from title and abstract screening
| Inclusion prediction: 0.3 | Inclusion prediction: 0.4 | Inclusion prediction: 0.5 | |
|---|---|---|---|
| Predicted includes | 2168 | 1970 | 1363 |
| Predicted excludes | 22 | 220 | 827 |
| Sensitivity | 100% | 93% | 74% |
| Specificity | 30% | 36% | 55% |
| Missed citations | 0 | 3 | 12 |
| Burden | 99% | 93% | 74% |
| Time savings (min) | 11 | 110 | 413.5 |
FP false positive, ML machine learning, TP true positive, TN true positive
Semi-automation test with RobotAnalyst using a training set of dually reviewed citations from a review-of-review with labels from full-text screening
| Inclusion prediction: 0.3 | Inclusion prediction: 0.4 | Inclusion Prediction: 0.5 | ||
|---|---|---|---|---|
| Predicted includes | NA | 3040 | 2819 | 1166 |
| Predicted excludes | NA | 16 | 237 | 1890 |
| Sensitivity | 54% | 100% | 97% | 69% |
| Specificity | 100% | 3% | 10% | 63% |
| Missed citations | 26 | 0 | 2 | 18 |
| Burden | 4% | 99% | 93% | 41% |
| Time savings (min) | 1561 | 8 | 118.5 | 945 |
FP false positive, ML machine learning, NA not applicable, TP true positive, TN true positive
Fig. 6Burden and sensitivity outcomes for RobotAnalyst tests using alternative training sets
Semi-automation test with RobotAnalyst using a training set of dually reviewed citations from a review-of-review and randomly selected citations with labels from full-text screening
| Inclusion prediction: 0.3 | Inclusion prediction: 0.4 | Inclusion prediction: 0.5 | |
|---|---|---|---|
| Predicted includes | 2094 | 1765 | 676 |
| Predicted excludes | 24 | 353 | 1442 |
| Sensitivity | 98% | 84% | 80% |
| Specificity | 34% | 44% | 79% |
| Missed citations | 1 | 3 | 12 |
| Burden | 99% | 89% | 55% |
| Time savings (min) | 12 | 177 | 721 |
FP false positive, ML machine learning, TP true positive, TN true positive
Fig. 7Sensitivity and burden using AbstrackR active learning for title-abstract screening