| Literature DB >> 35759334 |
Joe Menke1,2, Peter Eckmann2,3, Ibrahim Burak Ozyurt2,3, Martijn Roelandse4, Nathan Anderson2, Jeffrey Grethe2,3, Anthony Gamst5, Anita Bandrowski2,3.
Abstract
BACKGROUND: Improving rigor and transparency measures should lead to improvements in reproducibility across the scientific literature; however, the assessment of measures of transparency tends to be very difficult if performed manually.Entities:
Keywords: cell line authentication; data and code availability; reporting transparency; reproducibility crisis; research metric; research reproducibility; rigor and transparency; science of science; university ranking
Mesh:
Year: 2022 PMID: 35759334 PMCID: PMC9274430 DOI: 10.2196/37324
Source DB: PubMed Journal: J Med Internet Res ISSN: 1438-8871 Impact factor: 7.076
Conditional scoring groupings and logic for rigor adherence section.
| Grouping | Criteria included | If this grouping is detected, what is expected? | This grouping is expected when what is detected? |
| Ethics-1 | Institutional Animal Care and Use Committee, Institutional Review Board, and consent | Group selection, sex, demographics, random, blinding, and power | Euthanasia |
| Ethics-2 | Field sample permit | Random, blinding, and power | Never expected |
| Euthanasia | Euthanasia statement and euthanasia agent | Ethics-1, group selection, sex, demographics, random, blinding, and power | Never expected |
| Group selection | Inclusion and exclusion criteria and attrition | Random, blinding, and power | Ethics-1 and euthanasia |
| Sex | Sex | Random, blinding, and power | Ethics-1, euthanasia, and demographics |
| Demographics | Age and weight | Sex, random, blinding, and power | Ethics-1 and euthanasia |
| Random | Random | Blinding and power | Always expected |
| Blinding | Blinding | Random and power | Always expected |
| Power | Power analysis | Random and blinding | Always expected |
| Replication | Replication statement, number of replications, and type of replication | Random, blinding, and power | Never expected |
| Cell line authentication | Cell line authentication and cell line contamination | Sex, random, blinding, and power | Cell lines |
| Methods and materials availabilitya | Data availability, data identifiers, code availability, code identifiers, and protocol identifiers | Never expected; do not affect score | Never expected; do not affect score |
| Cell lines | Cell lines | Cell line authentication | Never expected |
| Other resourcesb | Antibodies, organisms, plasmids, and tools | Never expected; only affects resource transparency score | Never expected; only affects resource transparency score |
| Miscellaneousa | Oligonucleotides, statistical tests, and incorrect research resource identifiers | Never expected; does not affect either score | Never expected; does not affect either score |
aRow indicates criteria that do not affect any score.
bRow indicates criteria that do not affect the rigor adherence score, only the resource transparency score.
Rates of false negatives, false positives, and overall agreement based on manual analysis of 250 scored papers (SciScore >0) from our data set.
| Entity type | False positives | False negatives | Overall agreement | |
|
| Size and rate, n (%) | Size and rate, n (%) | Size and rate, (agreed, n) (%) | |
|
| ||||
|
| Institutional review board statement | 14 (5.6) | 11 (4.4) | 225 (90) |
|
| Consent statement | 1 (0.4) | 11 (4.4) | 238 (95.2) |
|
| Institutional animal care and use committee statement | 2 (0.8) | 17 (6.8) | 231 (92.4) |
|
| Field sample permit | 19 (7.6) | 0 (0) | 231 (92.4) |
|
| Euthanasia | 6 (2.4) | 7 (2.8) | 237 (94.8) |
|
| Inclusion and exclusion criteria | 10 (4) | 17 (6.8) | 223 (89.2) |
|
| Attrition | 35 (14) | 7 (2.8) | 208 (83.2) |
|
| Type of replication | 0 (0) | 3 (1.2) | 247 (98.8) |
|
| Number of replications | 17 (6.8) | 16 (6.4) | 217 (86.8) |
|
| General replication | 13 (5.2) | 16 (6.4) | 221 (88.4) |
|
| Randomization of participants into groups | 20 (8) | 4 (1.6) | 226 (90.4) |
|
| Blinding of investigator or analysis | 5 (2) | 5 (2) | 240 (96) |
|
| Power analysis for group size | 12 (4.8) | 4 (1.6) | 234 (93.6) |
|
| Sex as a biological variable | 6 (2.4) | 21 (8.4) | 223 (89.2) |
|
| Age | 5 (2) | 44 (17.6) | 201 (80.4) |
|
| Weight | 6 (2.4) | 22 (8.8) | 222 (88.8) |
|
| Cell line authentication | 15 (6) | 1 (0.4) | 234 (93.6) |
|
| Cell line contamination check | 0 (0) | 0 (0) | 250 (100) |
|
| Protocol identifiers | 3 (1.2) | 2 (0.8) | 245 (98) |
|
| Code availability | 4 (1.6) | 1 (0.4) | 245 (98) |
|
| Code identifiers | 0 (0) | 2 (0.8) | 248 (99.2) |
|
| Data availability | 24 (9.6) | 0 (0) | 226 (90.4) |
|
| Data identifiers | 27 (10.8) | 3 (1.2) | 220 (88) |
|
| ||||
|
| Antibody | 2 (0.8) | 5 (2) | 243 (97.2) |
|
| Organism | 3 (1.2) | 7 (2.8) | 240 (96) |
|
| Cell line | 6 (2.4) | 4 (1.6) | 240 (96) |
|
| Software project and tools | 8 (3.2) | 38 (15.2) | 204 (81.6) |
Figure 1Disambiguation of affiliation strings workflow. ROR: Research Organization Registry; regexA: exact regular expression.
Affiliation to institution matching: in-house tool compared with the Research Organization Registry (ROR) application programming interface and a human-curated set of 200 affiliations.
| Confidence | Time per affiliation (ms) | Accuracy | ||
|
| In-house | ROR | In-house | ROR |
| High only | 1.759 | 400.90 | 0.5323 | 0.6666 |
| High and low | 9.745 | 400.90 | 0.7043 | 0.7043 |
Figure 2Average score for Rigor and Transparency Index (RTI), version 1.0 (1997-2019) and version 2.0 (1997-2020). PubMed Central- Open Archives Initiative steadily increases over time. Differences between versions are negligible.
Figure 3Proportion of papers addressing various bias limiting criteria (ie, blinding, randomization, power, and replication) across all scored papers (1997-2020).
Figure 4Data, code, and protocol addressment across all papers (1997-2020).
Figure 5Data shown from 1997 to 2020. Left axis shows the percentage of papers containing cell lines that authenticate, check cells for contamination, and include sex. Right axis shows the number of papers using cell lines each year.
Figure 6Data from 1997 to 2020. Percentage of papers describing demographic information (sex, age, or weight) that contain at least one transgenic organism.
Figure 7Analysis of Rigor and Transparency Index (RTI) across research institutions in 2020. The left axis represents the RTI. The 50 institutions with the most papers published in 2020 were ranked according to their RTI. The 25 institutions with the highest RTI are shown.
Spearman rank-order correlation coefficient calculations between the number of academic staffs, the total revenue, and Rigor and Transparency Index (RTI).a The partial correlation coefficient between revenue and RTI was calculated to be -0.1154.
|
| Total academic staff | Current total revenue | RTI |
| Total academic staff | 1 | N/Ab | N/A |
| Total current revenue | 0.6208 | 1 | N/A |
| RTI | −0.1209 | −0.1648 | 1 |
aData from 2013. A partial correlation was calculated between total revenue and RTI correcting for the number of academic staffs.
bN/A: not applicable.
Figure 8Clustering and Rigor and Transparency (RTI) ranking of the top 80 UK departments by paper count are shown. The t-distributed stochastic neighbor embeddings of the semantic vector representation of each department’s average paper abstract is shown, with k-means clusters indicated by coloring (left panel). Field names are shown for clusters with a single unifying theme among all departments. The labels were added by hand for presentation purposes. We also show the average RTIs of each department (right panel).
Figure 9Measured SciScores for Cancer Reproducibility Project papers. Original papers are in blue, registered reports are in orange, and replication studies are in green. A smoothed density plot of scores is shown in solid color. The white dot represents the median score, the thick black line the interquartile range (IQR), and the thin black line 1.5x IQR.