| Literature DB >> 33196023 |
Joe Menke1, Martijn Roelandse2, Burak Ozyurt3, Maryann Martone4, Anita Bandrowski4.
Abstract
The reproducibility crisis is a multifaceted problem involving ingrained practices within the scientific community. Fortunately, some causes are addressed by the author's adherence to rigor and reproducibility criteria, implemented via checklists at various journals. We developed an automated tool (SciScore) that evaluates research articles based on their adherence to key rigor criteria, including NIH criteria and RRIDs, at an unprecedented scale. We show that despite steady improvements, less than half of the scoring criteria, such as blinding or power analysis, are routinely addressed by authors; digging deeper, we examined the influence of specific checklists on average scores. The average score for a journal in a given year was named the Rigor and Transparency Index (RTI), a new journal quality metric. We compared the RTI with the Journal Impact Factor and found there was no correlation. The RTI can potentially serve as a proxy for methodological quality.Entities:
Keywords: Bioinformatics; Biological Sciences; Biological Sciences Research Methodologies; Methodology in Biological Sciences
Year: 2020 PMID: 33196023 PMCID: PMC7644557 DOI: 10.1016/j.isci.2020.101698
Source DB: PubMed Journal: iScience ISSN: 2589-0042
Rigor Criteria with Applicable Guideline Source and Brief Description Listed
| Entity Type | Source | What is This? |
|---|---|---|
| Institutional Review Board Statement | MDAR | A statement (usually a single sentence) addressing IRB approval for biomedical research involving human subjects (or why IRB approval was not required) |
| Consent Statement | MDAR | A statement (usually a single sentence) addressing subject/patient consent in human research (or why consent was not required) |
| Institutional animal care and use committee statement | MDAR, ARRIVE | A statement (usually a single sentence) addressing IACUC ethical approval for research involving vertebrate organisms |
| Randomization of subjects into groups | MDAR, NIH, CONSORT, ARRIVE | Considered addressed when a statement describing whether randomization was used (e.g., assigning subjects to experimental groups, positions in a multiwell device, processing order, etc.) |
| Blinding of investigator or analysis | MDAR, NIH, CONSORT, ARRIVE | A statement discussing the degree to which experimenters were unaware (or blinded) of group assignment and/or outcome assessment |
| Power analysis for group size | MDAR, NIH, CONSORT, ARRIVE | A statement addressing how (and if) an appropriate sample size was computed |
| Sex as a biological variable | MDAR, NIH, CONSORT, ARRIVE | Reporting the sex of any and all organisms, cell lines, and human subjects |
| Cell Line Authentication | MDAR, NIH | A statement detailing how the cell lines used were authenticated (e.g., short tandem repeat analysis). This is only required when cell lines are detected |
| Cell Line Contamination Check | MDAR, NIH | A statement addressing the mycoplasma contamination status of the cell lines used. This is only required when cell lines are detected |
| Antibody | MDAR, NIH, STAR, RRID | SciScore attempts to find all antibody entities within the methods section. “Identifiable” antibodies are reported with any metadata required to uniquely identify the antibody used such as vendor, catalog number, clone ID, batch number, or RRID |
| Organism | MDAR, NIH, RRID, STAR, ARRIVE | SciScore attempts to find all organism entities within the methods section. “Identifiable” organisms are reported with any metadata required to uniquely identify the organism used such as vendor, catalog number, or RRID |
| Cell line | MDAR, NIH, STAR, RRID | SciScore attempts to find all cell line entities within the methods section. “Identifiable” cell lines are reported with any metadata required to uniquely identify the cell line used such as vendor, catalog number, or RRID |
| Plasmid | STAR, RRID | SciScore attempts to find all plasmid entities within the methods section. Plasmids were not used in this analysis |
| Oligonucleotide | STAR | SciScore attempts to find all oligonucleotide entities within the methods section. Oligonucleotides do not impact score and were not used in this analysis |
| Software Project/Tool | STAR, RRID | SciScore attempts to find all software tools within the methods section. “Identifiable” tools are reported with an RRID or are able to be uniquely identified through a distinct name/URL |
SciScore was trained to recognize each of these criteria, with the training dataset size and classifier performance listed in Table S1. Example sentences recognized by SciScore are listed for each criterion, and we have underlined the entity that was likely to be recognized by the tool. These types of entities were annotated by curators to train individual classifiers.
Institutional Review Board and consent statements are scored together as a block where detection of one or more of these entities will give the full point value for this section.
Cell line authentication and contamination statements are only scored when a cell line is detected in the key resources table and they are scored together, either of these will provide the full points for this section.
Entity type not used for analysis in the current paper.
Figure 1Overall Scores and Their Breakdown Shown between 1997 and 2019
(A) Average score of the dataset representative of the biomedical corpus showing a relatively steady increase over time.
(B) Percentage of papers mentioning the use of sex, blinding, randomization of subjects, and power analysis. Sex and randomization have increased significantly, whereas blinding and power analysis have increased but are still at relatively low rates.
(C) Percentages of key resources (antibodies, organisms, cell lines, and software tools) that are considered uniquely identifiable. Rates of software tools and antibodies have increased, whereas organisms and cell lines have remained relatively stagnant. Data underlying these graphs are available in Data S2.
Figure 2Percentage of Antibodies That Are Able to Be Uniquely Identified Shown by Journal with the Overall Trend across the Biomedical Literature Shown in Blue
A significant improvement can be seen starting in 2016 for Cell and eLife when STAR methods formatting and RRIDs were first implemented in their respective journals contributing to a noticeable improvement in antibody identifiability for the entire biomedical literature. Data underlying this graph are available in Data S3.
Top 15 Journals Sorted by Percent of Antibodies that Were Identifiable in 2019
| Papers Analyzed | Year | Journal | Antibodies Detected | Identifiable | % Identifiable |
|---|---|---|---|---|---|
| 13 | 2019 | 242 | 236 | 98% | |
| 16 | 2019 | 430 | 419 | 97% | |
| 24 | 2019 | 362 | 350 | 97% | |
| 42 | 2019 | 548 | 524 | 96% | |
| 11 | 2019 | 84 | 80 | 95% | |
| 13 | 2019 | 123 | 117 | 95% | |
| 12 | 2019 | 181 | 171 | 94% | |
| 17 | 2019 | 132 | 123 | 93% | |
| 14 | 2019 | 91 | 84 | 92% | |
| 11 | 2019 | 11 | 10 | 91% | |
| 240 | 2019 | 1,458 | 1,312 | 90% | |
| 491 | 2019 | 2,476 | 2,218 | 90% | |
| 75 | 2019 | 344 | 308 | 90% | |
| 18 | 2019 | 62 | 55 | 89% | |
| 21 | 2019 | 69 | 58 | 84% |
For this analysis, there were 682 journals in which more than 10 antibody containing articles were accessible in our dataset.
Indicates participation by the journal in the RRID initiative as of 2019. The complete dataset is available as Data S7.
Top 15 Journals Sorted by Percent of Cell Line Authentication (Authentication or Contamination) that Were Identifiable in 2019.
| Papers Analyzed | Year | Title | Cell Lines Found | % Identifiable | % Authentication |
|---|---|---|---|---|---|
| 278 | 2019 | 849 | 55% | 71% | |
| 11 | 2019 | 39 | 33% | 64% | |
| 83 | 2019 | 302 | 41% | 54% | |
| 23 | 2019 | 87 | 31% | 52% | |
| 27 | 2019 | 85 | 47% | 52% | |
| 33 | 2019 | 95 | 36% | 52% | |
| 14 | 2019 | 86 | 35% | 43% | |
| 29 | 2019 | 65 | 38% | 41% | |
| 17 | 2019 | 23 | 43% | 41% | |
| 17 | 2019 | 42 | 36% | 41% | |
| 18 | 2019 | 67 | 36% | 39% | |
| 178 | 2019 | 485 | 40% | 38% | |
| 37 | 2019 | 157 | 37% | 38% | |
| 14 | 2019 | 48 | 44% | 36% | |
| 946 | 2019 | 2,601 | 38% | 34% |
There were 2,280 journals in which more than 180,316 articles and more than 388,337 cell lines were accessible in our dataset. The complete dataset is available in Data S8.
Figure 3Analysis of Rigor Criteria for the Journal Nature
The right axis represents the percentage of papers that fulfill a particular criterion. The left axis represents the average SciScore. The figure shows that, during and after the implementation of the Nature checklist, the average SciScore as well as all measures except for organism identifiability have improved markedly. Although scores were increasing before the checklist implementation, the checklist appears to quickly boost numbers. Data underlying this graph are available in the (Data S1 and S4, https://scicrunch.org/scicrunch/data/source/SCR_016251-1/search?q=∗&l=∗).
Figure 4Average Journal SciScore between 2016 and 2017 as a Function of the Journal Impact Factor for 2018 (Data from Published Papers from 2016 to 2017)
Data from 490 journals are shown in each graph.
(A) A comparison between the raw JIFs and Rigor and Transparency Index is shown. The correlation coefficient is calculated using the formula for Spearman's rank-order correlation (Rs = −0.1102253134).
(B) A comparison between JIF percentiles and SciScore percentiles is shown. The axes are labeled with quartiles; top quartile is Q1. For presentation purposes only, using Google Sheets with journal names as centered data labels, we chose the top 45 journals by the number of articles included and then we removed labels that were overlapping until we were left with 25 labeled journals, shown above. All 490 journals, for which we had sufficient data in the open access literature to compare to the Journal Impact Factor, are presented in (Data S5). Correlation values were calculated using the formula for Spearman's rank-order correlation, the line is not shown (Rs = −0.1541069707).