| Literature DB >> 30930504 |
S P J M Horbach1,2, W Halffman1.
Abstract
There is a mounting worry about erroneous and outright fraudulent research that gets published in the scientific literature. Although peer review's ability to filter out such publications is contentious, several peer review innovations attempt to do just that. However, there is very little systematic evidence documenting the ability of different review procedures to flag problematic publications. In this article, we use survey data on peer review in a wide range of journals to compare the retraction rates of specific review procedures, using the Retraction Watch database. We were able to identify which peer review procedures were used since 2000 for 361 journals, publishing a total of 833,172 articles, of which 670 were retracted. After addressing the dual character of retractions, signalling both a failure to identify problems prior to publication, but also the willingness to correct mistakes, we empirically assess review procedures. With considerable conceptual caveats, we were able to identify peer review procedures that seem able to detect problematic research better than others. Results were verified for disciplinary differences and variation between reasons for retraction. This leads to informed recommendations for journal editors about strengths and weaknesses of specific peer review procedures, allowing them to select review procedures that address issues most relevant to their field.Entities:
Keywords: Peer review; Research integrity; Retractions; Scientific publishing
Year: 2018 PMID: 30930504 PMCID: PMC6404393 DOI: 10.1007/s11192-018-2969-2
Source DB: PubMed Journal: Scientometrics ISSN: 0138-9130 Impact factor: 3.238
Procedures of peer review categorized by dimension and attribute
| Dimension | Attribute | Range |
|---|---|---|
| Selection conditions | Timing | a. No review |
| b. Pre-submission (including registered reports) | ||
| c. Pre-publication | ||
| d. Post-publication | ||
| Criteria | a. Methodological rigour and correctness | |
| b. Anticipated impact (either within or outside of science) | ||
| c. Novelty | ||
| d. Fit with journal’s scope | ||
| e. Other | ||
| Identities and access | Type of reviewer | a. Editor-in-chief |
| b. Editorial committee | ||
| c. External reviewers selected by authors | ||
| d. External reviewers selected by editor(s) | ||
| e. Wider community/readers | ||
| f. Commercial review platforms | ||
| Anonymity of authors | a. Author identities are blinded to editor and reviewer | |
| b. Author identities are blinded to reviewer but known to editor | ||
| c. Author identities are known to editor and reviewer | ||
| Anonymity of reviewers | a. Anonymous reviewers | |
| b. Reviewers’ identities are open to the authors | ||
| c. Reviewers’ identities are open to other reviewers | ||
| d. Reviewers’ identities are open to the reader | ||
| Availability of review reports | a. Review reports are accessible to authors and editors | |
| b. Review reports are accessible to other reviewers | ||
| c. Review reports are accessible to readers of the published manuscript | ||
| d. Review reports are publicly accessible | ||
| Interaction | a. No interaction between authors/reviewers | |
| b. Interaction amongst reviewers is facilitated | ||
| c. Author’s responses to review reports are communicated to the reviewer | ||
| d. Interaction between authors and reviewers is facilitated | ||
| Specialisation in review | Structure | a. Unstructured |
| b. Semi-structured: Reviewers are guided by some open questions or are presented several criteria for judgement | ||
| c. Structured: Review occurs through mandatory forms or checklists to be filled out by reviewers | ||
| Statistical review | a. Not applicable | |
| b. No special attention is given to statistical review | ||
| c. Incorporated in review | ||
| d. Performed by additional, specialist reviewer | ||
| e. Performed through automatic computer software | ||
| External sources | a. No reviews from external sources are used | |
| b. Reviews from other (partner) journals accompanying manuscripts rejected elsewhere are used | ||
| c. Reviews from commercial review platforms are used | ||
| d. Reviews performed by the wider community (i.e. not by invited or targeted reviewers) are used | ||
| Technological tools in review | Technical support | a. No digital tools are used |
| b. Plagiarism detection software is used | ||
| c. Digital tools to assess validity or consistency of statistics are used | ||
| d. Digital tools to detect image manipulation are used | ||
| e. Digital tools to check references are used | ||
| f. Other technical support (e.g. machine learning techniques to assess consistency and completeness) | ||
| Reader commentary | a. No reader commentary facilitated | |
| b. In-channel reader commentary facilitated | ||
| c. Out-of-channel reader commentary facilitated |
Fig. 1Overview of the retractions data. Starting with the RetractionWatch database, we omitted a large number of retracted conference proceedings by IEEE. The final set consists of the resulting retracted articles that are contained in the WoS database
The number of articles and retracted articles since 2000 in our sample, according to research area, as defined in the Leiden Ranking
| Research area | Social science and humanities | Biomedical and health sciences | Physical sciences and engineering | Life and earth sciences | Mathematics and computer sciences | Total |
|---|---|---|---|---|---|---|
| Total articles published | 1,660,394 | 6,325,415 | 5,717,615 | 2,948,177 | 1,605,903 | 18,257,504 |
| Articles in our sample | 53,922 | 382,950 | 196,845 | 160,126 | 38,225 | 833,172 |
| % of total articles in our sample | 3.25 | 6.05 | 3.44 | 5.43 | 2.38 | 4.56 |
| Total retractions | 380 | 4394 | 1693 | 800 | 281 | 7547 |
| Retractions in our sample | 30 | 368 | 183 | 67 | 21 | 670 |
| % of retracted articles in our sample | 7.90 | 8.38 | 10.81 | 8.38 | 7.48 | 8.86 |
Fig. 2Cumulative distribution of retractions over journals, either in the entire population, in our sample, or outside of our sample
Timing of peer review relative to the publication process related to number of non-retracted and retracted articles in our sample
| At what stage of the publication process does review take place? | Retracted | 95% CI | Non-retracted | 95% CI |
|---|---|---|---|---|
| No review takes place | 0 (0.0%) | 0.0–0.0 | 648 (0.1%) | 0.1–0.1 |
| Pre-submission review (including registered reports) | 32 (4.8%) | 3.2–6.4 | 63,262 (7.6%) | 7.5–7.7 |
| Pre-publication review | 645 (96.3%) | 94.8–97.7 | 812,362 (97.6%) | 97.5–97.6 |
| Post-publication review | 0 (0.0%) | 0.0–0.0 | 7008 (0.8%) | 0.8–0.9 |
Review criteria related to number of non-retracted and retracted articles in our sample
| What quality criteria does your journal use for peer review? | Retracted | 95% CI | Non-retracted | 95% CI |
|---|---|---|---|---|
| Methodological rigour and correctness | 666 (99.4%) | 99.1–99.7 | 810,932 (97.4%) | 97.4–97.4 |
| Anticipated impact (either within or outside of science) | 549 (81.9%) | 80.4–83.5 | 523,629 (62.9%) | 62.8–63.0 |
| Novelty | 610 (91.0%) | 89.9–92.2 | 697,846 (83.8%) | 83.8–83.9 |
| Fit with journal’s scope | 641 (95.7%) | 94.9–96.5 | 733,670 (88.1%) | 88.1–88.2 |
Identity of reviewer related to number of non-retracted and retracted articles in our sample
| Type of reviewers | Retracted | 95% CI | Non-retracted | 95% CI |
|---|---|---|---|---|
| Editor-in-chief | 197 (29.4%) | 27.2–31.6 | 347,035 (41.7%) | 41.6–41.8 |
| Editorial committee | 376 (56.1%) | 53.7–58.5 | 439,754 (52.8%) | 52.8–52.9 |
| External reviewers suggested by authors | 372 (55.5%) | 53.1–57.9 | 445,636 (53.5%) | 53.5–53.6 |
| External reviewers suggested and selected by editor(s) | 653 (97.5%) | 96.7–98.2 | 805,787 (96.8%) | 96.8–96.8 |
| Wider community/readers | 34 (5.1%) | 4.0–6.1 | 146,502 (17.6%) | 17.5–17.6 |
| Commercial review platforms | 0 (0.0%) | 0.0–0.0 | 9192 (1.1%) | 1.1–1.1 |
Level of author anonymity during peer review related to the number of non-retracted and retracted articles per research area
| Level of author anonymity | Author identities are known to editor and reviewer | Author identities are blinded to reviewer but known to editor | Author identities are blinded to editor and reviewer | |
|---|---|---|---|---|
| All articles (percentage of all articles) | 692,920 (83.2%) | 135,019 (16.2%) | 7011 (0.8%) | |
| Non-retracted (percentage of non-retracted papers) | Total | 692,280 (83.2%) | 134,989 (16.2%) | 7011 (0.8%) |
| Social Science & Humanities | 10,206 (18.9%) | 42,819 (79.5%) | 1192 (2.2%) | |
| Biomedical and Health Sciences | 317,182 (82.9%) | 65,329 (17.1%) | 1327 (0.3%) | |
| Physical Sciences & Engineering | 181,201 (92.1%) | 12,000 (6.1%) | 3461 (1.8%) | |
| Life and Earth Sciences | 146,040 (91.2%) | 13,186 (8.2%) | 1032 (0.6%) | |
| Mathematics & Computer Sciences | 36,849 (96.5%) | 1355 (3.5%) | 0 (0.0%) | |
| Retracted (percentage of retracted papers) | Total | 640 (95.5%) | 30 (4.5%) | 0 (0.0%) |
| Social Science & Humanities | 26 (86.7%) | 4 (13.3%) | 0 (0.0%) | |
| Biomedical & Health Sciences | 348 (94.6%) | 20 (5.4%) | 0 (0.0%) | |
| Physical Sciences & Engineering | 180 (98.4%) | 3 (1.6%) | 0 (0.0%) | |
| Life and Earth Sciences | 65 (97.0%) | 2 (3.0%) | 0 (0.0%) | |
| Mathematics and Computer Sciences | 20 (95.4%) | 1 (4.8%) | 0 (0.0%) |
Level of reviewer anonymity related to number of non-retracted and retracted articles in our sample
| Extent to which reviewers are anonymised | Retracted | 95% CI | Non-retracted | 95% CI |
|---|---|---|---|---|
| Reviewers are anonymous (both to authors and other reviewers as well as to readers of the published manuscript) | 654 (97.6%) | 96.5–98.8 | 816,784 (98.1%) | 98.1–98.1 |
| Reviewer identities are known to other reviewers of the same manuscript | 6 (0.9%) | 0.2–1.6 | 4004 (0.5%) | 0.5–0.5 |
| Reviewer identities are known to the authors | 9 (1.3%) | 0.5–2.2 | 5412 (0.7%) | 0.6–0.7 |
| Reviewer identities are known to the readers of the published manuscript | 5 (0.7%) | 0.1–1.4 | 4782 (0.6%) | 0.6–0.6 |
Accessibility of review reports related to number of non-retracted and retracted articles in our sample
| Accessibility of review reports | Retracted | 95% CI | Non-retracted | 95% CI |
|---|---|---|---|---|
| Review reports are accessible to authors and editors | 619 (92.4%) | 90.8–94.0 | 811,087 (97.4%) | 97.4–97.5 |
| Review reports are accessible to other reviewers | 448 (66.9%) | 64.1–69.7 | 487,933 (58.6%) | 58.5–58.7 |
| Review reports are accessible to readers of the published manuscript | 5 (0.7%) | 0.2–1.3 | 4790 (0.6%) | 0.6–0.6 |
| Review reports are publicly accessible | 2 (0.3%) | 0.0–0.6 | 2108 (0.3%) | 0.2–0.3 |
Level of interaction between authors and reviewers related to number of non-retracted and retracted articles in our sample
| Interaction between authors and reviewers | Retracted | 95% CI | Non-retracted | 95% CI |
|---|---|---|---|---|
| No interaction between authors or reviewers is facilitated | 130 (19.4%) | 16.6–22.2 | 339,782 (40.8%) | 40.7–40.9 |
| Author’s responses to review reports are communicated to the reviewer | 562 (83.9%) | 81.3–86.5 | 573,308 (68.9%) | 68.8–69.0 |
| Interaction between reviewers is facilitated | 48 (9.0%) | 5.3–9.0 | 75,100 (9.0%) | 9.0–9.1 |
| Interaction between authors and reviewers is facilitated (on top of formal review reports and formal responses to review reports) | 18 (2.7%) | 1.5–3.8 | 52,088 (6.3%) | 6.2–6.3 |
Level of structure in review related to the number of non-retracted and retracted articles per research area
| Level of structure | Unstructured | Semi-structured | Structured | |
|---|---|---|---|---|
| All articles (percentage of all articles) | 126,282 (15.2%) | 627,559 (75.3%) | 158,274 (19.0%) | |
| Non-retracted (percentage of non-retracted papers) | Total | 126,233 (15.2%) | 626,971 (75.3%) | 158,184 (19.0%) |
| Social Science & Humanities | 17,275 (32.1%) | 33,574 (62.3%) | 14,652 (27.2%) | |
| Biomedical & Health Sciences | 45,414 (11.9%) | 270,776 (70.8%) | 82,097 (21.5%) | |
| Physical Sciences & Engineering | 19,946 (10.1%) | 171,251 (87.1%) | 51,172 (26.0%) | |
| Life & Earth Sciences | 35,010 (21.9%) | 122,726 (76.7%) | 7783 (4.9%) | |
| Mathematics & Computer Sciences | 8290 (21.7%) | 27,910 (73.1%) | 2113 (5.5%) | |
| Retracted (percentage of retracted papers) | Total | 49 (7.3%) | 588 (87.8%) | 90 (13.4%) |
| Social Science & Humanities | 1 (3.3%) | 26 (86.7%) | 3 (10.0%) | |
| Biomedical & Health sciences | 21 (5.7%) | 303 (82.3%) | 47 (12.8%) | |
| Physical Sciences & Engineering | 1 (0.5%) | 182 (99.5%) | 10 (5.5%) | |
| Life & Earth Sciences | 26 (38.8%) | 56 (83.6%) | 29 (43.3%) | |
| Mathematics & Computer Sciences | 0 (0.0%) | 21 (100.0%) | 0 (0.0%) |
Level and type of statistical review related to number of non-retracted and retracted articles in our sample
| Level and type of statistical review | Retracted | 95% CI | Non-retracted | 95% CI |
|---|---|---|---|---|
| Not applicable (statistics does not play a role in my journal’s research area) | 193 (28.8%) | 25.4–32.2 | 117,760 (14.1%) | 14.1–14.2 |
| No special attention is given to statistical review | 70 (10.4%) | 8.2–12.7 | 165,637 (19.9%) | 19.8–20.0 |
| Incorporated in review (assessing statistics is part of reviewer’s and editor’s tasks) | 348 (51.9%) | 48.2–55.7 | 497,935 (59.8%) | 59.7–59.9 |
| Statistical review is performed by an additional, specialist reviewer | 81 (12.1%) | 9.7–14.5 | 152,040 (18.3%) | 18.2–18.3 |
| Statistics review is performed through automatic computer software | 0 (0.0%) | 0.0–0.0 | 4129 (0.5%) | 0.5–0.5 |
Extent to which reviews from external sources are used related to number of non-retracted and retracted articles in our sample
| Reviews from external sources | Retracted | 95% CI | Non-retracted | 95% CI |
|---|---|---|---|---|
| No reviews from external sources are used | 422 (63.0%) | 58.8–67.2 | 545,443 (65.5%) | 65.4–65.6 |
| Reviews from other (partner) journals are used | 90 (13.4%) | 10.5–16.4 | 244,385 (29.4%) | 29.3–29.5 |
| Reviews from commercial review platforms are used | 0 (0.0%) | 0.0–0.0 | 3971 (0.5%) | 0.5–0.5 |
| Reviews performed by the wider community are used | 1 (0.1%) | 0.0–0.5 | 3159 (0.4%) | 0.4–0.4 |
Usage of digital tools in peer review related to number of non-retracted and retracted articles in our sample
| Level of structure | No digital tools are used | Plagiarism detection software | Digital tools to check references | Digital tools to detect image manipulation | Digital tools to assess statistics | |
|---|---|---|---|---|---|---|
| All articles (percentage of all articles) | 259,601 | 482,414 | 206,083 | 69,533 | 23,567 | |
| Non-retracted (percentage of non-retracted papers) | Total | 259,321 (31.1%) | 482,037 (57.9%) | 205,861 (24.7%) | 69,485 (8.3%) | 23,561 (2.8%) |
| Social Science & Humanities | 25,389 (47.1%) | 28,096 (52.1%) | 8089 (15.0%) | 5105 (9.5%) | 2155 (4.0%) | |
| Biomedical & Health Sciences | 161,264 (42.2%) | 192,353 (50.3%) | 50,651 (13.2%) | 30,135 (7.9%) | 10,622 (2.8%) | |
| Physical Sciences & Engineering | 29,085 (14.8%) | 156,573 (79.6%) | 94,567 (48.1%) | 7688 (3.9%) | 1014 (0.5%) | |
| Life & Earth Sciences | 36,911 (23.1%) | 84,049 (51.9%) | 48,009 (30.0%) | 26,557 (16.6%) | 8238 (5.1%) | |
| Mathematics & Computer Sciences | 6231 (16.3%) | 19,882 (52.0%) | 4177 (10.9%) | 0 (0.0%) | 1533 (4.0%) | |
| Retracted (percentage of retracted papers) | Total | 280 (41.8%) | 377 (56.3%) | 222 (33.1%) | 48 (7.2%) | 6 (0.9%) |
| Social Science & Humanities | 9 (30.0%) | 21 (70.0%) | 7 (23.3%) | 0 (0.0%) | 0 (0.0%) | |
| Biomedical & Health sciences | 242 (65.8%) | 120 (32.6%) | 29 (7.9%) | 39 (10.6%) | 6 (1.6%) | |
| Physical Sciences & Engineering | 2 (1.1%) | 180 (98.4%) | 163 (89.1%) | 1 (0.5%) | 0 (0.0%) | |
| Life & Earth Sciences | 26 (38.8%) | 35 (52.2%) | 16 (23.9%) | 8 (11.9%) | 0 (0.0%) | |
| Mathematics & Computer Sciences | 1 (4.8%) | 20 (95.2%) | 6 (28.6%) | 0 (0.0%) | 0 (0.0%) |
Level of reader commentary related to the number of non-retracted and retracted articles per research area
| Level of reader commentary | No direct reader commentary is facilitated | Reader commentary is facilitated on the journal’s website | Out-of-channel reader commentary is facilitated | |
|---|---|---|---|---|
| All articles (percentage of all articles) | 627,267 (75.3%) | 203,692 (24.4%) | 3735 (0.4%) | |
| Non-retracted (percentage of non-retracted papers per research area) | Total | 626,849 (75.3%) | 203,441 (24.4%) | 3712 (0.4%) |
| Social Science & Humanities | 45,185 (83.8%) | 7057 (13.1%) | 1691 (3.1%) | |
| Biomedical & Health Sciences | 215,921 (56.4%) | 166,460 (43.5%) | 201 (0.1%) | |
| Physical Sciences & Engineering | 185,461 (94.3%) | 10,841 (5.5%) | 360 (0.2%) | |
| Life & Earth Sciences | 141,586 (88.5%) | 18,473 (11.5%) | 1460 (0.9%) | |
| Mathematics & Computer Sciences | 37,781 (98.9%) | 423 (1.1%) | 0 (0.0%) | |
| Retracted (percentage of retracted papers per research area) | Total | 418 (62.4%) | 251 (37.5%) | 23 (3.4%) |
| Social Science & Humanities | 27 (90.0%) | 3 (10.0%) | 0 (0.0%) | |
| Biomedical & Health sciences | 122 (33.2%) | 246 (66.8%) | 0 (0.0%) | |
| Physical Sciences & Engineering | 182 (99.5%) | 0 (0.0%) | 1 (0.5%) | |
| Life & Earth Sciences | 65 (97.0%) | 2 (3.0%) | 22 (32.8%) | |
| Mathematics & Computer Sciences | 21 (100.0%) | 0 (0.0%) | 0 (0.0%) |
Overview of review procedures’ significant effects on retraction rates
| Review attribute | Significant effects on retractions | Significant interaction with research area | Significant variation between reasons for retraction |
|---|---|---|---|
| 1. Timing | Pre-submission review is related to fewer retractions. | No | No |
| 2. Criteria | Focussing on anticipated impact and novelty is related to more retractions. | No | No |
| 3. Type reviewer | Involvement of editor-in-chief and wider community is related to fewer retractions. | Yes* | Yes* |
| 4. Author anonymity | Blinding author identities is related to fewer retractions. | No | Yes |
| 5. Reviewer anonymity | No significant effects. | No | No |
| 6. Review reports | No significant effects in general, some effect for specific reasons for retraction and difference in effect for different research areas. | Yes* | Yes* |
| 7. Interaction | No interaction is related to fewer retractions, having authors respond to review reports is related to more retractions. | Yes* | Yes* |
| 8. Structure | Unstructured and structured review is related to fewer retractions, semi-structured to more. | Yes* | Yes* |
| 9. Statistics | More retractions in journals deeming statistics ‘not relevant’, fewer retractions in journals paying no specific attention to statistics, incorporating statistics in review, or employing specialist statistics reviewers. | Yes* | Yes* |
| 10. External sources | Using review reports from partner journals is related to fewer retractions. | No | No |
| 11. IT-tools | Not using digital tools and using tools to check references is related to more retractions. | Yes | Yes* |
| 12. Reader commentary | Not facilitating reader commentary is related to fewer retractions, facilitating in-channel commentary is related to more retractions. | Yes | Yes* |
Significance is measured at the level of α = 0.01. *p< 0.001