| Literature DB >> 28462024 |
Nicole A Vasilevsky1,2, Jessica Minnier3, Melissa A Haendel1,2, Robin E Champieux1.
Abstract
BACKGROUND: There is wide agreement in the biomedical research community that research data sharing is a primary ingredient for ensuring that science is more transparent and reproducible. Publishers could play an important role in facilitating and enforcing data sharing; however, many journals have not yet implemented data sharing policies and the requirements vary widely across journals. This study set out to analyze the pervasiveness and quality of data sharing policies in the biomedical literature.Entities:
Keywords: Data management; Data sharing; Open data; Reproducibility; Scholarly communication; Scientific communication
Year: 2017 PMID: 28462024 PMCID: PMC5407277 DOI: 10.7717/peerj.3208
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
Journal impact factor category.
| Journal impact factor category | |
|---|---|
| <2 | 19 (6%) |
| 2–3.99 | 125 (39.3%) |
| 4–5.99 | 102 (32.1%) |
| 6–7.99 | 25 (7.9%) |
| 8–9.99 | 15 (4.7%) |
| 10–29.99 | 29 (9.1%) |
| 30–43 | 3 (0.9%) |
Number of citable items per journal.
| Number of citable items per journal | |
|---|---|
| <100 | 42 (13.2%) |
| 100–500 | 239 (75.2%) |
| 500–1,000 | 28 (8.8%) |
| 1,000–32,000 | 9 (2.8%) |
Journal scoring rubric used in this study, adapted from Stodden, Guo & Ma (2013).
| 1 | Required as condition of publication, barring exceptions |
| 2 | Required but, no explicit statement regarding effect on publication/editorial decisions |
| 3 | Explicitly encouraged/addressed, but not required. |
| 4 | Mentioned indirectly |
| 5 | Only protein, proteomic, and/or genomic data sharing are addressed. |
| 6 | No mention |
| 1 | Open access |
| 0 | Subscription |
| a | Yes |
| b | No |
| A | Public online repository |
| B | Journal hosted |
| C | By reader request to authors |
| D | Multiple methods equally recommended |
| E | Unspecified |
| a | Journal will host regardless of size |
| b | Journal has data hosting file/s size limit |
| c | Unspecified |
| a | Explicitly stated or mentioned |
| b | No mention |
| a | Explicitly stated |
| b | No mention |
| a | Explicitly stated |
| b | No mention |
Number of journals per open access.
| Open access | # Journals (%) | Median # citable items per journal 2013 | # Citable items 2013 (%) | # Citable items 2013, remove PLOS ONE (%) | Median # citable items per journal 2014 | # Citable items 2014 (%) | # Citable items 2014, remove PLOS ONE (%) |
|---|---|---|---|---|---|---|---|
| Open access | 44 (13.8%) | 199.5 | 43,789 (33.6%) | 12,293 (12.4%) | 207 | 45,831 (35.0%) | 15,791 (15.6%) |
| Subscription | 274 (86.2%) | 246.5 | 86,541 (66.4%) | 86,541 (87.6%) | 240 | 85,276 (65.0%) | 85,276 (84.4%) |
Curator reliability.
| Score | # Journals | % Agreement | Cohen’s Kappa |
|---|---|---|---|
| Open access mark | 40 | 100.000 | 1.000 |
| Data sharing mark | 40 | 92.500 | 0.905 |
| Protein proteomic genomic or microaray sequence or structural data sharing addressed required with deposit to specific data banks | 40 | 100.000 | 1.000 |
| Recommended preferred sharing mark | 40 | 97.500 | 0.959 |
| Size guidelines if journal hosted provided | 13 | 92.308 | 0.629 |
| Copyright licensing mark | 40 | 100.000 | 1.000 |
| Archival retention mark | 40 | 97.500 | 0.655 |
| Reproducibility noted mark | 40 | 92.500 | 0.754 |
Figure 1Percentage of journals per each data sharing mark (DSM).
(A) shows the percentage of all journals for each data sharing mark. (B) shows the percentage of citable items from each journal (including PLOS ONE) for each data sharing mark. (C) shows the percentage of citable items for each journal (excluding PLOS ONE) for each data sharing mark. Because of the journal PLOS ONE’s high publishing activity, we analyzed the percentage of citable items for each data sharing mark including and excluding PLOS ONE.
Publishing volume by data sharing mark.
| DSM | DSM description | # Journals (%) | Median # citable items per journal 2013 | # Citable items 2013 (%) | # Citable items 2013, remove PLOS ONE (%) | Median # citable items per journal 2014 | # Citable items 2014 (%) | # Citable items 2014, remove PLOS ONE (%) |
|---|---|---|---|---|---|---|---|---|
| 1 | Required as condition of publication, barring exceptions | 38 (11.9%) | 230.5 | 42,669 (32.7%) | 11,173 (11.3%) | 220 | 42,794 (32.6%) | 12,754 (12.6%) |
| 2 | Required but no explicit statement regarding effect on publication/editorial decisions | 29 (9.1%) | 209 | 12,138 (9.3%) | 12,138 (12.3%) | 227 | 12,436 (9.5%) | 12,436 (12.3%) |
| 3 | Explicitly encouraged/addressed, but not required. | 74 (23.3%) | 259.5 | 25,519 (19.6%) | 25,519 (25.8%) | 282.5 | 26,026 (19.9%) | 26,026 (25.8%) |
| 4 | Mentioned indirectly | 29 (9.1%) | 256 | 8,062 (6.2%) | 8,062 (8.2%) | 225 | 7,894 (6%) | 7,894 (7.8%) |
| 5 | Only protein, proteomic, and/or genomic data sharing are addressed. | 47 (14.8%) | 277 | 19,339 (14.8%) | 19,339 (19.6%) | 316 | 19,080 (14.6%) | 19,080 (18.9%) |
| 6 | No mention | 101 (31.8%) | 211 | 22,603 (17.3%) | 22,603 (22.9%) | 213 | 22,877 (17.4%) | 22,877 (22.6%) |
| DSM 1&2 | Required | 67 (21.1%) | 226 | 54,807 (42.1%) | 23,311 (23.6%) | 221 | 55,230 (42.1%) | 25,190 (24.9%) |
| DSM 3–6 | Not Required | 251 (78.9%) | 248 | 75,523 (57.9%) | 75,523 (76.4%) | 244 | 75,877 (57.9%) | 75,877 (75.1%) |
| All Journals | 318 (100%) | 243 | 130,330 (100%) | 98,834 (100%) | 237.5 | 131,107 (100%) | 101,067 (100%) | |
Figure 2Impact factors were higher for journals with the strongest data sharing policies (DSM 1) compared to journals with no mention of data sharing (DSM 6).
The median Impact Factor was calculated for the journals with each data sharing mark (1–6) for each report year (left = 2013, right = 2014). The lower and upper hinges of the boxplots represent the first and third quartiles of journal Impact Factor, the horizontal line represents the median, the triangle represents the mean, and the upper and lower whiskers extend from the hinge to the highest (lowest) value that is within 1.5 times the interquartile range of the hinge, with journals outside this range represented as points.
Open access journals and citable items by data sharing mark.
| Open access journals and citable items by data sharing mark | Subscription | Open access | % Open access |
|---|---|---|---|
| Data sharing mark | # Journals (# Citable items) | # Journals (# Citable items) | % Journals (% Citable items) |
| 1- Required as condition of publication, barring exceptions | 29 (7,709) | 9 (34,960; 3464 | 23.7% (81.9%; 31% |
| 2- Required but no explicit statement regarding effect on publication/editorial decisions | 27 (11,864) | 2 (274 | 6.9% (2.3%) |
| 3- Explicitly encouraged/addressed, but not required. | 63 (22,884) | 11 (2,635) | 14.9% (10.3%) |
| 4- Mentioned indirectly | 29 (8,062) | 0 (0) | 0% (0%) |
| 5- Only protein, proteomic, and/or genomic data sharing are addressed. | 40 (17,401) | 7 (1,938) | 14.9% (10.0%) |
| 6- No mention | 86 (18,621) | 15 (3,982) | 14.9% (17.6%) |
| DSM 1&2 - Required | 56 (19,573) | 11 (35,234; 3,738 | 16.42% (64.29%; 16.04% |
| DSM 3–6 - Not required | 218 (66,968) | 33 (8,555) | 13.15% (11.33%) |
Notes.
After removing PLOS ONE.
Figure 3Recommended data sharing method by data sharing mark (DSM) 1–5.
The number (percent) of journals with each recommended data sharing method is represented by each tile, with brighter blue shades denoting higher percentages of journals with the given data sharing method.