| Literature DB >> 33983972 |
Luke A McGuinness1,2, Athena L Sheppard3.
Abstract
OBJECTIVE: To determine whether medRxiv data availability statements describe open or closed data-that is, whether the data used in the study is openly available without restriction-and to examine if this changes on publication based on journal data-sharing policy. Additionally, to examine whether data availability statements are sufficient to capture code availability declarations.Entities:
Year: 2021 PMID: 33983972 PMCID: PMC8118451 DOI: 10.1371/journal.pone.0250887
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Categories used to classify the data availability statements.
| Key | Main category | Sub-category | Example |
|---|---|---|---|
| Not applicable (protocol for a review, commentary, etc) | "Data sharing not applicable to this article as no datasets were generated or analysed during the current study." [ | ||
| "Closed" | Data not made available | "Not available for public" [ | |
| "Closed" | Data available on request to authors | "Data can be available upon reasonable request to the corresponding author." [ | |
| "Closed" | Data will be made available in the future (link provided) | "The protocol and full dataset will be available at Open Science Framework upon peer review publication ( | |
| "Closed" | Data will be made available in the future (no link provided) | "Data will be deposited in Dryad upon publication" [ | |
| "Closed" | Data available from central repository (access-controlled or open access), but insufficient detail available to find specific dataset | "Data were obtained from the international MSBase cohort study. Information regarding data availability can be obtained at | |
| "Closed" | Data available from central access-controlled repository, and sufficient details included to identify specific dataset e.g. via extract or accession ID or date stamp | "This research has been conducted using the UK Biobank Resource under application number 24494. All bona fide researchers can apply to use the UK Biobank resource for health related research that is in the public interest." [ | |
| "Open" | Data available in the manuscript/ | "All data related to this study are present in the paper or the | |
| "Open" | Data available via a online repository that is not access-controlled e.g. Dryad, Zenodo | "Extracted data used in this meta-analysis and analysis code are available at |
Illustrative examples of each category were taken from preprints included in our sample (see "Data extraction").
Fig 1Distribution of the data availability statements of preprinted (Panel A) and published (Panel B) records by category from Table 1.
Change in openness of data availability statements from preprint to published article, grouped by journal data-sharing policy.
| Journal data sharing policy | Preprinted records subsequently published (N) | Open DAS in preprinted version % (N) | Open DAS in published version % (N) | Change in DAS from preprint to publication | ||
|---|---|---|---|---|---|---|
| More open (N) | More closed (N) | No change (N) | ||||
| 94 | 20.2% (19) | 22.3% (21) | 10 | 8 | 76 | |
| 57 | 33.3% (19) | 61.4% (35) | 16 | 0 | 41 | |
Assessment of whether researchers promising to make data available on publication actually do so, and whether this differs if researchers included a link to an embargoed repository or not.
| Preprint Category | Number of preprints | Published Category | Number of published studies |
|---|---|---|---|
| 3 | 1. Data not made available | 1 (33.3%) | |
| 5. Data available from central repository (access-controlled or open access), but insufficient detail available to find specific dataset | 1 (33.3%) | ||
| 8. Data available via a online repository that is not access-controlled e.g. Dryad, Zenodo | 1 (33.3%) | ||
| 7 | 1. Data not made available | 1 (14.3%) | |
| 2. Data available on request to authors | 1 (14.3%) | ||
| 7. Data available in the manuscript/ | 1 (14.3%) | ||
| 8. Data available via a online repository that is not access-controlled e.g. Dryad, Zenodo | 4 (57.1%) |