| Literature DB >> 30457573 |
Helena Cousijn1, Amye Kenall2, Emma Ganley3, Melissa Harrison4, David Kernohan5, Thomas Lemberger6, Fiona Murphy7, Patrick Polischuk8, Simone Taylor9, Maryann Martone10, Tim Clark11,12.
Abstract
This article presents a practical roadmap for scholarly publishers to implement data citation in accordance with the Joint Declaration of Data Citation Principles (JDDCP), a synopsis and harmonization of the recommendations of major science policy bodies. It was developed by the Publishers Early Adopters Expert Group as part of the Data Citation Implementation Pilot (DCIP) project, an initiative of FORCE11.org and the NIH BioCADDIE program. The structure of the roadmap presented here follows the "life of a paper" workflow and includes the categories Pre-submission, Submission, Production, and Publication. The roadmap is intended to be publisher-agnostic so that all publishers can use this as a starting point when implementing JDDCP-compliant data citation. Authors reading this roadmap will also better know what to expect from publishers and how to enable their own data citations to gain maximum impact, as well as complying with what will become increasingly common funder mandates on data transparency.Entities:
Mesh:
Year: 2018 PMID: 30457573 PMCID: PMC6244190 DOI: 10.1038/sdata.2018.259
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 6.444
How and why to cite data.
| Supports reproducibility and validation of results, allows data reuse. Provides credit for data generators. Publications linked to publicly available data have been associated with increased citations. Improves connectivity and provenance tracking of data described in publications. Data sharing is increasingly required by funders, publishers, and institutions. | |
|
For primary data: determine an appropriate long-term repository for data archiving. Your publisher should provide access to a list of acceptable archival repositories.
Deposit your data and get an accession number or dataset DOI from the repository.
For secondary data: cite what you use. When using the data of others, cite both the related peer-reviewed literature and the actual datasets used.
Include ‘formal’ data citations whenever possible: When datasets have formal, stable identifiers or accession numbers, they should be included in the main reference list.
Be as complete as possible, but don’t invent metadata: If a data record does not have a clear author/creator or title, don’t make one up.
Refer to your publisher’s | |
| Help authors and journals to comply easily with funder mandates. Improve author service by simplifying policies and procedures and increasing the visibility and connectivity of their articles and data. Improve editor and peer reviewer service with better guidelines and support for data and visibility of data in the peer review process. Improve reader and author service with more consistent links to data. Support editorial goals to publish more open and reproducible research. Make the most of your repository partnerships. | |
| How? | Revise editor training and advocacy material Revise reviewer training material Update information for authors by: a. providing guidance on author responsibilities and a policy on data citation; b. asking authors to provide a Data Availability Statement; c. specifying how to format data citations; and d. providing detailed guidance on suitable repositories. Update guidelines for internal customer services queries and provide author FAQs. Capture data citation in reference list at point of submission in a structured way. Data availability should be captured in a structured way. Update XML DTD for data citation tagging. Display data citations in the article. Deliver data citation metadata to Crossref. |
Figure 1Data citation example.
(1) Data citation in text; (2) Reference; (3) Globally resolvable unique identifier. Example from Beresford NA, et al. (2016). Available at https://doi.org/10.1016/j.jenvrad.2015.03.022[16].
Figure 2Data citation resolution structure (ideal workflow).
Articles (1) link to datasets in appropriate repositories, on which their conclusions are based, through citation to a dataset (a), whose unique persistent identifier (PID) resolves (b) to a landing page (2) in a well-supported data repository. The data landing page contains human- and machine-readable metadata, to support search and to resolve (c) back to the citing article, and (d) a link to the data itself (3).
Figure 3Example of a Data Availability Statement.
Taken from Ma et al.[19]. Available at https://doi.org/10.1186/s13059-018-1435-z.
Estimated data citation implementation timelines for eight academic publishers.
| Publisher | Planning | Implementation | Planned go-live date |
|---|---|---|---|
| NB. The data citation rollout at the given publishers will be in line with their respective data policies and will apply to all journals whose content is based on a dataset or which references datasets. | |||
| eLife | Q1–Q3 2017 | Q1–Q3 2018 | Live |
| Elsevier | Q2–Q3 2016 | Q4 2016 | Live |
| EMBO Press | Q1–Q2 2017 | Q3 2017–Q1 2018 | Live |
| Frontiers | Q1–Q4 2017 | Q1– Q2 2018 | Live |
| PLOS | Q1–Q4 2017 | 2018 | Q1/Q2 2019 |
| SpringerNature | 2016–2017 | Q2 2017–Q4 2018 | Q4 2018 |
| Taylor & Francis | Q1–Q2 2017 | Q4 2017 continuing through 2018 | Live |
| Wiley | Q1–Q2 2017 | Q4 2017 continuing through 2018 | 2018 |