| Literature DB >> 30971690 |
Martin Fenner1, Mercè Crosas2, Jeffrey S Grethe3, David Kennedy4, Henning Hermjakob5, Phillippe Rocca-Serra6, Gustavo Durand2, Robin Berjon7, Sebastian Karcher8, Maryann Martone3, Tim Clark9.
Abstract
This article presents a practical roadmap for scholarly data repositories to implement data citation in accordance with the Joint Declaration of Data Citation Principles, a synopsis and harmonization of the recommendations of major science policy bodies. The roadmap was developed by the Repositories Expert Group, as part of the Data Citation Implementation Pilot (DCIP) project, an initiative of FORCE11.org and the NIH-funded BioCADDIE ( https://biocaddie.org ) project. The roadmap makes 11 specific recommendations, grouped into three phases of implementation: a) required steps needed to support the Joint Declaration of Data Citation Principles, b) recommended steps that facilitate article/data publication workflows, and c) optional steps that further improve data citation support provided by data repositories. We describe the early adoption of these recommendations 18 months after they have first been published, looking specifically at implementations of machine-readable metadata on dataset landing pages.Entities:
Mesh:
Year: 2019 PMID: 30971690 PMCID: PMC6472386 DOI: 10.1038/s41597-019-0031-8
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 6.444
Guidelines for Repositories.
| Level | # | Guideline |
|---|---|---|
| Required | 1 | All datasets intended for citation |
| 2 | Persistent identifiers for datasets | |
| 3 | The persistent identifier expressed as an URL | |
| 4 | The persistent identifier | |
| 5 | The repository must provide documentation and support for data citation. | |
| Recommended | 6 | The landing page |
| 7 | The machine-readable metadata | |
| 8 | Metadata | |
| 9 | Metadata | |
| Optional | 10 | Content negotiation for schema.org/JSON-LD and other content types |
| 11 | HTTP link headers |
Fig. 1Generic data citation - relationships of the citation reference, repository landing page and underlying data.
Fig. 2Providing information about how a dataset should be cited, with download link for citation (in BibTex or other standard bibliographic reference manager format).
Citation metadata for Data Repositories. Key:
| Citation Metadata | Dublin Corea | Schema.orgb | DataCitec | DATSd |
|---|---|---|---|---|
| Dataset Identifier | identifier | @id* | identifier | identifier |
| Title | title | name | title | title |
| Creator** | creator | author | creator | creator |
| Data repository or archive | publisher | publisher | publisher | publisher |
| Publication Date | date | datePublished | publicationYear | date |
| Version |
| version | version | version |
| Type | type | type | resourceTypeGeneral | type |
aDublin Core Metadata Element Set (https://dublincore.org/documents/dces/);
bDataset - Schema.org (https://schema.org/Dataset);
cDataCite Metadata Working Group[21];
dGonzalez-Beltran & Rocca-Serra[22,23];
*name of ID field depends on schema.org serialization format, it is @id for JSON-LD;
**not all datasets will have “the main researchers involved in producing the data” (DataCite Schema), in which case the more generic “An entity primarily responsible for making the resource” from Dublin Core should be used, and this can also be an organization.
Important discovery metadata for Data Repositories. Key:
| Discovery Metadata | Dublin Core | Schema.org | DataCite | DATS |
|---|---|---|---|---|
| Description | description | description | description | dataType |
| Keywords | subject | keywords | subject | keywords |
| License | license | license | rights | license |
| Related Dataset** | isPartOf isVersionOf | isPartOf | relatedIdentifier | isPartOf |
| Related Publication*** | bibliographicCitation | citation | relatedIdentifier | publication |
*DATS provides much more detailed metadata to describe a biomedical dataset;
**related datasets can have part/whole relations (IsPartOf, etc.), version relations (IsVersionOf, etc.) or reference relations (references);
***related publications reference a dataset published previously, reference a dataset published in parallel with the publication, or otherwise document a dataset.
Fig. 3Implementation status of Schema.org metadata in repository landing pages.