Literature DB >> 32934235

Facilitating author-driven, machine-readable descriptions with the new minISA metadata format.

.   

Abstract

Entities:  

Year:  2020        PMID: 32934235      PMCID: PMC7493984          DOI: 10.1038/s41597-020-00641-9

Source DB:  PubMed          Journal:  Sci Data        ISSN: 2052-4463            Impact factor:   6.444


× No keyword cloud information.
Since launch, Scientific Data has provided a machine-readable metadata file alongside each published Data Descriptor to summarize key characteristics about the data being described in the article[1]. Until August 2019, the metadata files provided for every Data Descriptor were based on the full ISA-Tab format[2,3] and were searchable via the ISA-Explorer discovery tool[4]. Although the ISA-Tab format was initially developed with biological assays in mind, it is a highly flexible format that allowed the journal to represent the generation of data in a machine-readable manner across a number of research disciplines. The data sharing and management landscape has changed dramatically since the journal first launched. In the last six years, we have seen the emergence of the FAIR data concept[5], the maturation of the data repository ecosystem, the emergence of repository indexes (e.g. https://fairsharing.org/ & https://www.re3data.org/), data indexing services (e.g. https://datamed.org/ & https://datasetsearch.research.google.com/), and advances in formal data citation infrastructure[6]. With machine-readable metadata now available from a wider range of sources, we have sought to focus our efforts on providing lighter metadata files that bridge the gap between citation-level metadata and the richer highly-structured information provided by data repositories. To this end, we have worked with the ISA coordinator group (https://isa-tools.org) to update our Data Descriptor machine-readable metadata format. The changes are twofold. We upgraded the representation from tabular to JSON-LD format (https://json-ld.org/) and reduced the set of elements in the metadata. Importantly, we have kept the cross-references to the relevant data records in repositories, where users can access the full repository-level metadata and the data files. To develop this new lighter-weight metadata format we used the existing ISA-JSON format as a starting point, and extracted the metadata elements that focus on describing: how and where data were measured, what experimental factors were varied in the study, and related publications that demonstrate usage of the data. We then anchored these elements to Schema.org types to make them compatible with other semantic web metadata sources (https://schema.org/). Metadata in the new minISA format[7] are now available for all Data Descriptors published since September 2019. The metadata files are hosted in figshare, receive their own distinct digital object identifiers, and are accessible directly from each Data Descriptor or via figshare’s API. The minISA format has been implemented in a new custom built webtool developed in partnership with figshare. Authors will be sent a link to the tool at an appropriate point during the review process. From today, all Data Descriptor authors will be asked to submit information about their data via the tool during the editorial and review process. Our in-house curation team will continue to work with authors of accepted manuscripts to finalize the metadata, ensuring the use of controlled vocabulary and ontology terms wherever possible. The curation team will also continue to carry out their usual checks to ensure the identification and remedy of any errors that may have been overlooked during peer review, and to improve dataset reusability[8]. While minISA will be used for all of our machine readable metadata going forward, our past ISA-Tab formatted metadata files remain available from each online Data Descriptor. We have also permanently archived a copy of the full set of 852 Scientific Data ISA-Tab files at figshare[9]. Metadata can be used to enhance the discovery and reuse value of the data described, thereby improving the FAIRness of the data. The metadata files associated with each Data Descriptor article have always been available for use by all under the CC0 waiver[1] and can be downloaded directly from each Data Descriptor article online. Data discovery infrastructure is still in its infancy, and we aim to continue to play an active role in this space. We are excited to see what the future holds for Scientific Data’s minISA metadata files and look forward to the next innovations in data publishing.
  5 in total

1.  ISA software suite: supporting standards-compliant experimental annotation and enabling curation at the community level.

Authors:  Philippe Rocca-Serra; Marco Brandizi; Eamonn Maguire; Nataliya Sklyar; Chris Taylor; Kimberly Begley; Dawn Field; Stephen Harris; Winston Hide; Oliver Hofmann; Steffen Neumann; Peter Sterk; Weida Tong; Susanna-Assunta Sansone
Journal:  Bioinformatics       Date:  2010-08-02       Impact factor: 6.937

2.  Toward interoperable bioscience data.

Authors:  Susanna-Assunta Sansone; Philippe Rocca-Serra; Dawn Field; Eamonn Maguire; Chris Taylor; Oliver Hofmann; Hong Fang; Steffen Neumann; Weida Tong; Linda Amaral-Zettler; Kimberly Begley; Tim Booth; Lydie Bougueleret; Gully Burns; Brad Chapman; Tim Clark; Lee-Ann Coleman; Jay Copeland; Sudeshna Das; Antoine de Daruvar; Paula de Matos; Ian Dix; Scott Edmunds; Chris T Evelo; Mark J Forster; Pascale Gaudet; Jack Gilbert; Carole Goble; Julian L Griffin; Daniel Jacob; Jos Kleinjans; Lee Harland; Kenneth Haug; Henning Hermjakob; Shannan J Ho Sui; Alain Laederach; Shaoguang Liang; Stephen Marshall; Annette McGrath; Emily Merrill; Dorothy Reilly; Magali Roux; Caroline E Shamu; Catherine A Shang; Christoph Steinbeck; Anne Trefethen; Bryn Williams-Jones; Katherine Wolstencroft; Ioannis Xenarios; Winston Hide
Journal:  Nat Genet       Date:  2012-01-27       Impact factor: 38.330

3.  Open data, open curation.

Authors: 
Journal:  Sci Data       Date:  2018-09-25       Impact factor: 6.444

4.  Data citation needed.

Authors: 
Journal:  Sci Data       Date:  2019-04-10       Impact factor: 6.444

5.  The FAIR Guiding Principles for scientific data management and stewardship.

Authors:  Mark D Wilkinson; Michel Dumontier; I Jsbrand Jan Aalbersberg; Gabrielle Appleton; Myles Axton; Arie Baak; Niklas Blomberg; Jan-Willem Boiten; Luiz Bonino da Silva Santos; Philip E Bourne; Jildau Bouwman; Anthony J Brookes; Tim Clark; Mercè Crosas; Ingrid Dillo; Olivier Dumon; Scott Edmunds; Chris T Evelo; Richard Finkers; Alejandra Gonzalez-Beltran; Alasdair J G Gray; Paul Groth; Carole Goble; Jeffrey S Grethe; Jaap Heringa; Peter A C 't Hoen; Rob Hooft; Tobias Kuhn; Ruben Kok; Joost Kok; Scott J Lusher; Maryann E Martone; Albert Mons; Abel L Packer; Bengt Persson; Philippe Rocca-Serra; Marco Roos; Rene van Schaik; Susanna-Assunta Sansone; Erik Schultes; Thierry Sengstag; Ted Slater; George Strawn; Morris A Swertz; Mark Thompson; Johan van der Lei; Erik van Mulligen; Jan Velterop; Andra Waagmeester; Peter Wittenburg; Katherine Wolstencroft; Jun Zhao; Barend Mons
Journal:  Sci Data       Date:  2016-03-15       Impact factor: 6.444

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.