| Literature DB >> 29688367 |
Daniela Raciti1, Karen Yook1, Todd W Harris2, Tim Schedl3, Paul W Sternberg1.
Abstract
Large volumes of data generated by research laboratories coupled with the required effort and cost of curation present a significant barrier to inclusion of these data in authoritative community databases. Further, many publicly funded experimental observations remain invisible to curation simply because they are never published: results often do not fit within the scope of a standard publication; trainee-generated data are forgotten when the experimenter (e.g. student, post-doc) leaves the lab; results are omitted from science narratives due to publication bias where certain results are considered irrelevant for the publication. While authors are in the best position to curate their own data, they face a steep learning curve to ensure that appropriate referential tags, metadata, and ontologies are applied correctly to their observations, a task sometimes considered beyond the scope of their research and other numerous responsibilities. Getting researchers to adopt a new system of data reporting and curation requires a fundamental change in behavior among all members of the research community. To solve these challenges, we have created a novel scholarly communication platform that captures data from researchers and directly delivers them to information resources via Micropublication. This platform incentivizes authors to publish their unpublished observations along with associated metadata by providing a deliberately fast and lightweight but still peer-reviewed process that results in a citable publication. Our long-term goal is to develop a data ecosystem that improves reproducibility and accountability of publicly funded research and in turn accelerates both basic and translational discovery. Database URL: www.micropublication.org.Entities:
Mesh:
Year: 2018 PMID: 29688367 PMCID: PMC5836261 DOI: 10.1093/database/bay013
Source DB: PubMed Journal: Database (Oxford) ISSN: 1758-0463 Impact factor: 3.451
Figure 1.Micropublication: biology platform homepage at http://www.micropublication.org.
Figure 2.Summary of the data submission process and validation pipeline.
Figure 3.Iterative gene expression submission form design.
Figure 4.Submission form and WormBase Expression pattern data model. A simplified example that shows how metadata captured through the form represent specific data fields in the data model. For gene expression, authors can describe the spatio-temporal localization of a transcript/protein by choosing terms from pre-defined ontologies. We use the C. elegans anatomy ontology to describe localization in cells/tissues, the GO Cellular Component Ontology to describe subcellular localization and the C. elegans developmental ontology to capture temporal expression. We allow authors to choose from pre-designed qualifier fields (certainly expressed, partially expressed, possibly expressed and NOT expressed) that allow a more detailed description of the pattern.
Figure 5.WormBase view of a gene expression micropublication available at www.wormbase.org. http://wormbase.org/resources/paper/WBPaper00050256#03–10.