Literature DB >> 30382537

Automated Metadata Suggestion During Repository Submission.

Robert A McDougal1,2, Isha Dalal3, Thomas M Morse4, Gordon M Shepherd4.   

Abstract

Knowledge discovery via an informatics resource is constrained by the completeness of the resource, both in terms of the amount of data it contains and in terms of the metadata that exists to describe the data. Increasing completeness in one of these categories risks reducing completeness in the other because manually curating metadata is time consuming and is restricted by familiarity with both the data and the metadata annotation scheme. The diverse interests of a research community may drive a resource to have hundreds of metadata tags with few examples for each making it challenging for humans or machine learning algorithms to learn how to assign metadata tags properly. We demonstrate with ModelDB, a computational neuroscience model discovery resource, that using manually-curated regular-expression based rules can overcome this challenge by parsing existing texts from data providers during user data entry to suggest metadata annotations and prompt them to suggest other related metadata annotations rather than leaving the task to a curator. In the ModelDB implementation, analyzing the abstract identified 6.4 metadata tags per abstract at 79% precision. Using the full-text produced higher recall with low precision (41%), and the title alone produced few (1.3) metadata annotations per entry; we thus recommend data providers use their abstract during upload. Grouping the possible metadata annotations into categories (e.g. cell type, biological topic) revealed that precision and recall for the different text sources varies by category. Given this proof-of-concept, other bioinformatics resources can likewise improve the quality of their metadata by adopting our approach of prompting data uploaders with relevant metadata at the minimal cost of formalizing rules for each potential metadata annotation.

Entities:  

Keywords:  Data sharing; Metadata; Natural language processing; Repository

Mesh:

Year:  2019        PMID: 30382537      PMCID: PMC6494730          DOI: 10.1007/s12021-018-9403-z

Source DB:  PubMed          Journal:  Neuroinformatics        ISSN: 1539-2791


  22 in total

1.  Dendritic asymmetry cannot account for directional responses of neurons in visual cortex.

Authors:  J C Anderson; T Binzegger; O Kahana; K A Martin; I Segev
Journal:  Nat Neurosci       Date:  1999-09       Impact factor: 24.884

2.  Database tools for integrating and searching membrane property data correlated with neuronal morphology.

Authors:  J S Mirsky; P M Nadkarni; M D Healy; P L Miller; G M Shepherd
Journal:  J Neurosci Methods       Date:  1998-07-01       Impact factor: 2.390

3.  Text mining neuroscience journal articles to populate neuroscience databases.

Authors:  Chiquito J Crasto; Luis N Marenco; Michele Migliore; Buqing Mao; Prakash M Nadkarni; Perry Miller; Gordon M Shepherd
Journal:  Neuroinformatics       Date:  2003

4.  NMDA/AMPA ratio impacts state transitions and entrainment to oscillations in a computational model of the nucleus accumbens medium spiny projection neuron.

Authors:  John A Wolf; Jason T Moyer; Maciej T Lazarewicz; Diego Contreras; Marianne Benoit-Marand; Patricio O'Donnell; Leif H Finkel
Journal:  J Neurosci       Date:  2005-10-05       Impact factor: 6.167

5.  Pyramidal neurons switch from integrators in vitro to resonators under in vivo-like conditions.

Authors:  Steven A Prescott; Stéphanie Ratté; Yves De Koninck; Terrence J Sejnowski
Journal:  J Neurophysiol       Date:  2008-10-01       Impact factor: 2.714

6.  Big data: The future of biocuration.

Authors:  Doug Howe; Maria Costanzo; Petra Fey; Takashi Gojobori; Linda Hannick; Winston Hide; David P Hill; Renate Kania; Mary Schaeffer; Susan St Pierre; Simon Twigger; Owen White; Seung Yon Rhee
Journal:  Nature       Date:  2008-09-04       Impact factor: 49.962

7.  Abnormal Excitability of Oblique Dendrites Implicated in Early Alzheimer's: A Computational Study.

Authors:  Thomas M Morse; Nicholas T Carnevale; Pradeep G Mutalik; Michele Migliore; Gordon M Shepherd
Journal:  Front Neural Circuits       Date:  2010-05-31       Impact factor: 3.492

Review 8.  Getting started in text mining.

Authors:  K Bretonnel Cohen; Lawrence Hunter
Journal:  PLoS Comput Biol       Date:  2008-01       Impact factor: 4.475

9.  High speed two-photon imaging of calcium dynamics in dendritic spines: consequences for spine calcium kinetics and buffer capacity.

Authors:  L Niels Cornelisse; Ronald A J van Elburg; Rhiannon M Meredith; Rafael Yuste; Huibert D Mansvelder
Journal:  PLoS One       Date:  2007-10-24       Impact factor: 3.240

10.  Semi-automated curation of protein subcellular localization: a text mining-based approach to Gene Ontology (GO) Cellular Component curation.

Authors:  Kimberly Van Auken; Joshua Jaffery; Juancarlos Chan; Hans-Michael Müller; Paul W Sternberg
Journal:  BMC Bioinformatics       Date:  2009-07-21       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.