
Text mining neuroscience journal articles to populate neuroscience databases.

Chiquito J Crasto, Luis N Marenco, Michele Migliore, Buqing Mao, Prakash M Nadkarni, Perry Miller, Gordon M Shepherd.

Abstract

We have developed a program, NeuroText, to populate the neuroscience databases in SenseLab (http://senselab.med.yale.edu/senselab) by mining the natural language text of neuroscience articles. NeuroText uses a two-step approach to identify relevant articles. The first step (pre-processing), aimed at 100% sensitivity, identifies abstracts containing database keywords. In the second step, the potentially relevant abstracts identified in the first step are processed for specificity, as dictated by the database architecture and by neuroscience, lexical, and semantic contexts. NeuroText results were presented to experts for validation through a dynamically generated interface that also allows expert-validated articles to be deposited automatically into the databases. Of the 912 articles in the test set, 735 were rejected at the pre-processing step. For the remaining articles, the accuracy of predicting database-relevant articles was 85%. Twenty-two articles were erroneously identified, and NeuroText deferred decisions on 29 articles to the expert. A comparison of NeuroText results with the experts' analyses revealed that the program failed to identify an article's relevance correctly either because the relevant concepts did not yet exist in the knowledgebase or because the abstract presented the information vaguely. NeuroText uses two "evolution" techniques (supervised and unsupervised) that play an important role in the continual improvement of the retrieval results. Software that uses the NeuroText approach can facilitate the creation of curated, special-interest bibliography databases.
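The two-step approach described in the abstract can be illustrated with a minimal Python sketch. This is not the NeuroText implementation: the keyword sets, the negation cues, the decision rules, and every name below (`DATABASE_KEYWORDS`, `CONTEXT_TERMS`, `NEGATION_CUES`, `triage`) are hypothetical stand-ins chosen for illustration. The sketch only shows the general shape of a permissive keyword screen followed by a stricter context check that can defer unclear cases to an expert.

```python
import re
from enum import Enum


class Decision(Enum):
    REJECTED_AT_PREPROCESSING = "rejected at pre-processing"
    RELEVANT = "relevant"
    NOT_RELEVANT = "not relevant"
    DEFERRED_TO_EXPERT = "deferred to expert"


# Hypothetical vocabularies: the actual SenseLab keyword lists and
# context rules are not given in the abstract.
DATABASE_KEYWORDS = {"receptor", "channel", "neurotransmitter"}
CONTEXT_TERMS = {"dendrite", "soma", "axon"}
NEGATION_CUES = {"no", "not", "absence", "lack"}


def tokenize(text: str) -> list[str]:
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def preprocess(abstract: str) -> set[str]:
    """Step 1: permissive screen. Return the database keywords found.

    Exact token matching is used here for brevity; a screen aiming at
    high sensitivity would also need plurals and other variants.
    """
    return DATABASE_KEYWORDS.intersection(tokenize(abstract))


def triage(abstract: str) -> Decision:
    """Run step 1, then an assumed step 2 context check."""
    if not preprocess(abstract):
        return Decision.REJECTED_AT_PREPROCESSING

    tokens = tokenize(abstract)
    has_context = any(t in CONTEXT_TERMS for t in tokens)
    has_negation = any(t in NEGATION_CUES for t in tokens)

    # Step 2 (assumed rules): a keyword in the right context is relevant,
    # a negated mention is unclear and goes to the expert, and a keyword
    # without any supporting context is treated as not relevant.
    if has_context and has_negation:
        return Decision.DEFERRED_TO_EXPERT
    if has_context:
        return Decision.RELEVANT
    return Decision.NOT_RELEVANT


if __name__ == "__main__":
    examples = [
        "We recorded hippocampal field potentials in vivo.",
        "The NMDA receptor is expressed in the distal dendrite.",
        "The receptor was not detected in the soma.",
        "A receptor binding assay in liver tissue.",
    ]
    for text in examples:
        print(f"{triage(text).value}: {text}")
```

On the reported counts, 912 - 735 = 177 articles reached the second step. The abstract does not say how the 85% accuracy was computed. One reading, offered here only as an inference, is that it covers the 177 - 29 = 148 articles on which NeuroText made a decision: with 22 errors, 126 of 148 is about 85%.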


Year:  2003        PMID: 15046245     DOI: 10.1385/NI:1:3:215

Source DB:  PubMed          Journal:  Neuroinformatics        ISSN: 1539-2791


