Literature DB >> 10566344

Analysis of biomedical text for chemical names: a comparison of three methods.

W J Wilbur1, G F Hazard, G Divita, J G Mork, A R Aronson, A C Browne.   

Abstract

At the National Library of Medicine (NLM), a variety of biomedical vocabularies are found in data pertinent to its mission. In addition to standard medical terminology, there are specialized vocabularies including that of chemical nomenclature. Normal language tools including the lexically based ones used by the Unified Medical Language System (UMLS) to manipulate and normalize text do not work well on chemical nomenclature. In order to improve NLM's capabilities in chemical text processing, two approaches to the problem of recognizing chemical nomenclature were explored. The first approach was a lexical one and consisted of analyzing text for the presence of a fixed set of chemical segments. The approach was extended with general chemical patterns and also with terms from NLM's indexing vocabulary, MeSH, and the NLM SPECIALIST lexicon. The second approach applied Bayesian classification to n-grams of text via two different methods. The single lexical method and two statistical methods were tested against data from the 1999 UMLS Metathesaurus. One of the statistical methods had an overall classification accuracy of 97%.

Mesh:

Substances:

Year:  1999        PMID: 10566344      PMCID: PMC2232672     

Source DB:  PubMed          Journal:  Proc AMIA Symp        ISSN: 1531-605X


  19 in total

1.  Effective mapping of biomedical text to the UMLS Metathesaurus: the MetaMap program.

Authors:  A R Aronson
Journal:  Proc AMIA Symp       Date:  2001

2.  An overview of MetaMap: historical perspective and recent advances.

Authors:  Alan R Aronson; François-Michel Lang
Journal:  J Am Med Inform Assoc       Date:  2010 May-Jun       Impact factor: 4.497

3.  Shape identification within the SPECIALIST textTools.

Authors:  Guy Divita; Sayfi Yousef
Journal:  AMIA Annu Symp Proc       Date:  2006

4.  Automated annotation of chemical names in the literature with tunable accuracy.

Authors:  Jun D Zhang; Lewis Y Geer; Evan E Bolton; Stephen H Bryant
Journal:  J Cheminform       Date:  2011-11-22       Impact factor: 5.514

5.  Mining metabolites: extracting the yeast metabolome from the literature.

Authors:  Chikashi Nobata; Paul D Dobson; Syed A Iqbal; Pedro Mendes; Jun'ichi Tsujii; Douglas B Kell; Sophia Ananiadou
Journal:  Metabolomics       Date:  2010-10-31       Impact factor: 4.290

6.  Tunable machine vision-based strategy for automated annotation of chemical databases.

Authors:  Jungkap Park; Gus R Rosania; Kazuhiro Saitou
Journal:  J Chem Inf Model       Date:  2009-08       Impact factor: 4.956

7.  Seeking a new biology through text mining.

Authors:  Andrey Rzhetsky; Michael Seringhaus; Mark Gerstein
Journal:  Cell       Date:  2008-07-11       Impact factor: 41.582

8.  An automated framework for hypotheses generation using literature.

Authors:  Vida Abedi; Ramin Zand; Mohammed Yeasin; Fazle Elahi Faisal
Journal:  BioData Min       Date:  2012-08-29       Impact factor: 2.522

9.  Concept Discovery for Pathology Reports using an N-gram Model.

Authors:  Vincent Yip; Mutlu Mete; Umit Topaloglu; Sinan Kockara
Journal:  Summit Transl Bioinform       Date:  2010-03-01

10.  The Text-mining based PubChem Bioassay neighboring analysis.

Authors:  Lianyi Han; Tugba O Suzek; Yanli Wang; Steve H Bryant
Journal:  BMC Bioinformatics       Date:  2010-11-08       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.