Analysis of biomedical text for chemical names: a comparison of three methods.

W J Wilbur, G F Hazard, G Divita, J G Mork, A R Aronson, A C Browne.

Abstract

At the National Library of Medicine (NLM), a variety of biomedical vocabularies are found in data pertinent to its mission. In addition to standard medical terminology, there are specialized vocabularies, including that of chemical nomenclature. Ordinary language tools, including the lexically based ones used by the Unified Medical Language System (UMLS) to manipulate and normalize text, do not work well on chemical nomenclature. To improve NLM's capabilities in chemical text processing, two approaches to the problem of recognizing chemical nomenclature were explored. The first approach was lexical and consisted of analyzing text for the presence of a fixed set of chemical segments. This approach was extended with general chemical patterns and with terms from NLM's indexing vocabulary, MeSH, and the NLM SPECIALIST lexicon. The second approach applied Bayesian classification to n-grams of text via two different methods. The single lexical method and the two statistical methods were tested against data from the 1999 UMLS Metathesaurus. One of the statistical methods had an overall classification accuracy of 97%.
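
The abstract does not spell out how the Bayesian classification over n-grams was carried out, so the following is only a minimal sketch of one common way to do it: a two-class naive Bayes model over character trigrams with add-one smoothing. The class name `NgramNaiveBayes`, the helper `char_ngrams`, the padding characters, and the labels `"chemical"` and `"other"` are all assumptions made for illustration, not details taken from the paper.

```python
import math
from collections import Counter


def char_ngrams(term, n=3):
    """Return overlapping character n-grams of a lowercased term.

    The '^' and '$' padding marks (an assumption of this sketch) let
    n-grams capture how a term starts and ends.
    """
    padded = f"^{term.lower()}$"
    return [padded[i:i + n] for i in range(len(padded) - n + 1)]


class NgramNaiveBayes:
    """Hypothetical naive Bayes classifier over character n-grams."""

    def __init__(self, n=3):
        self.n = n
        self.counts = {}   # label -> Counter of n-gram frequencies
        self.totals = {}   # label -> total number of n-grams seen
        self.priors = {}   # label -> number of training terms
        self.vocab = set() # all distinct n-grams seen in training

    def fit(self, terms, labels):
        for term, label in zip(terms, labels):
            grams = char_ngrams(term, self.n)
            self.counts.setdefault(label, Counter()).update(grams)
            self.priors[label] = self.priors.get(label, 0) + 1
            self.vocab.update(grams)
        self.totals = {lab: sum(c.values()) for lab, c in self.counts.items()}
        return self

    def log_score(self, term, label):
        """Log prior plus summed log likelihoods (add-one smoothed)."""
        score = math.log(self.priors[label] / sum(self.priors.values()))
        denom = self.totals[label] + len(self.vocab)
        for gram in char_ngrams(term, self.n):
            score += math.log((self.counts[label][gram] + 1) / denom)
        return score

    def predict(self, term):
        """Return the label with the highest log score."""
        return max(self.priors, key=lambda lab: self.log_score(term, lab))
```

A toy run might look like `NgramNaiveBayes().fit(terms, labels).predict("trimethylbenzene")`, where `terms` is a small list of labelled example strings. Such a training set would be invented for demonstration; the study itself was evaluated on data from the 1999 UMLS Metathesaurus.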

Year: 1999 PMID: 10566344 PMCID: PMC2232672

Source DB: PubMed Journal: Proc AMIA Symp ISSN: 1531-605X
