A simple algorithm for identifying abbreviation definitions in biomedical text.

Ariel S Schwartz, Marti A Hearst.

Abstract

The volume of biomedical text is growing at a fast rate, creating challenges for humans and computer systems alike. One of these challenges arises from the frequent use of novel abbreviations in these texts, thus requiring that biomedical lexical ontologies be continually updated. In this paper we show that the problem of identifying abbreviations' definitions can be solved with a much simpler algorithm than that proposed by other research efforts. The algorithm achieves 96% precision and 82% recall on a standard test collection, which is at least as good as existing approaches. It also achieves 95% precision and 82% recall on another, larger test set. A notable advantage of the algorithm is that, unlike other approaches, it does not require any training data.
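The abstract does not spell out the steps of the algorithm. As a rough illustration only, the sketch below follows the right-to-left character matching described in the full paper, ported to Python. The function and parameter names are illustrative assumptions, and the step that extracts candidate pairs around parentheses, along with the paper's additional validity filters, is omitted.

```python
from typing import Optional


def find_best_long_form(short_form: str, candidate: str) -> Optional[str]:
    """Return the shortest suffix of `candidate` that can define `short_form`.

    Characters of the short form are matched from right to left against the
    candidate text, in order and case-insensitively. The first character of
    the short form must match the first character of a word in the candidate.
    Returns None when no such alignment exists.
    """
    s = len(short_form) - 1   # index into the short form
    l = len(candidate) - 1    # index into the candidate long form

    while s >= 0:
        ch = short_form[s].lower()
        # Skip punctuation inside the short form (hyphens, slashes, etc.).
        if not ch.isalnum():
            s -= 1
            continue
        # Move left until the character matches. For the first character of
        # the short form, also require that the match starts a word.
        while (l >= 0 and candidate[l].lower() != ch) or (
            s == 0 and l > 0 and candidate[l - 1].isalnum()
        ):
            l -= 1
        if l < 0:
            return None
        l -= 1
        s -= 1

    # Extend the match back to the beginning of the word it starts in.
    start = candidate.rfind(" ", 0, l + 1) + 1
    return candidate[start:]
```

For example, calling `find_best_long_form("HMM", "a hidden Markov model")` trims the leading article and returns the phrase starting at "hidden". No training data is involved, which matches the advantage the abstract highlights.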

Year:  2003        PMID: 12603049

Source DB:  PubMed          Journal:  Pac Symp Biocomput        ISSN: 2335-6928


  97 in total

1.  A simple and practical dictionary-based approach for identification of proteins in Medline abstracts.

Authors:  Sergei Egorov; Anton Yuryev; Nikolai Daraselia
Journal:  J Am Med Inform Assoc       Date:  2004-02-05       Impact factor: 4.497

2.  Parenthetically speaking: classifying the contents of parentheses for text mining.

Authors:  K Bretonnel Cohen; Thomas Christiansen; Lawrence E Hunter
Journal:  AMIA Annu Symp Proc       Date:  2011-10-22

3.  Using UMLS lexical resources to disambiguate abbreviations in clinical text.

Authors:  Youngjun Kim; John Hurdle; Stéphane M Meystre
Journal:  AMIA Annu Symp Proc       Date:  2011-10-22

4.  Using text mining to link journal articles to neuroanatomical databases.

Authors:  Leon French; Paul Pavlidis
Journal:  J Comp Neurol       Date:  2012-06-01       Impact factor: 3.215

5.  Translating biology: text mining tools that work.

Authors:  K Bretonnel Cohen; Hong Yu; Philip E Bourne; Lynette Hirschman
Journal:  Pac Symp Biocomput       Date:  2008-01-01

6.  An overview of MetaMap: historical perspective and recent advances.

Authors:  Alan R Aronson; François-Michel Lang
Journal:  J Am Med Inform Assoc       Date:  2010 May-Jun       Impact factor: 4.497

7.  ALICE: an algorithm to extract abbreviations from MEDLINE.

Authors:  Hiroko Ao; Toshihisa Takagi
Journal:  J Am Med Inform Assoc       Date:  2005-05-19       Impact factor: 4.497

8.  BioTagger-GM: a gene/protein name recognition system.

Authors:  Manabu Torii; Zhangzhi Hu; Cathy H Wu; Hongfang Liu
Journal:  J Am Med Inform Assoc       Date:  2008-12-11       Impact factor: 4.497

9.  Enhancing acronym/abbreviation knowledge bases with semantic information.

Authors:  Manabu Torii; Hongfang Liu
Journal:  AMIA Annu Symp Proc       Date:  2007-10-11

10.  Rule-based deduplication of article records from bibliographic databases.

Authors:  Yu Jiang; Can Lin; Weiyi Meng; Clement Yu; Aaron M Cohen; Neil R Smalheiser
Journal:  Database (Oxford)       Date:  2014-01-16       Impact factor: 3.451
