Literature DB >> 15905486

ALICE: an algorithm to extract abbreviations from MEDLINE.

Hiroko Ao1, Toshihisa Takagi.   

Abstract

OBJECTIVE: To help biomedical researchers recognize dynamically introduced abbreviations in biomedical literature, such as gene and protein names, we have constructed a support system called ALICE (Abbreviation LIfter using Corpus-based Extraction). ALICE aims to extract all types of abbreviations with their expansions from a target paper on the fly.
METHODS: ALICE extracts an abbreviation and its expansion from the literature by using heuristic pattern-matching rules. This system consists of three phases and potentially identifies valid 320 abbreviation-expansion patterns as combinations of the rules.
RESULTS: It achieved 95% recall and 97% precision on randomly selected titles and abstracts from the MEDLINE database.
CONCLUSION: ALICE extracted abbreviations and their expansions from the literature efficiently. The subtly compiled heuristics enabled it to extract abbreviations with high recall without significantly reducing precision. ALICE does not only facilitate recognition of an undefined abbreviation in a paper by constructing an abbreviation database or dictionary, but also makes biomedical literature retrieval more accurate. This system is freely available at http://uvdb3.hgc.jp/ALICE/ALICE_index.html.

Mesh:

Year:  2005        PMID: 15905486      PMCID: PMC1205607          DOI: 10.1197/jamia.M1757

Source DB:  PubMed          Journal:  J Am Med Inform Assoc        ISSN: 1067-5027            Impact factor:   4.497


  8 in total

1.  PNAD-CSS: a workbench for constructing a protein name abbreviation dictionary.

Authors:  M Yoshida; K Fukuda; T Takagi
Journal:  Bioinformatics       Date:  2000-02       Impact factor: 6.937

2.  Mapping abbreviations to full forms in biomedical articles.

Authors:  Hong Yu; George Hripcsak; Carol Friedman
Journal:  J Am Med Inform Assoc       Date:  2002 May-Jun       Impact factor: 4.497

3.  Automatic extraction of acronym-meaning pairs from MEDLINE databases.

Authors:  J Pustejovsky; J Castaño; B Cochran; M Kotecki; M Morrell
Journal:  Stud Health Technol Inform       Date:  2001

4.  Mining terminological knowledge in large biomedical corpora.

Authors:  Hongfang Liu; Carol Friedman
Journal:  Pac Symp Biocomput       Date:  2003

5.  A simple algorithm for identifying abbreviation definitions in biomedical text.

Authors:  Ariel S Schwartz; Marti A Hearst
Journal:  Pac Symp Biocomput       Date:  2003

6.  Creating an online dictionary of abbreviations from MEDLINE.

Authors:  Jeffrey T Chang; Hinrich Schütze; Russ B Altman
Journal:  J Am Med Inform Assoc       Date:  2002 Nov-Dec       Impact factor: 4.497

7.  A study of abbreviations in MEDLINE abstracts.

Authors:  Hongfang Liu; Alan R Aronson; Carol Friedman
Journal:  Proc AMIA Symp       Date:  2002

8.  Heuristics for identification of acronym-definition patterns within text: towards an automated construction of comprehensive acronym-definition dictionaries.

Authors:  J D Wren; H R Garner
Journal:  Methods Inf Med       Date:  2002       Impact factor: 2.176

  8 in total
  16 in total

1.  Enhancing acronym/abbreviation knowledge bases with semantic information.

Authors:  Manabu Torii; Hongfang Liu
Journal:  AMIA Annu Symp Proc       Date:  2007-10-11

2.  A sense inventory for clinical abbreviations and acronyms created using clinical notes and medical dictionary resources.

Authors:  Sungrim Moon; Serguei Pakhomov; Nathan Liu; James O Ryan; Genevieve B Melton
Journal:  J Am Med Inform Assoc       Date:  2013-06-27       Impact factor: 4.497

Review 3.  Recent progress in automatically extracting information from the pharmacogenomic literature.

Authors:  Yael Garten; Adrien Coulet; Russ B Altman
Journal:  Pharmacogenomics       Date:  2010-10       Impact factor: 2.533

4.  MBA: a literature mining system for extracting biomedical abbreviations.

Authors:  Yun Xu; ZhiHao Wang; YiMing Lei; YuZhong Zhao; Yu Xue
Journal:  BMC Bioinformatics       Date:  2009-01-09       Impact factor: 3.169

5.  Discriminative application of string similarity methods to chemical and non-chemical names for biomedical abbreviation clustering.

Authors:  Atsuko Yamaguchi; Yasunori Yamamoto; Jin-Dong Kim; Toshihisa Takagi; Akinori Yonezawa
Journal:  BMC Genomics       Date:  2012-06-11       Impact factor: 3.969

6.  TPX: Biomedical literature search made easy.

Authors:  Thomas Joseph; Vangala G Saipradeep; Ganesh Sekar Venkat Raghavan; Rajgopal Srinivasan; Aditya Rao; Sujatha Kotte; Naveen Sivadasan
Journal:  Bioinformation       Date:  2012-06-28

7.  Allie: a database and a search service of abbreviations and long forms.

Authors:  Yasunori Yamamoto; Atsuko Yamaguchi; Hidemasa Bono; Toshihisa Takagi
Journal:  Database (Oxford)       Date:  2011-04-15       Impact factor: 3.451

8.  Improved biomedical word embeddings in the transformer era.

Authors:  Jiho Noh; Ramakanth Kavuluru
Journal:  J Biomed Inform       Date:  2021-07-18       Impact factor: 8.000

9.  Building Linked Open Data towards integration of biomedical scientific literature with DBpedia.

Authors:  Yasunori Yamamoto; Atsuko Yamaguchi; Akinori Yonezawa
Journal:  J Biomed Semantics       Date:  2013-03-13

10.  Getting more out of biomedical documents with GATE's full lifecycle open source text analytics.

Authors:  Hamish Cunningham; Valentin Tablan; Angus Roberts; Kalina Bontcheva
Journal:  PLoS Comput Biol       Date:  2013-02-07       Impact factor: 4.475

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.