Literature DB >> 23077130

Knowledge-based biomedical word sense disambiguation: an evaluation and application to clinical document classification.

Vijay N Garla1, Cynthia Brandt.   

Abstract

BACKGROUND: Word sense disambiguation (WSD) methods automatically assign an unambiguous concept to an ambiguous term based on context, and are important to many text-processing tasks. In this study we developed and evaluated a knowledge-based WSD method that uses semantic similarity measures derived from the Unified Medical Language System (UMLS) and evaluated the contribution of WSD to clinical text classification.
METHODS: We evaluated our system on biomedical WSD datasets and determined the contribution of our WSD system to clinical document classification on the 2007 Computational Medicine Challenge corpus.
RESULTS: Our system compared favorably with other knowledge-based methods. Machine learning classifiers trained on disambiguated concepts significantly outperformed those trained using all concepts.
CONCLUSIONS: We developed a WSD system that achieves high disambiguation accuracy on standard biomedical WSD datasets and showed that our WSD system improves clinical document classification. DATA SHARING: We integrated our WSD system with MetaMap and the clinical Text Analysis and Knowledge Extraction System, two popular biomedical natural language processing systems. All codes required to reproduce our results and all tools developed as part of this study are released as open source, available under http://code.google.com/p/ytex.

Entities:  

Keywords:  Natural Language Processing; Semantic similarity; Word Sense Disambiguation

Mesh:

Year:  2012        PMID: 23077130      PMCID: PMC3756260          DOI: 10.1136/amiajnl-2012-001350

Source DB:  PubMed          Journal:  J Am Med Inform Assoc        ISSN: 1067-5027            Impact factor:   4.497


  19 in total

1.  The NLM Indexing Initiative.

Authors:  A R Aronson; O Bodenreider; H F Chang; S M Humphrey; J G Mork; S J Nelson; T C Rindflesch; W J Wilbur
Journal:  Proc AMIA Symp       Date:  2000

2.  An overview of MetaMap: historical perspective and recent advances.

Authors:  Alan R Aronson; François-Michel Lang
Journal:  J Am Med Inform Assoc       Date:  2010 May-Jun       Impact factor: 4.497

3.  Evaluation of a generalizable approach to clinical information retrieval using the automated retrieval console (ARC).

Authors:  Leonard W D'Avolio; Thien M Nguyen; Wildon R Farwell; Yongming Chen; Felicia Fitzmeyer; Owen M Harris; Louis D Fiore
Journal:  J Am Med Inform Assoc       Date:  2010 Jul-Aug       Impact factor: 4.497

4.  Measures of semantic similarity and relatedness in the biomedical domain.

Authors:  Ted Pedersen; Serguei V S Pakhomov; Siddharth Patwardhan; Christopher G Chute
Journal:  J Biomed Inform       Date:  2006-06-10       Impact factor: 6.317

5.  Ontology-guided feature engineering for clinical text classification.

Authors:  Vijay N Garla; Cynthia Brandt
Journal:  J Biomed Inform       Date:  2012-05-09       Impact factor: 6.317

6.  Semantic similarity estimation in the biomedical domain: an ontology-based information-theoretic perspective.

Authors:  David Sánchez; Montserrat Batet
Journal:  J Biomed Inform       Date:  2011-04-02       Impact factor: 6.317

7.  Word Sense Disambiguation by Selecting the Best Semantic Type Based on Journal Descriptor Indexing: Preliminary Experiment.

Authors:  Susanne M Humphrey; Willie J Rogers; Halil Kilicoglu; Dina Demner-Fushman; Thomas C Rindflesch
Journal:  J Am Soc Inf Sci Technol       Date:  2006-01-01

8.  Studying the correlation between different word sense disambiguation methods and summarization effectiveness in biomedical texts.

Authors:  Laura Plaza; Antonio J Jimeno-Yepes; Alberto Díaz; Alan R Aronson
Journal:  BMC Bioinformatics       Date:  2011-08-26       Impact factor: 3.169

9.  Machine-learned solutions for three stages of clinical information extraction: the state of the art at i2b2 2010.

Authors:  Berry de Bruijn; Colin Cherry; Svetlana Kiritchenko; Joel Martin; Xiaodan Zhu
Journal:  J Am Med Inform Assoc       Date:  2011-05-12       Impact factor: 4.497

10.  Exploiting MeSH indexing in MEDLINE to generate a data set for word sense disambiguation.

Authors:  Antonio J Jimeno-Yepes; Bridget T McInnes; Alan R Aronson
Journal:  BMC Bioinformatics       Date:  2011-06-02       Impact factor: 3.169

View more
  11 in total

1.  Electronic health records-driven phenotyping: challenges, recent advances, and perspectives.

Authors:  Jyotishman Pathak; Abel N Kho; Joshua C Denny
Journal:  J Am Med Inform Assoc       Date:  2013-12       Impact factor: 4.497

2.  Evaluating measures of semantic similarity and relatedness to disambiguate terms in biomedical text.

Authors:  Bridget T McInnes; Ted Pedersen
Journal:  J Biomed Inform       Date:  2013-09-04       Impact factor: 6.317

3.  Corpus domain effects on distributional semantic modeling of medical terms.

Authors:  Serguei V S Pakhomov; Greg Finley; Reed McEwan; Yan Wang; Genevieve B Melton
Journal:  Bioinformatics       Date:  2016-08-16       Impact factor: 6.937

4.  Evaluating the state of the art in disorder recognition and normalization of the clinical narrative.

Authors:  Sameer Pradhan; Noémie Elhadad; Brett R South; David Martinez; Lee Christensen; Amy Vogel; Hanna Suominen; Wendy W Chapman; Guergana Savova
Journal:  J Am Med Inform Assoc       Date:  2014-08-21       Impact factor: 4.497

5.  Mapping Phenotypic Information in Heterogeneous Textual Sources to a Domain-Specific Terminological Resource.

Authors:  Noha Alnazzawi; Paul Thompson; Sophia Ananiadou
Journal:  PLoS One       Date:  2016-09-19       Impact factor: 3.240

Review 6.  Semantic annotation in biomedicine: the current landscape.

Authors:  Jelena Jovanović; Ebrahim Bagheri
Journal:  J Biomed Semantics       Date:  2017-09-22

7.  Clinical text classification with rule-based features and knowledge-guided convolutional neural networks.

Authors:  Liang Yao; Chengsheng Mao; Yuan Luo
Journal:  BMC Med Inform Decis Mak       Date:  2019-04-04       Impact factor: 2.796

8.  Complexities, variations, and errors of numbering within clinical notes: the potential impact on information extraction and cohort-identification.

Authors:  David A Hanauer; Qiaozhu Mei; V G Vinod Vydiswaran; Karandeep Singh; Zach Landis-Lewis; Chunhua Weng
Journal:  BMC Med Inform Decis Mak       Date:  2019-04-04       Impact factor: 2.796

9.  Enhancing ontology-driven diagnostic reasoning with a symptom-dependency-aware Naïve Bayes classifier.

Authors:  Ying Shen; Yaliang Li; Hai-Tao Zheng; Buzhou Tang; Min Yang
Journal:  BMC Bioinformatics       Date:  2019-06-13       Impact factor: 3.169

10.  Development of phenotype algorithms using electronic medical records and incorporating natural language processing.

Authors:  Katherine P Liao; Tianxi Cai; Guergana K Savova; Shawn N Murphy; Elizabeth W Karlson; Ashwin N Ananthakrishnan; Vivian S Gainer; Stanley Y Shaw; Zongqi Xia; Peter Szolovits; Susanne Churchill; Isaac Kohane
Journal:  BMJ       Date:  2015-04-24
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.