Literature DB >> 29584896

Interactive medical word sense disambiguation through informed learning.

Yue Wang1, Kai Zheng2, Hua Xu3, Qiaozhu Mei1,4.   

Abstract

Objective: Medical word sense disambiguation (WSD) is challenging and often requires significant training with data labeled by domain experts. This work aims to develop an interactive learning algorithm that makes efficient use of expert's domain knowledge in building high-quality medical WSD models with minimal human effort.
Methods: We developed an interactive learning algorithm with expert labeling instances and features. An expert can provide supervision in 3 ways: labeling instances, specifying indicative words of a sense, and highlighting supporting evidence in a labeled instance. The algorithm learns from these labels and iteratively selects the most informative instances to ask for future labels. Our evaluation used 3 WSD corpora: 198 ambiguous terms from Medical Subject Headings (MSH) as MEDLINE indexing terms, 74 ambiguous abbreviations in clinical notes from the University of Minnesota (UMN), and 24 ambiguous abbreviations in clinical notes from Vanderbilt University Hospital (VUH). For each ambiguous term and each learning algorithm, a learning curve that plots the accuracy on the test set against the number of labeled instances was generated. The area under the learning curve was used as the primary evaluation metric.
Results: Our interactive learning algorithm significantly outperformed active learning, the previous fastest learning algorithm for medical WSD. Compared to active learning, it achieved 90% accuracy for the MSH corpus with 42% less labeling effort, 35% less labeling effort for the UMN corpus, and 16% less labeling effort for the VUH corpus. Conclusions: High-quality WSD models can be efficiently trained with minimal supervision by inviting experts to label informative instances and provide domain knowledge through labeling/highlighting contextual features.

Entities:  

Mesh:

Year:  2018        PMID: 29584896      PMCID: PMC6658868          DOI: 10.1093/jamia/ocy013

Source DB:  PubMed          Journal:  J Am Med Inform Assoc        ISSN: 1067-5027            Impact factor:   4.497


  15 in total

1.  Disambiguating ambiguous biomedical terms in biomedical narrative text: an unsupervised method.

Authors:  H Liu; Y A Lussier; C Friedman
Journal:  J Biomed Inform       Date:  2001-08       Impact factor: 6.317

2.  Automatic resolution of ambiguous terms based on machine learning and conceptual relations in the UMLS.

Authors:  Hongfang Liu; Stephen B Johnson; Carol Friedman
Journal:  J Am Med Inform Assoc       Date:  2002 Nov-Dec       Impact factor: 4.497

3.  A multi-aspect comparison study of supervised word sense disambiguation.

Authors:  Hongfang Liu; Virginia Teller; Carol Friedman
Journal:  J Am Med Inform Assoc       Date:  2004-04-02       Impact factor: 4.497

Review 4.  Word sense disambiguation in the biomedical domain: an overview.

Authors:  Martijn J Schuemie; Jan A Kors; Barend Mons
Journal:  J Comput Biol       Date:  2005-06       Impact factor: 1.479

5.  Using MEDLINE as a knowledge source for disambiguating abbreviations and acronyms in full-text biomedical journal articles.

Authors:  Hong Yu; Won Kim; Vasileios Hatzivassiloglou; W John Wilbur
Journal:  J Biomed Inform       Date:  2006-06-07       Impact factor: 6.317

6.  Clinical Word Sense Disambiguation with Interactive Search and Classification.

Authors:  Yue Wang; Kai Zheng; Hua Xu; Qiaozhu Mei
Journal:  AMIA Annu Symp Proc       Date:  2017-02-10

7.  Combining corpus-derived sense profiles with estimated frequency information to disambiguate clinical abbreviations.

Authors:  Hua Xu; Peter D Stetson; Carol Friedman
Journal:  AMIA Annu Symp Proc       Date:  2012-11-03

Review 8.  Evaluating temporal relations in clinical text: 2012 i2b2 Challenge.

Authors:  Weiyi Sun; Anna Rumshisky; Ozlem Uzuner
Journal:  J Am Med Inform Assoc       Date:  2013-04-05       Impact factor: 4.497

9.  Exploiting MeSH indexing in MEDLINE to generate a data set for word sense disambiguation.

Authors:  Antonio J Jimeno-Yepes; Bridget T McInnes; Alan R Aronson
Journal:  BMC Bioinformatics       Date:  2011-06-02       Impact factor: 3.169

10.  Machine learning and word sense disambiguation in the biomedical domain: design and evaluation issues.

Authors:  Hua Xu; Marianthi Markatou; Rositsa Dimova; Hongfang Liu; Carol Friedman
Journal:  BMC Bioinformatics       Date:  2006-07-05       Impact factor: 3.169

View more
  4 in total

1.  deepBioWSD: effective deep neural word sense disambiguation of biomedical text data.

Authors:  Ahmad Pesaranghader; Stan Matwin; Marina Sokolova; Ali Pesaranghader
Journal:  J Am Med Inform Assoc       Date:  2019-05-01       Impact factor: 4.497

2.  Interactive NLP in Clinical Care: Identifying Incidental Findings in Radiology Reports.

Authors:  Gaurav Trivedi; Esmaeel R Dadashzadeh; Robert M Handzel; Wendy W Chapman; Shyam Visweswaran; Harry Hochheiser
Journal:  Appl Clin Inform       Date:  2019-09-04       Impact factor: 2.342

3.  Biomedical word sense disambiguation with bidirectional long short-term memory and attention-based neural networks.

Authors:  Canlin Zhang; Daniel Biś; Xiuwen Liu; Zhe He
Journal:  BMC Bioinformatics       Date:  2019-12-02       Impact factor: 3.169

4.  Leveraging medical context to recommend semantically similar terms for chart reviews.

Authors:  Cheng Ye; Bradley A Malin; Daniel Fabbri
Journal:  BMC Med Inform Decis Mak       Date:  2021-12-18       Impact factor: 2.796

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.