Literature DB >> 29399672

Knowledge-Based Biomedical Word Sense Disambiguation with Neural Concept Embeddings

Akm Sabbir1, Antonio Jimeno-Yepes2, Ramakanth Kavuluru3.   

Abstract

Biomedical word sense disambiguation (WSD) is an important intermediate task in many natural language processing applications such as named entity recognition, syntactic parsing, and relation extraction. In this paper, we employ knowledge-based approaches that also exploit recent advances in neural word/concept embeddings to improve over the state-of-the-art in biomedical WSD using the public MSH WSD dataset [1] as the test set. Our methods involve weak supervision - we do not use any hand-labeled examples for WSD to build our prediction models; however, we employ an existing concept mapping program, MetaMap, to obtain our concept vectors. Over the MSH WSD dataset, our linear time (in terms of numbers of senses and words in the test instance) method achieves an accuracy of 92.24% which is a 3% improvement over the best known results [2] obtained via unsupervised means. A more expensive approach that we developed relies on a nearest neighbor framework and achieves accuracy of 94.34%, essentially cutting the error rate in half. Employing dense vector representations learned from unlabeled free text has been shown to benefit many language processing tasks recently and our efforts show that biomedical WSD is no exception to this trend. For a complex and rapidly evolving domain such as biomedicine, building labeled datasets for larger sets of ambiguous terms may be impractical. Here, we show that weak supervision that leverages recent advances in representation learning can rival supervised approaches in biomedical WSD. However, external knowledge bases (here sense inventories) play a key role in the improvements achieved.

Entities:  

Year:  2018        PMID: 29399672      PMCID: PMC5792196          DOI: 10.1109/BIBE.2017.00-61

Source DB:  PubMed          Journal:  Proc IEEE Int Symp Bioinformatics Bioeng


  25 in total

1.  A multi-aspect comparison study of supervised word sense disambiguation.

Authors:  Hongfang Liu; Virginia Teller; Carol Friedman
Journal:  J Am Med Inform Assoc       Date:  2004-04-02       Impact factor: 4.497

2.  An overview of MetaMap: historical perspective and recent advances.

Authors:  Alan R Aronson; François-Michel Lang
Journal:  J Am Med Inform Assoc       Date:  2010 May-Jun       Impact factor: 4.497

Review 3.  Utilizing social media data for pharmacovigilance: A review.

Authors:  Abeed Sarker; Rachel Ginn; Azadeh Nikfarjam; Karen O'Connor; Karen Smith; Swetha Jayaraman; Tejaswi Upadhaya; Graciela Gonzalez
Journal:  J Biomed Inform       Date:  2015-02-23       Impact factor: 6.317

4.  Abbreviation and acronym disambiguation in clinical discourse.

Authors:  Sergeui Pakhomov; Ted Pedersen; Christopher G Chute
Journal:  AMIA Annu Symp Proc       Date:  2005

5.  Knowledge based word-concept model estimation and refinement for biomedical text mining.

Authors:  Antonio Jimeno Yepes; Rafael Berlanga
Journal:  J Biomed Inform       Date:  2014-12-12       Impact factor: 6.317

6.  Word embeddings and recurrent neural networks based on Long-Short Term Memory nodes in supervised biomedical word sense disambiguation.

Authors:  Antonio Jimeno Yepes
Journal:  J Biomed Inform       Date:  2017-08-07       Impact factor: 6.317

7.  Hyperdimensional computing approach to word sense disambiguation.

Authors:  Bjoern-Toby Berster; J Caleb Goodwin; Trevor Cohen
Journal:  AMIA Annu Symp Proc       Date:  2012-11-03

8.  Evaluating measures of semantic similarity and relatedness to disambiguate terms in biomedical text.

Authors:  Bridget T McInnes; Ted Pedersen
Journal:  J Biomed Inform       Date:  2013-09-04       Impact factor: 6.317

9.  Corpus domain effects on distributional semantic modeling of medical terms.

Authors:  Serguei V S Pakhomov; Greg Finley; Reed McEwan; Yan Wang; Genevieve B Melton
Journal:  Bioinformatics       Date:  2016-08-16       Impact factor: 6.937

10.  An empirical evaluation of supervised learning approaches in assigning diagnosis codes to electronic medical records.

Authors:  Ramakanth Kavuluru; Anthony Rios; Yuan Lu
Journal:  Artif Intell Med       Date:  2015-05-15       Impact factor: 5.326

View more
  5 in total

1.  deepBioWSD: effective deep neural word sense disambiguation of biomedical text data.

Authors:  Ahmad Pesaranghader; Stan Matwin; Marina Sokolova; Ali Pesaranghader
Journal:  J Am Med Inform Assoc       Date:  2019-05-01       Impact factor: 4.497

2.  A clinical text classification paradigm using weak supervision and deep representation.

Authors:  Yanshan Wang; Sunghwan Sohn; Sijia Liu; Feichen Shen; Liwei Wang; Elizabeth J Atkinson; Shreyasee Amin; Hongfang Liu
Journal:  BMC Med Inform Decis Mak       Date:  2019-01-07       Impact factor: 2.796

3.  Biomedical word sense disambiguation with bidirectional long short-term memory and attention-based neural networks.

Authors:  Canlin Zhang; Daniel Biś; Xiuwen Liu; Zhe He
Journal:  BMC Bioinformatics       Date:  2019-12-02       Impact factor: 3.169

4.  Improved biomedical word embeddings in the transformer era.

Authors:  Jiho Noh; Ramakanth Kavuluru
Journal:  J Biomed Inform       Date:  2021-07-18       Impact factor: 8.000

5.  Automated Coding of Under-Studied Medical Concept Domains: Linking Physical Activity Reports to the International Classification of Functioning, Disability, and Health.

Authors:  Denis Newman-Griffis; Eric Fosler-Lussier
Journal:  Front Digit Health       Date:  2021-03-10
  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.