Literature DB >> 29573845

Co-occurrence graphs for word sense disambiguation in the biomedical domain.

Andres Duque1, Mark Stevenson2, Juan Martinez-Romo3, Lourdes Araujo4.   

Abstract

Word sense disambiguation is a key step for many natural language processing tasks (e.g. summarization, text classification, relation extraction) and presents a challenge to any system that aims to process documents from the biomedical domain. In this paper, we present a new graph-based unsupervised technique to address this problem. The knowledge base used in this work is a graph built with co-occurrence information from medical concepts found in scientific abstracts, and hence adapted to the specific domain. Unlike other unsupervised approaches based on static graphs such as UMLS, in this work the knowledge base takes the context of the ambiguous terms into account. Abstracts downloaded from PubMed are used for building the graph and disambiguation is performed using the personalized PageRank algorithm. Evaluation is carried out over two test datasets widely explored in the literature. Different parameters of the system are also evaluated to test robustness and scalability. Results show that the system is able to outperform state-of-the-art knowledge-based systems, obtaining more than 10% of accuracy improvement in some cases, while only requiring minimal external resources.
Copyright © 2018 Elsevier B.V. All rights reserved.

Entities:  

Keywords:  Graph-based systems; Information extraction; Natural language processing; Unified medical language system; Unsupervised machine learning; Word sense disambiguation

Mesh:

Year:  2018        PMID: 29573845     DOI: 10.1016/j.artmed.2018.03.002

Source DB:  PubMed          Journal:  Artif Intell Med        ISSN: 0933-3657            Impact factor:   5.326


  6 in total

1.  deepBioWSD: effective deep neural word sense disambiguation of biomedical text data.

Authors:  Ahmad Pesaranghader; Stan Matwin; Marina Sokolova; Ali Pesaranghader
Journal:  J Am Med Inform Assoc       Date:  2019-05-01       Impact factor: 4.497

2.  Knowledge-Based Biomedical Data Science.

Authors:  Tiffany J Callahan; Ignacio J Tripodi; Harrison Pielke-Lombardo; Lawrence E Hunter
Journal:  Annu Rev Biomed Data Sci       Date:  2020-04-07

3.  Biomedical word sense disambiguation with bidirectional long short-term memory and attention-based neural networks.

Authors:  Canlin Zhang; Daniel Biś; Xiuwen Liu; Zhe He
Journal:  BMC Bioinformatics       Date:  2019-12-02       Impact factor: 3.169

4.  A Year of Papers Using Biomedical Texts: Findings from the Section on Natural Language Processing of the IMIA Yearbook.

Authors:  Natalia Grabar; Cyril Grouin
Journal:  Yearb Med Inform       Date:  2019-08-16

5.  Word sense disambiguation using hybrid swarm intelligence approach.

Authors:  Wafaa Al-Saiagh; Sabrina Tiun; Ahmed Al-Saffar; Suryanti Awang; A S Al-Khaleefa
Journal:  PLoS One       Date:  2018-12-20       Impact factor: 3.240

6.  Improving broad-coverage medical entity linking with semantic type prediction and large-scale datasets.

Authors:  Shikhar Vashishth; Denis Newman-Griffis; Rishabh Joshi; Ritam Dutt; Carolyn P Rosé
Journal:  J Biomed Inform       Date:  2021-08-12       Impact factor: 6.317

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.