Biomedical word sense disambiguation with bidirectional long short-term memory and attention-based neural networks.

Canlin Zhang¹, Daniel Biś², Xiuwen Liu², Zhe He³.

Abstract

BACKGROUND: In recent years, deep learning methods have been applied to many natural language processing tasks to achieve state-of-the-art performance. However, in the biomedical domain, they have not outperformed supervised word sense disambiguation (WSD) methods based on support vector machines or random forests, possibly because of the inherent similarity of medical word senses.
RESULTS: In this paper, we propose two deep-learning-based models for supervised WSD: a model based on a bidirectional long short-term memory (BiLSTM) network, and an attention model based on the self-attention architecture. Our results show that the BiLSTM model with a suitable upper-layer structure outperforms the existing state-of-the-art models on the MSH WSD dataset, while our attention model is 3 to 4 times faster than our BiLSTM model and still achieves good accuracy. In addition, we trained "universal" models that disambiguate all ambiguous words together: in these models, the embedding of the target ambiguous word is concatenated to the max-pooled vector, acting as a "hint". Our universal BiLSTM model yielded about 90 percent accuracy.
CONCLUSION: Deep contextual models based on sequential information processing methods are able to capture the relative contextual information from pre-trained input word embeddings, in order to provide state-of-the-art results for supervised biomedical WSD tasks.
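
The "hint" mechanism mentioned in RESULTS can be illustrated with a minimal sketch. It assumes that the max-pooled vector is an element-wise maximum over per-token hidden states (for example, BiLSTM outputs) and that the result then feeds a classifier; the abstract does not spell out these details. The function names `max_pool` and `add_target_hint` and all numeric values are hypothetical and are not taken from the paper.

```python
from typing import List, Sequence

Vector = List[float]


def max_pool(hidden_states: Sequence[Vector]) -> Vector:
    """Element-wise maximum over time steps (assumed pooling scheme)."""
    if not hidden_states:
        raise ValueError("need at least one hidden state")
    dim = len(hidden_states[0])
    if any(len(h) != dim for h in hidden_states):
        raise ValueError("all hidden states must have the same dimension")
    return [max(h[i] for h in hidden_states) for i in range(dim)]


def add_target_hint(pooled: Vector, target_embedding: Vector) -> Vector:
    """Concatenate the target word's embedding to the pooled context vector."""
    return list(pooled) + list(target_embedding)


# Toy values (hypothetical): three tokens, hidden size 3, embedding size 2.
hidden_states = [
    [0.1, -0.4, 0.7],
    [0.5, 0.2, -0.3],
    [-0.2, 0.9, 0.0],
]
target_embedding = [1.0, 0.0]

pooled = max_pool(hidden_states)
features = add_target_hint(pooled, target_embedding)
print(features)  # prints [0.5, 0.9, 0.7, 1.0, 0.0]
```

Because the target embedding is appended rather than mixed into the pooled vector, a single shared classifier can in principle tell which ambiguous word it is being asked about, which is the role the abstract assigns to the "hint".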

Keywords: Biomedical; LSTM; Self-attention; Word sense disambiguation

Year: 2019 PMID: 31787096 PMCID: PMC6886160 DOI: 10.1186/s12859-019-3079-8

Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
