deepBioWSD: effective deep neural word sense disambiguation of biomedical text data.

Ahmad Pesaranghader^1,2, Stan Matwin^1,2, Marina Sokolova^2,3,4, Ali Pesaranghader^3.

Abstract

OBJECTIVE: In biomedicine, there is a wealth of information hidden in unstructured narratives such as research articles and clinical reports. To exploit these data properly, a word sense disambiguation (WSD) algorithm is needed to prevent downstream difficulties in the natural language processing pipeline. Supervised WSD algorithms largely outperform unsupervised, semisupervised, and knowledge-based methods; however, they train one separate classifier for each ambiguous term, which requires a large amount of expert-labeled training data, an unattainable goal in medical informatics. To alleviate this need, a single model that shares statistical strength across all instances and scales well with the vocabulary size is desirable.
MATERIALS AND METHODS: Built on recent advances in deep learning, our deepBioWSD model leverages a single bidirectional long short-term memory network that makes sense predictions for any ambiguous term. In the model, Unified Medical Language System sense embeddings are first computed from their text definitions; then, after the network is initialized with these embeddings, it is trained on all available training data collectively. The method also introduces a novel technique for automatically collecting training data from PubMed to (pre)train the network in an unsupervised manner.
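The idea of building sense embeddings from text definitions and scoring every ambiguous term with one shared function can be illustrated with a toy sketch. This is not the deepBioWSD network: the bidirectional long short-term memory encoder is replaced here by plain averaging of word vectors, and the word vectors, sense labels, and definitions below are all hypothetical values chosen for illustration.

```python
import math

# Hypothetical 3-dimensional word vectors (assumption); a real system would
# use pretrained embeddings with hundreds of dimensions.
WORD_VECTORS = {
    "virus": [1.0, 0.0, 0.0],
    "infection": [0.9, 0.1, 0.0],
    "nose": [0.8, 0.2, 0.0],
    "low": [0.0, 1.0, 0.0],
    "temperature": [0.0, 0.9, 0.1],
    "weather": [0.1, 0.8, 0.1],
}
DIM = 3


def embed(text):
    """Average the vectors of the known tokens in `text` (a stand-in encoder)."""
    vectors = [WORD_VECTORS[t] for t in text.lower().split() if t in WORD_VECTORS]
    if not vectors:
        return [0.0] * DIM
    return [sum(column) / len(vectors) for column in zip(*vectors)]


def cosine(u, v):
    """Cosine similarity; returns 0.0 if either vector is all zeros."""
    norm_u = math.sqrt(sum(x * x for x in u))
    norm_v = math.sqrt(sum(x * x for x in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return sum(x * y for x, y in zip(u, v)) / (norm_u * norm_v)


# Hypothetical sense inventory: each sense is embedded from its definition text.
SENSE_DEFINITIONS = {
    "cold (illness)": "virus infection of the nose",
    "cold (temperature)": "low temperature weather",
}
SENSE_EMBEDDINGS = {s: embed(d) for s, d in SENSE_DEFINITIONS.items()}


def disambiguate(context, candidate_senses):
    """One shared scorer for any term: pick the sense closest to the context."""
    context_vector = embed(context)
    return max(candidate_senses,
               key=lambda s: cosine(context_vector, SENSE_EMBEDDINGS[s]))


predicted = disambiguate("patient caught a cold after a virus infection",
                         list(SENSE_EMBEDDINGS))
```

Because the scorer takes the candidate senses as an argument, the same function serves every ambiguous term, which mirrors the single-model design described above without training one classifier per term.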
RESULTS: We use the MSH WSD dataset to compare WSD algorithms, with macro and micro accuracies employed as evaluation metrics. deepBioWSD outperforms existing models in biomedical text WSD, achieving a state-of-the-art macro accuracy of 96.82%.
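The two metrics differ in how they weight ambiguous terms. A minimal sketch, assuming the usual WSD convention that macro accuracy averages the per-term accuracies while micro accuracy pools all test instances; the function name, record layout, and sense labels are illustrative and not taken from the paper.

```python
from collections import defaultdict


def macro_micro_accuracy(records):
    """Compute (macro, micro) accuracy.

    `records` is an iterable of (ambiguous_term, gold_sense, predicted_sense).
    Macro: mean of per-term accuracies. Micro: correct instances / all instances.
    """
    correct = defaultdict(int)
    total = defaultdict(int)
    for term, gold, predicted in records:
        total[term] += 1
        correct[term] += int(gold == predicted)
    if not total:
        raise ValueError("records must not be empty")
    macro = sum(correct[t] / total[t] for t in total) / len(total)
    micro = sum(correct.values()) / sum(total.values())
    return macro, micro


# Hypothetical predictions: "cold" has 4 instances (3 correct),
# "BAT" has 1 instance (1 correct).
records = [
    ("cold", "sense_A", "sense_A"),
    ("cold", "sense_A", "sense_A"),
    ("cold", "sense_B", "sense_B"),
    ("cold", "sense_B", "sense_A"),
    ("BAT", "sense_C", "sense_C"),
]
macro, micro = macro_micro_accuracy(records)
```

In this toy case the per-term accuracies are 0.75 and 1.0, so the macro accuracy is 0.875, while the micro accuracy is 4/5 = 0.8; terms with few instances influence the macro score more than the micro score.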
CONCLUSIONS: Apart from the disambiguation improvement and unsupervised training, deepBioWSD depends on considerably fewer expert-labeled data because it learns the target and the context terms jointly. These merits make deepBioWSD conveniently deployable in real-time biomedical applications.

Keywords: bidirectional long short-term memory network; biomedical text mining; deep neural networks; word sense disambiguation; zero-shot learning

Year: 2019; PMID: 30811548; PMCID: PMC7787358; DOI: 10.1093/jamia/ocy189

Source DB: PubMed; Journal: J Am Med Inform Assoc; ISSN: 1067-5027; Impact factor: 4.497
