Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Smoothing dense spaces for improved relation extraction between drugs and adverse reactions.

Literature DB >> 31160010

Smoothing dense spaces for improved relation extraction between drugs and adverse reactions.

Sara Santiso¹, Alicia Pérez², Arantza Casillas³.

Abstract

BACKGROUND AND
OBJECTIVE: This work aims at extracting Adverse Drug Reactions (ADRs), i.e. a harm directly caused by a drug at normal doses, from Electronic Health Records (EHRs). The lack of readily available EHRs because of confidentiality issues and their lexical variability make the ADR extraction challenging. Furthermore, ADRs are rare events. Therefore, efficient representations against data sparsity are needed.
METHODS: Embedding-based characterizations are able to group semantically related words. However, dense spaces suffer from data sparsity. We employed context-aware continuous representations to enhance the modelling of infrequent events through their context and we turned to simple smoothing techniques to increase the proximity between similar words (e.g. direction cosines, truncation, Principal Component Analysis (PCA) and clustering) in an attempt to cope with data sparsity.
RESULTS: An F-measure of 0.639 for the ADR classification was achieved, obtaining an improvement of approximately 0.300 in comparison with the results obtained by a word-based characterization.
CONCLUSION: The embbeding-based representation together with the smoothing techniques increased the robustness of the ADR characterization. It was proven particularly appropriate to cope with lexical variability and data sparsity.

Keywords: Data mining; Drug-related side effects and adverse reactions; Electronic health records; Natural language processing

Mesh：

Year: 2019 PMID： 31160010 DOI： 10.1016/j.ijmedinf.2019.05.009

Source DB: PubMed Journal: Int J Med Inform ISSN： 1386-5056 Impact factor: 4.046

Keyword Cloud
Cited

1 in total

Review 1. A Year of Papers Using Biomedical Texts.

Authors: Cyril Grouin; Natalia Grabar
Journal: Yearb Med Inform Date: 2020-08-21

1 in total