Literature DB >> 31160010

Smoothing dense spaces for improved relation extraction between drugs and adverse reactions.

Sara Santiso1, Alicia Pérez2, Arantza Casillas3.   

Abstract

BACKGROUND AND
OBJECTIVE: This work aims at extracting Adverse Drug Reactions (ADRs), i.e. a harm directly caused by a drug at normal doses, from Electronic Health Records (EHRs). The lack of readily available EHRs because of confidentiality issues and their lexical variability make the ADR extraction challenging. Furthermore, ADRs are rare events. Therefore, efficient representations against data sparsity are needed.
METHODS: Embedding-based characterizations are able to group semantically related words. However, dense spaces suffer from data sparsity. We employed context-aware continuous representations to enhance the modelling of infrequent events through their context and we turned to simple smoothing techniques to increase the proximity between similar words (e.g. direction cosines, truncation, Principal Component Analysis (PCA) and clustering) in an attempt to cope with data sparsity.
RESULTS: An F-measure of 0.639 for the ADR classification was achieved, obtaining an improvement of approximately 0.300 in comparison with the results obtained by a word-based characterization.
CONCLUSION: The embbeding-based representation together with the smoothing techniques increased the robustness of the ADR characterization. It was proven particularly appropriate to cope with lexical variability and data sparsity.
Copyright © 2019 Elsevier B.V. All rights reserved.

Keywords:  Data mining; Drug-related side effects and adverse reactions; Electronic health records; Natural language processing

Mesh:

Year:  2019        PMID: 31160010     DOI: 10.1016/j.ijmedinf.2019.05.009

Source DB:  PubMed          Journal:  Int J Med Inform        ISSN: 1386-5056            Impact factor:   4.046


  1 in total

Review 1.  A Year of Papers Using Biomedical Texts.

Authors:  Cyril Grouin; Natalia Grabar
Journal:  Yearb Med Inform       Date:  2020-08-21
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.