Literature DB >> 26141794

On the creation of a clinical gold standard corpus in Spanish: Mining adverse drug reactions.

Maite Oronoz1, Koldo Gojenola1, Alicia Pérez1, Arantza Díaz de Ilarraza1, Arantza Casillas2.   

Abstract

The advances achieved in Natural Language Processing make it possible to automatically mine information from electronically created documents. Many Natural Language Processing methods that extract information from texts make use of annotated corpora, but these are scarce in the clinical domain due to legal and ethical issues. In this paper we present the creation of the IxaMed-GS gold standard composed of real electronic health records written in Spanish and manually annotated by experts in pharmacology and pharmacovigilance. The experts mainly annotated entities related to diseases and drugs, but also relationships between entities indicating adverse drug reaction events. To help the experts in the annotation task, we adapted a general corpus linguistic analyzer to the medical domain. The quality of the annotation process in the IxaMed-GS corpus has been assessed by measuring the inter-annotator agreement, which was 90.53% for entities and 82.86% for events. In addition, the corpus has been used for the automatic extraction of adverse drug reaction events using machine learning.
Copyright © 2015 Elsevier Inc. All rights reserved.

Keywords:  Adverse drug reaction; Clinical text; Gold standard; Text mining

Mesh:

Substances:

Year:  2015        PMID: 26141794     DOI: 10.1016/j.jbi.2015.06.016

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  11 in total

1.  Clinical Natural Language Processing in 2015: Leveraging the Variety of Texts of Clinical Interest.

Authors:  A Névéol; P Zweigenbaum
Journal:  Yearb Med Inform       Date:  2016-11-10

2.  Recent advances in Swedish and Spanish medical entity recognition in clinical texts using deep neural approaches.

Authors:  Rebecka Weegar; Alicia Pérez; Arantza Casillas; Maite Oronoz
Journal:  BMC Med Inform Decis Mak       Date:  2019-12-23       Impact factor: 2.796

3.  A computational ecosystem to support eHealth Knowledge Discovery technologies in Spanish.

Authors:  Alejandro Piad-Morffis; Yoan Gutiérrez; Yudivian Almeida-Cruz; Rafael Muñoz
Journal:  J Biomed Inform       Date:  2020-07-24       Impact factor: 6.317

4.  RCorp: a resource for chemical disease semantic extraction in Chinese.

Authors:  Yueping Sun; Li Hou; Lu Qin; Yan Liu; Jiao Li; Qing Qian
Journal:  BMC Med Inform Decis Mak       Date:  2019-12-05       Impact factor: 2.796

5.  Evaluation of Natural Language Processing for the Identification of Crohn Disease-Related Variables in Spanish Electronic Health Records: A Validation Study for the PREMONITION-CD Project.

Authors:  Carmen Montoto; Javier P Gisbert; Iván Guerra; Rocío Plaza; Ramón Pajares Villarroya; Luis Moreno Almazán; María Del Carmen López Martín; Mercedes Domínguez Antonaya; Isabel Vera Mendoza; Jesús Aparicio; Vicente Martínez; Ignacio Tagarro; Alonso Fernandez-Nistal; Lea Canales; Sebastian Menke; Fernando Gomollón
Journal:  JMIR Med Inform       Date:  2022-02-18

6.  SemClinBr - a multi-institutional and multi-specialty semantically annotated corpus for Portuguese clinical NLP tasks.

Authors:  Lucas Emanuel Silva E Oliveira; Ana Carolina Peters; Adalniza Moura Pucca da Silva; Caroline Pilatti Gebeluca; Yohan Bonescki Gumiel; Lilian Mie Mukai Cintho; Deborah Ribeiro Carvalho; Sadid Al Hasan; Claudia Maria Cabral Moro
Journal:  J Biomed Semantics       Date:  2022-05-08

7.  Negation and uncertainty detection in clinical texts written in Spanish: a deep learning-based approach.

Authors:  Oswaldo Solarte Pabón; Orlando Montenegro; Maria Torrente; Alejandro Rodríguez González; Mariano Provencio; Ernestina Menasalvas
Journal:  PeerJ Comput Sci       Date:  2022-03-07

Review 8.  Clinical Natural Language Processing in languages other than English: opportunities and challenges.

Authors:  Aurélie Névéol; Hercules Dalianis; Sumithra Velupillai; Guergana Savova; Pierre Zweigenbaum
Journal:  J Biomed Semantics       Date:  2018-03-30

9.  Design of an extensive information representation scheme for clinical narratives.

Authors:  Louise Deléger; Leonardo Campillos; Anne-Laure Ligozat; Aurélie Névéol
Journal:  J Biomed Semantics       Date:  2017-09-11

10.  Annotation and detection of drug effects in text for pharmacovigilance.

Authors:  Paul Thompson; Sophia Daikou; Kenju Ueno; Riza Batista-Navarro; Jun'ichi Tsujii; Sophia Ananiadou
Journal:  J Cheminform       Date:  2018-08-13       Impact factor: 5.514

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.