Literature DB >> 31150777

Supervised methods to extract clinical events from cardiology reports in Italian.

Natalia Viani1, Timothy A Miller2, Carlo Napolitano3, Silvia G Priori4, Guergana K Savova2, Riccardo Bellazzi5, Lucia Sacchi6.   

Abstract

Clinical narratives are a valuable source of information for both patient care and biomedical research. Given the unstructured nature of medical reports, specific automatic techniques are required to extract relevant entities from such texts. In the natural language processing (NLP) community, this task is often addressed by using supervised methods. To develop such methods, both reliably-annotated corpora and elaborately designed features are needed. Despite the recent advances on corpora collection and annotation, research on multiple domains and languages is still limited. In addition, to compute the features required for supervised classification, suitable language- and domain-specific tools are needed. In this work, we propose a novel application of recurrent neural networks (RNNs) for event extraction from medical reports written in Italian. To train and evaluate the proposed approach, we annotated a corpus of 75 cardiology reports for a total of 4365 mentions of relevant events and their attributes (e.g., the polarity). For the annotation task, we developed specific annotation guidelines, which are provided together with this paper. The RNN-based classifier was trained on a training set including 3335 events (60 documents). The resulting model was integrated into an NLP pipeline that uses a dictionary lookup approach to search for relevant concepts inside the text. A test set of 1030 events (15 documents) was used to evaluate and compare different pipeline configurations. As a main result, using the RNN-based classifier instead of the dictionary lookup approach allowed increasing recall from 52.4% to 88.9%, and precision from 81.1% to 88.2%. Further, using the two methods in combination, we obtained final recall, precision, and F1 score of 91.7%, 88.6%, and 90.1%, respectively. These experiments indicate that integrating a well-performing RNN-based classifier with a standard knowledge-based approach can be a good strategy to extract information from clinical text in non-English languages.
Copyright © 2019 Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Information extraction; Natural language processing; Neural networks

Year:  2019        PMID: 31150777      PMCID: PMC6948016          DOI: 10.1016/j.jbi.2019.103219

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  20 in total

1.  Natural language processing and its future in medicine.

Authors:  C Friedman; G Hripcsak
Journal:  Acad Med       Date:  1999-08       Impact factor: 6.893

2.  Agreement, the f-measure, and reliability in information retrieval.

Authors:  George Hripcsak; Adam S Rothschild
Journal:  J Am Med Inform Assoc       Date:  2005-01-31       Impact factor: 4.497

3.  Anafora: A Web-based General Purpose Annotation Tool.

Authors:  Wei-Te Chen; Will Styler
Journal:  Proc Conf       Date:  2013-06

4.  2010 i2b2/VA challenge on concepts, assertions, and relations in clinical text.

Authors:  Özlem Uzuner; Brett R South; Shuying Shen; Scott L DuVall
Journal:  J Am Med Inform Assoc       Date:  2011-06-16       Impact factor: 4.497

5.  Overcoming barriers to NLP for clinical text: the role of shared tasks and the need for additional creative solutions.

Authors:  Wendy W Chapman; Prakash M Nadkarni; Lynette Hirschman; Leonard W D'Avolio; Guergana K Savova; Ozlem Uzuner
Journal:  J Am Med Inform Assoc       Date:  2011 Sep-Oct       Impact factor: 4.497

6.  Long short-term memory.

Authors:  S Hochreiter; J Schmidhuber
Journal:  Neural Comput       Date:  1997-11-15       Impact factor: 2.026

7.  An enhanced CRFs-based system for information extraction from radiology reports.

Authors:  Andrea Esuli; Diego Marcheggiani; Fabrizio Sebastiani
Journal:  J Biomed Inform       Date:  2013-02-11       Impact factor: 6.317

8.  ConText: an algorithm for determining negation, experiencer, and temporal status from clinical reports.

Authors:  Henk Harkema; John N Dowling; Tyler Thornblade; Wendy W Chapman
Journal:  J Biomed Inform       Date:  2009-05-10       Impact factor: 6.317

Review 9.  Evaluating temporal relations in clinical text: 2012 i2b2 Challenge.

Authors:  Weiyi Sun; Anna Rumshisky; Ozlem Uzuner
Journal:  J Am Med Inform Assoc       Date:  2013-04-05       Impact factor: 4.497

10.  Evaluating the state of the art in disorder recognition and normalization of the clinical narrative.

Authors:  Sameer Pradhan; Noémie Elhadad; Brett R South; David Martinez; Lee Christensen; Amy Vogel; Hanna Suominen; Wendy W Chapman; Guergana Savova
Journal:  J Am Med Inform Assoc       Date:  2014-08-21       Impact factor: 4.497

View more
  2 in total

Review 1.  Systematic review of current natural language processing methods and applications in cardiology.

Authors:  Meghan Reading Turchioe; Alexander Volodarskiy; Jyotishman Pathak; Drew N Wright; James Enlou Tcheng; David Slotwiner
Journal:  Heart       Date:  2022-05-25       Impact factor: 7.365

Review 2.  A Year of Papers Using Biomedical Texts.

Authors:  Cyril Grouin; Natalia Grabar
Journal:  Yearb Med Inform       Date:  2020-08-21
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.