Use of "off-the-shelf" information extraction algorithms in clinical informatics: A feasibility study of MetaMap annotation of Italian medical notes.

Emma Chiaramello, Francesco Pinciroli, Alberico Bonalumi, Angelo Caroli, Gabriella Tognola.

Abstract

Information extraction from narrative clinical notes is useful for patient care, as well as for secondary use of medical data for research or clinical purposes. Many studies have focused on information extraction from English clinical texts, but fewer have dealt with clinical notes in languages other than English. This study tested the feasibility of using "off-the-shelf" information extraction algorithms to identify medical concepts in Italian clinical notes. Among the available and well-established information extraction algorithms, we used MetaMap to map medical concepts to the Unified Medical Language System (UMLS). The study addressed two questions: (Q1) whether medical terms found in clinical notes and belonging to the "Disorders" semantic group can be properly mapped to the Italian UMLS resources; (Q2) whether it is feasible to use MetaMap as it is to extract these medical concepts from Italian clinical notes. We performed three experiments: in EXP1, we investigated how many medical concepts of the "Disorders" semantic group found in a set of clinical notes written in Italian could be mapped to the UMLS Italian medical sources; in EXP2, we assessed how well the different processing steps used by MetaMap, which are English-dependent, could be applied to Italian texts to map the original clinical notes to the Italian UMLS sources; in EXP3, we automatically translated the clinical notes from Italian to English using Google Translator and then used MetaMap to map the translated texts. Results of EXP1 showed that the Italian UMLS Metathesaurus sources covered 91% of the medical terms of the "Disorders" semantic group found in the studied dataset. We observed that even though MetaMap was built to analyze texts written in English, most of its processing steps also worked properly with texts written in Italian. MetaMap correctly identified about half of the concepts in the Italian clinical notes.
Using MetaMap's annotation on Italian clinical notes instead of a simple text search improved our results by about 15 percentage points. MetaMap's annotation of Italian clinical notes showed recall, precision and F-measure of 0.53, 0.98 and 0.69, respectively. Most of the failures were due to MetaMap's inability to generate meaningful variants for the Italian language, suggesting that modifying MetaMap to generate Italian variants could improve performance. MetaMap's performance in annotating automatically translated English clinical notes was in line with findings in the literature, with similar recall (0.75) and F-measure (0.83) and even higher precision (0.95). Most of the failures were due to poor Italian-to-English translation of medical terms, suggesting that an automatic translation tool specialized in medical concepts might yield better performance. In conclusion, the performance obtained using MetaMap on the fully automatic translation of the Italian text is good enough to allow MetaMap to be used "as it is" in clinical practice.
Copyright © 2016 Elsevier Inc. All rights reserved.
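
The recall, precision and F-measure values quoted in the abstract can be related to one another with a short calculation. The sketch below assumes the F-measure is the balanced F1 score (the harmonic mean of precision and recall); the abstract does not state which variant was used, and the function name `f_measure` is illustrative only.

```python
def f_measure(precision: float, recall: float) -> float:
    """Balanced F-measure (F1): harmonic mean of precision and recall.

    Assumption: the abstract's "F-measure" is F1; this is not stated there.
    """
    if precision + recall == 0:
        return 0.0  # avoid division by zero when nothing was retrieved
    return 2 * precision * recall / (precision + recall)


# Precision and recall as reported in the abstract.
italian = f_measure(precision=0.98, recall=0.53)
translated = f_measure(precision=0.95, recall=0.75)

print(f"Italian notes:    F = {italian:.3f}")
print(f"Translated notes: F = {translated:.3f}")
```

Under this assumption, the recomputed values agree with the reported 0.69 and 0.83 to within rounding of the published precision and recall figures.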

Keywords:  EHR data reuse; Failure analysis; Information extraction; Italian language; MetaMap; Unstructured clinical notes

Year:  2016        PMID: 27444186     DOI: 10.1016/j.jbi.2016.07.017

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


