Literature DB >> 31445289

Interpretable deep learning to map diagnostic texts to ICD-10 codes.

Aitziber Atutxa1, Arantza Díaz de Ilarraza2, Koldo Gojenola3, Maite Oronoz4, Olatz Perez-de-Viñaspre5.   

Abstract

BACKGROUND: Automatic extraction of morbid disease or conditions contained in Death Certificates is a critical process, useful for billing, epidemiological studies and comparison across countries. The fact that these clinical documents are written in regular natural language makes the automatic coding process difficult because, often, spontaneous terms diverge strongly from standard reference terminology such as the International Classification of Diseases (ICD).
OBJECTIVE: Our aim is to propose a general and multilingual approach to render Diagnostic Terms into the standard framework provided by the ICD. We have evaluated our proposal on a set of clinical texts written in French, Hungarian and Italian.
METHODS: ICD-10 encoding is a multi-class classification problem with an extensive (thousands) number of classes. After considering several approaches, we tackle our objective as a sequence-to-sequence task. According to current trends, we opted to use neural networks. We tested different types of neural architectures on three datasets in which Diagnostic Terms (DTs) have their ICD-10 codes associated. RESULTS AND
CONCLUSIONS: Our results give a new state-of-the art on multilingual ICD-10 coding, outperforming several alternative approaches, and showing the feasibility of automatic ICD-10 prediction obtaining an F-measure of 0.838, 0.963 and 0.952 for French, Hungarian and Italian, respectively. Additionally, the results are interpretable, providing experts with supporting evidence when confronted with coding decisions, as the model is able to show the alignments between the original text and each output code.
Copyright © 2019 Elsevier B.V. All rights reserved.

Entities:  

Keywords:  Electronic health records; International Classification of Diseases; Neural machine translation; Sequence-to-sequence mapping

Mesh:

Year:  2019        PMID: 31445289     DOI: 10.1016/j.ijmedinf.2019.05.015

Source DB:  PubMed          Journal:  Int J Med Inform        ISSN: 1386-5056            Impact factor:   4.046


  3 in total

1.  Automated Coding of Under-Studied Medical Concept Domains: Linking Physical Activity Reports to the International Classification of Functioning, Disability, and Health.

Authors:  Denis Newman-Griffis; Eric Fosler-Lussier
Journal:  Front Digit Health       Date:  2021-03-10

2.  Neural Translation and Automated Recognition of ICD-10 Medical Entities From Natural Language: Model Development and Performance Assessment.

Authors:  Louis Falissard; Claire Morgand; Walid Ghosn; Claire Imbaud; Karim Bounebache; Grégoire Rey
Journal:  JMIR Med Inform       Date:  2022-04-11

3.  Automatic Identification of Patients With Unexplained Left Ventricular Hypertrophy in Electronic Health Record Data to Improve Targeted Treatment and Family Screening.

Authors:  Arjan Sammani; Mark Jansen; Nynke M de Vries; Nicolaas de Jonge; Annette F Baas; Anneline S J M Te Riele; Folkert W Asselbergs; Marish I F J Oerlemans
Journal:  Front Cardiovasc Med       Date:  2022-04-15
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.