Literature DB >> 33637859

Automatic multilabel detection of ICD10 codes in Dutch cardiology discharge letters using neural networks.

Arjan Sammani1, Ayoub Bagheri2,3, Peter G M van der Heijden3,4, Anneline S J M Te Riele2, Annette F Baas5, C A J Oosters6, Daniel Oberski3, Folkert W Asselbergs2,7,8.   

Abstract

Standard reference terminology of diagnoses and risk factors is crucial for billing, epidemiological studies, and inter/intranational comparisons of diseases. The International Classification of Disease (ICD) is a standardized and widely used method, but the manual classification is an enormously time-consuming endeavor. Natural language processing together with machine learning allows automated structuring of diagnoses using ICD-10 codes, but the limited performance of machine learning models, the necessity of gigantic datasets, and poor reliability of terminal parts of these codes restricted clinical usability. We aimed to create a high performing pipeline for automated classification of reliable ICD-10 codes in the free medical text in cardiology. We focussed on frequently used and well-defined three- and four-digit ICD-10 codes that still have enough granularity to be clinically relevant such as atrial fibrillation (I48), acute myocardial infarction (I21), or dilated cardiomyopathy (I42.0). Our pipeline uses a deep neural network known as a Bidirectional Gated Recurrent Unit Neural Network and was trained and tested with 5548 discharge letters and validated in 5089 discharge and procedural letters. As in clinical practice discharge letters may be labeled with more than one code, we assessed the single- and multilabel performance of main diagnoses and cardiovascular risk factors. We investigated using both the entire body of text and only the summary paragraph, supplemented by age and sex. Given the privacy-sensitive information included in discharge letters, we added a de-identification step. The performance was high, with F1 scores of 0.76-0.99 for three-character and 0.87-0.98 for four-character ICD-10 codes, and was best when using complete discharge letters. Adding variables age/sex did not affect results. For model interpretability, word coefficients were provided and qualitative assessment of classification was manually performed. Because of its high performance, this pipeline can be useful to decrease the administrative burden of classifying discharge diagnoses and may serve as a scaffold for reimbursement and research applications.

Entities:  

Year:  2021        PMID: 33637859     DOI: 10.1038/s41746-021-00404-9

Source DB:  PubMed          Journal:  NPJ Digit Med        ISSN: 2398-6352


  2 in total

1.  Automatic ICD Code Assignment based on ICD's Hierarchy Structure for Chinese Electronic Medical Records.

Authors:  Lingyu Cao; Dazhong Gu; Yuan Ni; Guotong Xie
Journal:  AMIA Jt Summits Transl Sci Proc       Date:  2019-05-06

2.  Exploring the Privacy-Preserving Properties of Word Embeddings: Algorithmic Validation Study.

Authors:  Mohamed Abdalla; Moustafa Abdalla; Graeme Hirst; Frank Rudzicz
Journal:  J Med Internet Res       Date:  2020-07-15       Impact factor: 5.428

  2 in total
  3 in total

1.  Development of a Pipeline for Adverse Drug Reaction Identification in Clinical Notes: Word Embedding Models and String Matching.

Authors:  Marco Spruit; N Charlotte Onland-Moret; Klaske R Siegersma; Maxime Evers; Sophie H Bots; Floor Groepenhoff; Yolande Appelman; Leonard Hofstra; Igor I Tulevski; G Aernout Somsen; Hester M den Ruijter
Journal:  JMIR Med Inform       Date:  2022-01-25

2.  Automatic Identification of Patients With Unexplained Left Ventricular Hypertrophy in Electronic Health Record Data to Improve Targeted Treatment and Family Screening.

Authors:  Arjan Sammani; Mark Jansen; Nynke M de Vries; Nicolaas de Jonge; Annette F Baas; Anneline S J M Te Riele; Folkert W Asselbergs; Marish I F J Oerlemans
Journal:  Front Cardiovasc Med       Date:  2022-04-15

3.  Automatic Prediction of Recurrence of Major Cardiovascular Events: A Text Mining Study Using Chest X-Ray Reports.

Authors:  Ayoub Bagheri; T Katrien J Groenhof; Folkert W Asselbergs; Saskia Haitjema; Michiel L Bots; Wouter B Veldhuis; Pim A de Jong; Daniel L Oberski
Journal:  J Healthc Eng       Date:  2021-07-09       Impact factor: 2.682

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.