Literature DB >> 29331250

Inferred joint multigram models for medical term normalization according to ICD.

Alicia Pérez1, Aitziber Atutxa2, Arantza Casillas3, Koldo Gojenola4, Álvaro Sellart5.   

Abstract

BACKGROUND: Electronic Health Records (EHRs) are written using spontaneous natural language. Often, terms do not match standard terminology like the one available through the International Classification of Diseases (ICD).
OBJECTIVE: Information retrieval and exchange can be improved using standard terminology. Our aim is to render diagnostic terms written in spontaneous language in EHRs into the standard framework provided by the ICD.
METHODS: We tackle diagnostic term normalization employing Weighted Finite-State Transducers (WFSTs). These machines learn how to translate sequences, in the case of our concern, spontaneous representations into standard representations given a set of samples. They are highly flexible and easily adaptable to terminological singularities of each different hospital and practitioner. Besides, we implemented a similarity metric to enhance spontaneous-standard term matching.
RESULTS: From the 2850 spontaneous DTs randomly selected we found that only 7.71% were written in their standard form matching the ICD. This WFST-based system enabled matching spontaneous ICDs with a Mean Reciprocal Rank of 0.68, which means that, on average, the right ICD code is found between the first and second position among the normalized set of candidates. This guarantees efficient document exchange and, furthermore, information retrieval.
CONCLUSION: Medical term normalization was achieved with high performance. We found that direct matching of spontaneous terms using standard lexicons leads to unsatisfactory results while normalized hypothesis generation by means of WFST helped to overcome the gap between spontaneous and standard language.
Copyright © 2017 Elsevier B.V. All rights reserved.

Entities:  

Keywords:  Electronic Health Records; Finite State Models; International Classification of Diseases; Normalization

Mesh:

Year:  2017        PMID: 29331250     DOI: 10.1016/j.ijmedinf.2017.12.007

Source DB:  PubMed          Journal:  Int J Med Inform        ISSN: 1386-5056            Impact factor:   4.046


  4 in total

1.  Automated Misspelling Detection and Correction in Persian Clinical Text.

Authors:  Azita Yazdani; Marjan Ghazisaeedi; Nasrin Ahmadinejad; Masoumeh Giti; Habibe Amjadi; Azin Nahvijou
Journal:  J Digit Imaging       Date:  2020-06       Impact factor: 4.056

2.  Improving the Path from Diagnoses to Documentation: A Cognitive Review Tool for Clinical Notes and Administrative Records.

Authors:  Yufan Guo; Joy Wu; Tyler Baldwin; David Beymer; Vandana V Mukherjee; Tanveer F Syeda-Mahmood
Journal:  AMIA Annu Symp Proc       Date:  2018-12-05

3.  Findings from the 2019 International Medical Informatics Association Yearbook Section on Health Information Management.

Authors:  Meryl Bloomrosen; Eta S Berner
Journal:  Yearb Med Inform       Date:  2019-08-16

4.  Natural language processing algorithms for mapping clinical text fragments onto ontology concepts: a systematic review and recommendations for future studies.

Authors:  Martijn G Kersloot; Florentien J P van Putten; Ameen Abu-Hanna; Ronald Cornet; Derk L Arts
Journal:  J Biomed Semantics       Date:  2020-11-16
  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.