Literature DB >> 31947234

UMLS mapping and Word embeddings for ICD code assignment using the MIMIC-III intensive care database.

Henning Schafer, Christoph M Friedrich.   

Abstract

Diagnosis codes are used as a billing mechanism in the Electronic Health Record and have the capability to benefit decision support systems, which aim to assist coders by suggesting a relevant subset of potential codes to choose from. Due to the large set of possible labels and length of patient records, automatic ICD code assignment is considered to be a challenging task within the field of multi-label classification. This paper introduces a baseline for automatic ICD code assignment using Support Vector Machines (SVM) and FastText with Unified Medical Language System (UMLS) metathesaurus mappings into word embedding models. Training data is obtained from the Medical Information Mart for Intensive Care (MIMIC-III) database and extended with 'is-a' relationships from ICD-9 hierarchy. FastText is evaluated with different label count estimations, of which an approach based on label cardinality yields a F1-Score of 62.2%. FastText achieves high recall results and mentionable performance improvements over previous models. Reported values are obtained through 10-fold cross-validation.

Entities:  

Mesh:

Year:  2019        PMID: 31947234     DOI: 10.1109/EMBC.2019.8856442

Source DB:  PubMed          Journal:  Conf Proc IEEE Eng Med Biol Soc        ISSN: 1557-170X


  1 in total

1.  An efficient modular framework for automatic LIONC classification of MedIMG using unified medical language.

Authors:  Surbhi Bhatia; Mohammed Alojail; Sudhakar Sengan; Pankaj Dadheech
Journal:  Front Public Health       Date:  2022-08-10
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.