Literature DB >> 29928997

Prediction task guided representation learning of medical codes in EHR.

Liwen Cui1, Xiaolei Xie2, Zuojun Shen3.   

Abstract

There have been rapidly growing applications using machine learning models for predictive analytics in Electronic Health Records (EHR) to improve the quality of hospital services and the efficiency of healthcare resource utilization. A fundamental and crucial step in developing such models is to convert medical codes in EHR to feature vectors. These medical codes are used to represent diagnoses or procedures. Their vector representations have a tremendous impact on the performance of machine learning models. Recently, some researchers have utilized representation learning methods from Natural Language Processing (NLP) to learn vector representations of medical codes. However, most previous approaches are unsupervised, i.e. the generation of medical code vectors is independent from prediction tasks. Thus, the obtained feature vectors may be inappropriate for a specific prediction task. Moreover, unsupervised methods often require a lot of samples to obtain reliable results, but most practical problems have very limited patient samples. In this paper, we develop a new method called Prediction Task Guided Health Record Aggregation (PTGHRA), which aggregates health records guided by prediction tasks, to construct training corpus for various representation learning models. Compared with unsupervised approaches, representation learning models integrated with PTGHRA yield a significant improvement in predictive capability of generated medical code vectors, especially for limited training samples.
Copyright © 2018. Published by Elsevier Inc.

Entities:  

Keywords:  Electronic health records; Healthcare resource utilization; Medical code; Natural language processing; Representation learning; Word embedding

Mesh:

Year:  2018        PMID: 29928997     DOI: 10.1016/j.jbi.2018.06.013

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  3 in total

1.  Findings from the 2019 International Medical Informatics Association Yearbook Section on Health Information Management.

Authors:  Meryl Bloomrosen; Eta S Berner
Journal:  Yearb Med Inform       Date:  2019-08-16

2.  Representation learning for clinical time series prediction tasks in electronic health records.

Authors:  Tong Ruan; Liqi Lei; Yangming Zhou; Jie Zhai; Le Zhang; Ping He; Ju Gao
Journal:  BMC Med Inform Decis Mak       Date:  2019-12-17       Impact factor: 2.796

3.  Improving the Performance of Outcome Prediction for Inpatients With Acute Myocardial Infarction Based on Embedding Representation Learned From Electronic Medical Records: Development and Validation Study.

Authors:  Yanqun Huang; Zhimin Zheng; Moxuan Ma; Xin Xin; Honglei Liu; Xiaolu Fei; Lan Wei; Hui Chen
Journal:  J Med Internet Res       Date:  2022-08-03       Impact factor: 7.076

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.