Literature DB >> 27919732

Learning from heterogeneous temporal data in electronic health records.

Jing Zhao1, Panagiotis Papapetrou2, Lars Asker3, Henrik Boström4.   

Abstract

Electronic health records contain large amounts of longitudinal data that are valuable for biomedical informatics research. The application of machine learning is a promising alternative to manual analysis of such data. However, the complex structure of the data, which includes clinical events that are unevenly distributed over time, poses a challenge for standard learning algorithms. Some approaches to modeling temporal data rely on extracting single values from time series; however, this leads to the loss of potentially valuable sequential information. How to better account for the temporality of clinical data, hence, remains an important research question. In this study, novel representations of temporal data in electronic health records are explored. These representations retain the sequential information, and are directly compatible with standard machine learning algorithms. The explored methods are based on symbolic sequence representations of time series data, which are utilized in a number of different ways. An empirical investigation, using 19 datasets comprising clinical measurements observed over time from a real database of electronic health records, shows that using a distance measure to random subsequences leads to substantial improvements in predictive performance compared to using the original sequences or clustering the sequences. Evidence is moreover provided on the quality of the symbolic sequence representation by comparing it to sequences that are generated using domain knowledge by clinical experts. The proposed method creates representations that better account for the temporality of clinical events, which is often key to prediction tasks in the biomedical domain.
Copyright © 2016 The Author(s). Published by Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Data mining; Electronic health records; Machine learning; Random subsequence; Time series classification

Mesh:

Year:  2016        PMID: 27919732     DOI: 10.1016/j.jbi.2016.11.006

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  14 in total

1.  Modeling asynchronous event sequences with RNNs.

Authors:  Stephen Wu; Sijia Liu; Sunghwan Sohn; Sungrim Moon; Chung-Il Wi; Young Juhn; Hongfang Liu
Journal:  J Biomed Inform       Date:  2018-06-05       Impact factor: 6.317

2.  Unsupervised characterization of Major Depressive Disorder medication treatment pathways.

Authors:  Barrett Jones; Colin G Walsh
Journal:  AMIA Annu Symp Proc       Date:  2022-02-21

3.  Performing an Informatics Consult: Methods and Challenges.

Authors:  Alejandro Schuler; Alison Callahan; Kenneth Jung; Nigam H Shah
Journal:  J Am Coll Radiol       Date:  2018-02-13       Impact factor: 5.532

4.  Data Quality of Chemotherapy-Induced Nausea and Vomiting Documentation.

Authors:  Melissa Beauchemin; Chunhua Weng; Lillian Sung; Adrienne Pichon; Maura Abbott; Dawn L Hershman; Rebecca Schnall
Journal:  Appl Clin Inform       Date:  2021-04-21       Impact factor: 2.342

5.  High-throughput phenotyping with temporal sequences.

Authors:  Hossein Estiri; Zachary H Strasser; Shawn N Murphy
Journal:  J Am Med Inform Assoc       Date:  2021-03-18       Impact factor: 4.497

6.  A classification framework for exploiting sparse multi-variate temporal features with application to adverse drug event detection in medical records.

Authors:  Francesco Bagattini; Isak Karlsson; Jonathan Rebane; Panagiotis Papapetrou
Journal:  BMC Med Inform Decis Mak       Date:  2019-01-10       Impact factor: 2.796

Review 7.  Clinical Information Systems and Artificial Intelligence: Recent Research Trends.

Authors:  Carlo Combi; Giuseppe Pozzi
Journal:  Yearb Med Inform       Date:  2019-08-16

Review 8.  Another Round of "Clue" to Uncover the Mystery of Complex Traits.

Authors:  Shefali Setia Verma; Marylyn D Ritchie
Journal:  Genes (Basel)       Date:  2018-01-25       Impact factor: 4.096

9.  Using case-level context to classify cancer pathology reports.

Authors:  Shang Gao; Mohammed Alawad; Noah Schaefferkoetter; Lynne Penberthy; Xiao-Cheng Wu; Eric B Durbin; Linda Coyle; Arvind Ramanathan; Georgia Tourassi
Journal:  PLoS One       Date:  2020-05-12       Impact factor: 3.240

10.  Influence of medical domain knowledge on deep learning for Alzheimer's disease prediction.

Authors:  Branimir Ljubic; Shoumik Roychoudhury; Xi Hang Cao; Martin Pavlovski; Stefan Obradovic; Richard Nair; Lucas Glass; Zoran Obradovic
Journal:  Comput Methods Programs Biomed       Date:  2020-09-20       Impact factor: 5.428

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.