Literature DB >> 27896976

MISSING DATA IMPUTATION IN THE ELECTRONIC HEALTH RECORD USING DEEPLY LEARNED AUTOENCODERS.

Brett K Beaulieu-Jones1, Jason H Moore.   

Abstract

Electronic health records (EHRs) have become a vital source of patient outcome data but the widespread prevalence of missing data presents a major challenge. Different causes of missing data in the EHR data may introduce unintentional bias. Here, we compare the effectiveness of popular multiple imputation strategies with a deeply learned autoencoder using the Pooled Resource Open-Access ALS Clinical Trials Database (PRO-ACT). To evaluate performance, we examined imputation accuracy for known values simulated to be either missing completely at random or missing not at random. We also compared ALS disease progression prediction across different imputation models. Autoencoders showed strong performance for imputation accuracy and contributed to the strongest disease progression predictor. Finally, we show that despite clinical heterogeneity, ALS disease progression appears homogenous with time from onset being the most important predictor.

Entities:  

Mesh:

Year:  2017        PMID: 27896976      PMCID: PMC5144587          DOI: 10.1142/9789813207813_0021

Source DB:  PubMed          Journal:  Pac Symp Biocomput        ISSN: 2335-6928


  17 in total

1.  Crowdsourced analysis of clinical trial data to predict amyotrophic lateral sclerosis progression.

Authors:  Robert Küffner; Neta Zach; Raquel Norel; Johann Hawe; David Schoenfeld; Liuxia Wang; Guang Li; Lilly Fang; Lester Mackey; Orla Hardiman; Merit Cudkowicz; Alexander Sherman; Gokhan Ertaylan; Moritz Grosse-Wentrup; Torsten Hothorn; Jules van Ligtenberg; Jakob H Macke; Timm Meyer; Bernhard Schölkopf; Linh Tran; Rubio Vaughan; Gustavo Stolovitzky; Melanie L Leitner
Journal:  Nat Biotechnol       Date:  2014-11-02       Impact factor: 54.908

2.  Projected gradient methods for nonnegative matrix factorization.

Authors:  Chih-Jen Lin
Journal:  Neural Comput       Date:  2007-10       Impact factor: 2.026

3.  ALSFRS-R score and its ratio: a useful predictor for ALS-progression.

Authors:  Katja Kollewe; Ulrike Mauss; Klaus Krampfl; Susanne Petri; Reinhard Dengler; Bahram Mohammadi
Journal:  J Neurol Sci       Date:  2008-08-21       Impact factor: 3.181

Review 4.  ALS motor phenotype heterogeneity, focality, and spread: deconstructing motor neuron degeneration.

Authors:  John M Ravits; Albert R La Spada
Journal:  Neurology       Date:  2009-09-08       Impact factor: 9.910

Review 5.  Clinical and genetic heterogeneity of amyotrophic lateral sclerosis.

Authors:  M Sabatelli; A Conte; M Zollino
Journal:  Clin Genet       Date:  2013-03-12       Impact factor: 4.438

6.  Performance of the Amyotrophic Lateral Sclerosis Functional Rating Scale (ALSFRS) in multicenter clinical trials.

Authors:  J M Cedarbaum; N Stambler
Journal:  J Neurol Sci       Date:  1997-10       Impact factor: 3.181

7.  Riluzole and amyotrophic lateral sclerosis survival: a population-based study in southern Italy.

Authors:  S Zoccolella; E Beghi; G Palagano; A Fraddosio; V Guerra; V Samarelli; V Lepore; I L Simone; P Lamberti; L Serlenga; G Logroscino
Journal:  Eur J Neurol       Date:  2007-03       Impact factor: 6.089

8.  Forced vital capacity (FVC) as an indicator of survival and disease progression in an ALS clinic population.

Authors:  A Czaplinski; A A Yen; S H Appel
Journal:  J Neurol Neurosurg Psychiatry       Date:  2006-03       Impact factor: 10.154

9.  Strategies for handling missing data in electronic health record derived data.

Authors:  Brian J Wells; Kevin M Chagin; Amy S Nowacki; Michael W Kattan
Journal:  EGEMS (Wash DC)       Date:  2013-12-17

10.  Deep Patient: An Unsupervised Representation to Predict the Future of Patients from the Electronic Health Records.

Authors:  Riccardo Miotto; Li Li; Brian A Kidd; Joel T Dudley
Journal:  Sci Rep       Date:  2016-05-17       Impact factor: 4.379

View more
  29 in total

1.  Unsupervised Extraction of Stable Expression Signatures from Public Compendia with an Ensemble of Neural Networks.

Authors:  Jie Tan; Georgia Doing; Kimberley A Lewis; Courtney E Price; Kathleen M Chen; Kyle C Cady; Barret Perchuk; Michael T Laub; Deborah A Hogan; Casey S Greene
Journal:  Cell Syst       Date:  2017-07-12       Impact factor: 10.304

2.  Integration of genetic and clinical information to improve imputation of data missing from electronic health records.

Authors:  Ruowang Li; Yong Chen; Jason H Moore
Journal:  J Am Med Inform Assoc       Date:  2019-10-01       Impact factor: 4.497

Review 3.  Insights into Computational Drug Repurposing for Neurodegenerative Disease.

Authors:  Manish D Paranjpe; Alice Taubes; Marina Sirota
Journal:  Trends Pharmacol Sci       Date:  2019-07-17       Impact factor: 14.819

4.  Model-Based and Model-Free Techniques for Amyotrophic Lateral Sclerosis Diagnostic Prediction and Patient Clustering.

Authors:  Ming Tang; Chao Gao; Stephen A Goutman; Alexandr Kalinin; Bhramar Mukherjee; Yuanfang Guan; Ivo D Dinov
Journal:  Neuroinformatics       Date:  2019-07

Review 5.  Informatics and machine learning to define the phenotype.

Authors:  Anna Okula Basile; Marylyn DeRiggi Ritchie
Journal:  Expert Rev Mol Diagn       Date:  2018-02-16       Impact factor: 5.225

6.  The emerging landscape of health research based on biobanks linked to electronic health records: Existing resources, statistical challenges, and potential opportunities.

Authors:  Lauren J Beesley; Maxwell Salvatore; Lars G Fritsche; Anita Pandit; Arvind Rao; Chad Brummett; Cristen J Willer; Lynda D Lisabeth; Bhramar Mukherjee
Journal:  Stat Med       Date:  2019-12-20       Impact factor: 2.373

7.  Flexible, cluster-based analysis of the electronic medical record of sepsis with composite mixture models.

Authors:  Michael B Mayhew; Brenden K Petersen; Ana Paula Sales; John D Greene; Vincent X Liu; Todd S Wasson
Journal:  J Biomed Inform       Date:  2017-12-02       Impact factor: 6.317

Review 8.  Preparing next-generation scientists for biomedical big data: artificial intelligence approaches.

Authors:  Jason H Moore; Mary Regina Boland; Pablo G Camara; Hannah Chervitz; Graciela Gonzalez; Blanca E Himes; Dokyoon Kim; Danielle L Mowery; Marylyn D Ritchie; Li Shen; Ryan J Urbanowicz; John H Holmes
Journal:  Per Med       Date:  2019-02-14       Impact factor: 2.512

9.  Predicting Missing Values in Medical Data via XGBoost Regression.

Authors:  Xinmeng Zhang; Chao Yan; Cheng Gao; Bradley A Malin; You Chen
Journal:  J Healthc Inform Res       Date:  2020-08-03

10.  Genomic data imputation with variational auto-encoders.

Authors:  Yeping Lina Qiu; Hong Zheng; Olivier Gevaert
Journal:  Gigascience       Date:  2020-08-01       Impact factor: 6.524

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.