Literature DB >> 34891747

Autopopulus: A Novel Framework for Autoencoder Imputation on Large Clinical Datasets.

Davina J Zamanzadeh, Panayiotis Petousis, Tyler A Davis, Susanne B Nicholas, Keith C Norris, Katherine R Tuttle, Alex A T Bui, Majid Sarrafzadeh.   

Abstract

The adoption of electronic health records (EHRs) has made patient data increasingly accessible, precipitating the development of various clinical decision support systems and data-driven models to help physicians. However, missing data are common in EHR-derived datasets, which can introduce significant uncertainty, if not invalidating the use of a predictive model. Machine learning (ML)-based imputation methods have shown promise in various domains for the task of estimating values and reducing uncertainty to the point that a predictive model can be employed. We introduce Autopopulus, a novel framework that enables the design and evaluation of various autoencoder architectures for efficient imputation on large datasets. Autopopulus implements existing autoencoder methods as well as a new technique that outputs a range of estimated values (rather than point estimates), and demonstrates a workflow that helps users make an informed decision on an appropriate imputation method. To further illustrate Autopopulus' utility, we use it to identify not only which imputation methods can most accurately impute on a large clinical dataset, but to also identify the imputation methods that enable downstream predictive models to achieve the best performance for prediction of chronic kidney disease (CKD) progression.

Entities:  

Mesh:

Year:  2021        PMID: 34891747      PMCID: PMC8862635          DOI: 10.1109/EMBC46164.2021.9630135

Source DB:  PubMed          Journal:  Annu Int Conf IEEE Eng Med Biol Soc        ISSN: 2375-7477


  7 in total

1.  Discretization of continuous features in clinical datasets.

Authors:  David M Maslove; Tanya Podchiyska; Henry J Lowe
Journal:  J Am Med Inform Assoc       Date:  2012-10-11       Impact factor: 4.497

2.  MISSING DATA IMPUTATION IN THE ELECTRONIC HEALTH RECORD USING DEEPLY LEARNED AUTOENCODERS.

Authors:  Brett K Beaulieu-Jones; Jason H Moore
Journal:  Pac Symp Biocomput       Date:  2017

3.  GFR decline and subsequent risk of established kidney outcomes: a meta-analysis of 37 randomized controlled trials.

Authors:  Hiddo J Lambers Heerspink; Hocine Tighiouart; Yingying Sang; Shoshana Ballew; Hasi Mondal; Kunihiro Matsushita; Josef Coresh; Andrew S Levey; Lesley A Inker
Journal:  Am J Kidney Dis       Date:  2014-10-16       Impact factor: 8.860

4.  A pattern-mixture model approach for handling missing continuous outcome data in longitudinal cluster randomized trials.

Authors:  Mallorie H Fiero; Chiu-Hsieh Hsu; Melanie L Bell
Journal:  Stat Med       Date:  2017-08-07       Impact factor: 2.373

5.  Sparse Convolutional Denoising Autoencoders for Genotype Imputation.

Authors:  Junjie Chen; Xinghua Shi
Journal:  Genes (Basel)       Date:  2019-08-28       Impact factor: 4.096

6.  Clinical Characteristics of and Risk Factors for Chronic Kidney Disease Among Adults and Children: An Analysis of the CURE-CKD Registry.

Authors:  Katherine R Tuttle; Radica Z Alicic; O Kenrik Duru; Cami R Jones; Kenn B Daratha; Susanne B Nicholas; Sterling M McPherson; Joshua J Neumiller; Douglas S Bell; Carol M Mangione; Keith C Norris
Journal:  JAMA Netw Open       Date:  2019-12-02

7.  Rationale and design of a multicenter Chronic Kidney Disease (CKD) and at-risk for CKD electronic health records-based registry: CURE-CKD.

Authors:  Keith C Norris; O Kenrik Duru; Radica Z Alicic; Kenn B Daratha; Susanne B Nicholas; Sterling M McPherson; Douglas S Bell; Jenny I Shen; Cami R Jones; Tannaz Moin; Amy D Waterman; Joshua J Neumiller; Roberto B Vargas; Alex A T Bui; Carol M Mangione; Katherine R Tuttle
Journal:  BMC Nephrol       Date:  2019-11-20       Impact factor: 2.388

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.