Literature DB >> 24370496

PARAMO: a PARAllel predictive MOdeling platform for healthcare analytic research using electronic health records.

Kenney Ng1, Amol Ghoting2, Steven R Steinhubl3, Walter F Stewart4, Bradley Malin5, Jimeng Sun2.   

Abstract

OBJECTIVE: Healthcare analytics research increasingly involves the construction of predictive models for disease targets across varying patient cohorts using electronic health records (EHRs). To facilitate this process, it is critical to support a pipeline of tasks: (1) cohort construction, (2) feature construction, (3) cross-validation, (4) feature selection, and (5) classification. To develop an appropriate model, it is necessary to compare and refine models derived from a diversity of cohorts, patient-specific features, and statistical frameworks. The goal of this work is to develop and evaluate a predictive modeling platform that can be used to simplify and expedite this process for health data.
METHODS: To support this goal, we developed a PARAllel predictive MOdeling (PARAMO) platform which (1) constructs a dependency graph of tasks from specifications of predictive modeling pipelines, (2) schedules the tasks in a topological ordering of the graph, and (3) executes those tasks in parallel. We implemented this platform using Map-Reduce to enable independent tasks to run in parallel in a cluster computing environment. Different task scheduling preferences are also supported.
RESULTS: We assess the performance of PARAMO on various workloads using three datasets derived from the EHR systems in place at Geisinger Health System and Vanderbilt University Medical Center and an anonymous longitudinal claims database. We demonstrate significant gains in computational efficiency against a standard approach. In particular, PARAMO can build 800 different models on a 300,000 patient data set in 3h in parallel compared to 9days if running sequentially.
CONCLUSION: This work demonstrates that an efficient parallel predictive modeling platform can be developed for EHR data. This platform can facilitate large-scale modeling endeavors and speed-up the research workflow and reuse of health information. This platform is only a first step and provides the foundation for our ultimate goal of building analytic pipelines that are specialized for health data researchers.
Copyright © 2013 Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Electronic health records; Map reduce; Parallel computing; Predictive modeling; Scientific workflows

Mesh:

Year:  2013        PMID: 24370496      PMCID: PMC4075460          DOI: 10.1016/j.jbi.2013.12.012

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  31 in total

1.  Integrated modeling of clinical and gene expression information for personalized prediction of disease outcomes.

Authors:  Jennifer Pittman; Erich Huang; Holly Dressman; Cheng-Fang Horng; Skye H Cheng; Mei-Hua Tsou; Chii-Ming Chen; Andrea Bild; Edwin S Iversen; Andrew T Huang; Joseph R Nevins; Mike West
Journal:  Proc Natl Acad Sci U S A       Date:  2004-05-19       Impact factor: 11.205

2.  Prediction models in cancer care.

Authors:  Andrew J Vickers
Journal:  CA Cancer J Clin       Date:  2011-06-23       Impact factor: 508.702

3.  Validation of electronic medical record-based phenotyping algorithms: results and lessons learned from the eMERGE network.

Authors:  Katherine M Newton; Peggy L Peissig; Abel Ngo Kho; Suzette J Bielinski; Richard L Berg; Vidhu Choudhary; Melissa Basford; Christopher G Chute; Iftikhar J Kullo; Rongling Li; Jennifer A Pacheco; Luke V Rasmussen; Leslie Spangler; Joshua C Denny
Journal:  J Am Med Inform Assoc       Date:  2013-03-26       Impact factor: 4.497

4.  Development and validation of a mortality risk-adjustment model for patients hospitalized for exacerbations of chronic obstructive pulmonary disease.

Authors:  Ying P Tabak; Xiaowu Sun; Richard S Johannes; Linda Hyde; Andrew F Shorr; Peter K Lindenauer
Journal:  Med Care       Date:  2013-07       Impact factor: 2.983

5.  Accurately predicting bipolar disorder mood outcomes: implications for the use of electronic databases.

Authors:  Alisa B Busch; Brian Neelon; Katya Zelevinsky; Yulei He; Sharon-Lise T Normand
Journal:  Med Care       Date:  2012-04       Impact factor: 2.983

6.  Combining PubMed knowledge and EHR data to develop a weighted bayesian network for pancreatic cancer prediction.

Authors:  Di Zhao; Chunhua Weng
Journal:  J Biomed Inform       Date:  2011-05-27       Impact factor: 6.317

7.  Automatic identification of heart failure diagnostic criteria, using text analysis of clinical notes from electronic health records.

Authors:  Roy J Byrd; Steven R Steinhubl; Jimeng Sun; Shahram Ebadollahi; Walter F Stewart
Journal:  Int J Med Inform       Date:  2013-01-11       Impact factor: 4.046

8.  Combining knowledge and data driven insights for identifying risk factors using electronic health records.

Authors:  Jimeng Sun; Jianying Hu; Dijun Luo; Marianthi Markatou; Fei Wang; Shahram Edabollahi; Steven E Steinhubl; Zahra Daar; Walter F Stewart
Journal:  AMIA Annu Symp Proc       Date:  2012-11-03

9.  ICDA: a platform for Intelligent Care Delivery Analytics.

Authors:  David Gotz; Harry Stavropoulos; Jimeng Sun; Fei Wang
Journal:  AMIA Annu Symp Proc       Date:  2012-11-03

10.  Secondary Use of EHR: Data Quality Issues and Informatics Opportunities.

Authors:  Taxiarchis Botsis; Gunnar Hartvigsen; Fei Chen; Chunhua Weng
Journal:  Summit Transl Bioinform       Date:  2010-03-01
View more
  22 in total

1.  R-U policy frontiers for health data de-identification.

Authors:  Weiyi Xia; Raymond Heatherly; Xiaofeng Ding; Jiuyong Li; Bradley A Malin
Journal:  J Am Med Inform Assoc       Date:  2015-04-24       Impact factor: 4.497

2.  Preprocessing structured clinical data for predictive modeling and decision support. A roadmap to tackle the challenges.

Authors:  José Carlos Ferrão; Mónica Duarte Oliveira; Filipe Janela; Henrique M G Martins
Journal:  Appl Clin Inform       Date:  2016-12-07       Impact factor: 2.342

3.  Cloud-based Predictive Modeling System and its Application to Asthma Readmission Prediction.

Authors:  Robert Chen; Hang Su; Mohammed Khalilia; Sizhe Lin; Yue Peng; Tod Davis; Daniel A Hirsh; Elizabeth Searles; Javier Tejedor-Sojo; Michael Thompson; Jimeng Sun
Journal:  AMIA Annu Symp Proc       Date:  2015-11-05

4.  Improving condition severity classification with an efficient active learning based framework.

Authors:  Nir Nissim; Mary Regina Boland; Nicholas P Tatonetti; Yuval Elovici; George Hripcsak; Yuval Shahar; Robert Moskovitch
Journal:  J Biomed Inform       Date:  2016-03-22       Impact factor: 6.317

5.  omniClassifier: a Desktop Grid Computing System for Big Data Prediction Modeling.

Authors:  John H Phan; Sonal Kothari; May D Wang
Journal:  ACM BCB       Date:  2014-09

6.  Enhancing Prediction Models for One-Year Mortality in Patients with Acute Myocardial Infarction and Post Myocardial Infarction Syndrome.

Authors:  Seyedeh Neelufar Payrovnaziri; Laura A Barrett; Daniel Bis; Jiang Bian; Zhe He
Journal:  Stud Health Technol Inform       Date:  2019-08-21

7.  Inter-labeler and intra-labeler variability of condition severity classification models using active and passive learning methods.

Authors:  Nir Nissim; Yuval Shahar; Yuval Elovici; George Hripcsak; Robert Moskovitch
Journal:  Artif Intell Med       Date:  2017-04-27       Impact factor: 5.326

8.  An electronic medical record system with treatment recommendations based on patient similarity.

Authors:  Yu Wang; Yu Tian; Li-Li Tian; Yang-Ming Qian; Jing-Song Li
Journal:  J Med Syst       Date:  2015-03-12       Impact factor: 4.460

9.  Prognosis of Clinical Outcomes with Temporal Patterns and Experiences with One Class Feature Selection.

Authors:  Robert Moskovitch; Hyunmi Choi; George Hripcsak; Nicholas Tatonetti
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2016-07-14       Impact factor: 3.710

10.  Prediction of In-hospital Mortality in Emergency Department Patients With Sepsis: A Local Big Data-Driven, Machine Learning Approach.

Authors:  R Andrew Taylor; Joseph R Pare; Arjun K Venkatesh; Hani Mowafi; Edward R Melnick; William Fleischman; M Kennedy Hall
Journal:  Acad Emerg Med       Date:  2016-02-13       Impact factor: 3.451

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.