Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 PARAMO: a PARAllel predictive MOdeling platform for healthcare analytic research using electronic health records.

Literature DB >> 24370496

PARAMO: a PARAllel predictive MOdeling platform for healthcare analytic research using electronic health records.

Kenney Ng¹, Amol Ghoting², Steven R Steinhubl³, Walter F Stewart⁴, Bradley Malin⁵, Jimeng Sun².

Abstract

OBJECTIVE: Healthcare analytics research increasingly involves the construction of predictive models for disease targets across varying patient cohorts using electronic health records (EHRs). To facilitate this process, it is critical to support a pipeline of tasks: (1) cohort construction, (2) feature construction, (3) cross-validation, (4) feature selection, and (5) classification. To develop an appropriate model, it is necessary to compare and refine models derived from a diversity of cohorts, patient-specific features, and statistical frameworks. The goal of this work is to develop and evaluate a predictive modeling platform that can be used to simplify and expedite this process for health data.
METHODS: To support this goal, we developed a PARAllel predictive MOdeling (PARAMO) platform which (1) constructs a dependency graph of tasks from specifications of predictive modeling pipelines, (2) schedules the tasks in a topological ordering of the graph, and (3) executes those tasks in parallel. We implemented this platform using Map-Reduce to enable independent tasks to run in parallel in a cluster computing environment. Different task scheduling preferences are also supported.
RESULTS: We assess the performance of PARAMO on various workloads using three datasets derived from the EHR systems in place at Geisinger Health System and Vanderbilt University Medical Center and an anonymous longitudinal claims database. We demonstrate significant gains in computational efficiency against a standard approach. In particular, PARAMO can build 800 different models on a 300,000 patient data set in 3h in parallel compared to 9days if running sequentially.
CONCLUSION: This work demonstrates that an efficient parallel predictive modeling platform can be developed for EHR data. This platform can facilitate large-scale modeling endeavors and speed-up the research workflow and reuse of health information. This platform is only a first step and provides the foundation for our ultimate goal of building analytic pipelines that are specialized for health data researchers.

Entities: Disease Species

Keywords: Electronic health records; Map reduce; Parallel computing; Predictive modeling; Scientific workflows

Mesh：

Year: 2013 PMID： 24370496 PMCID： PMC4075460 DOI： 10.1016/j.jbi.2013.12.012

Source DB: PubMed Journal: J Biomed Inform ISSN： 1532-0464 Impact factor: 6.317

31 in total

1. Integrated modeling of clinical and gene expression information for personalized prediction of disease outcomes.

Authors: Jennifer Pittman; Erich Huang; Holly Dressman; Cheng-Fang Horng; Skye H Cheng; Mei-Hua Tsou; Chii-Ming Chen; Andrea Bild; Edwin S Iversen; Andrew T Huang; Joseph R Nevins; Mike West
Journal: Proc Natl Acad Sci U S A Date: 2004-05-19 Impact factor: 11.205

2. Prediction models in cancer care.

Authors: Andrew J Vickers
Journal: CA Cancer J Clin Date: 2011-06-23 Impact factor: 508.702

3. Validation of electronic medical record-based phenotyping algorithms: results and lessons learned from the eMERGE network.

Authors: Katherine M Newton; Peggy L Peissig; Abel Ngo Kho; Suzette J Bielinski; Richard L Berg; Vidhu Choudhary; Melissa Basford; Christopher G Chute; Iftikhar J Kullo; Rongling Li; Jennifer A Pacheco; Luke V Rasmussen; Leslie Spangler; Joshua C Denny
Journal: J Am Med Inform Assoc Date: 2013-03-26 Impact factor: 4.497

4. Development and validation of a mortality risk-adjustment model for patients hospitalized for exacerbations of chronic obstructive pulmonary disease.

Authors: Ying P Tabak; Xiaowu Sun; Richard S Johannes; Linda Hyde; Andrew F Shorr; Peter K Lindenauer
Journal: Med Care Date: 2013-07 Impact factor: 2.983

5. Accurately predicting bipolar disorder mood outcomes: implications for the use of electronic databases.

Authors: Alisa B Busch; Brian Neelon; Katya Zelevinsky; Yulei He; Sharon-Lise T Normand
Journal: Med Care Date: 2012-04 Impact factor: 2.983

6. Combining PubMed knowledge and EHR data to develop a weighted bayesian network for pancreatic cancer prediction.

Authors: Di Zhao; Chunhua Weng
Journal: J Biomed Inform Date: 2011-05-27 Impact factor: 6.317

7. Automatic identification of heart failure diagnostic criteria, using text analysis of clinical notes from electronic health records.

Authors: Roy J Byrd; Steven R Steinhubl; Jimeng Sun; Shahram Ebadollahi; Walter F Stewart
Journal: Int J Med Inform Date: 2013-01-11 Impact factor: 4.046

8. Combining knowledge and data driven insights for identifying risk factors using electronic health records.

Authors: Jimeng Sun; Jianying Hu; Dijun Luo; Marianthi Markatou; Fei Wang; Shahram Edabollahi; Steven E Steinhubl; Zahra Daar; Walter F Stewart
Journal: AMIA Annu Symp Proc Date: 2012-11-03

9. ICDA: a platform for Intelligent Care Delivery Analytics.

Authors: David Gotz; Harry Stavropoulos; Jimeng Sun; Fei Wang
Journal: AMIA Annu Symp Proc Date: 2012-11-03

10. Secondary Use of EHR: Data Quality Issues and Informatics Opportunities.

Authors: Taxiarchis Botsis; Gunnar Hartvigsen; Fei Chen; Chunhua Weng
Journal: Summit Transl Bioinform Date: 2010-03-01

22 in total

1. R-U policy frontiers for health data de-identification.

Authors: Weiyi Xia; Raymond Heatherly; Xiaofeng Ding; Jiuyong Li; Bradley A Malin
Journal: J Am Med Inform Assoc Date: 2015-04-24 Impact factor: 4.497

2. Preprocessing structured clinical data for predictive modeling and decision support. A roadmap to tackle the challenges.

Authors: José Carlos Ferrão; Mónica Duarte Oliveira; Filipe Janela; Henrique M G Martins
Journal: Appl Clin Inform Date: 2016-12-07 Impact factor: 2.342

3. Cloud-based Predictive Modeling System and its Application to Asthma Readmission Prediction.

Authors: Robert Chen; Hang Su; Mohammed Khalilia; Sizhe Lin; Yue Peng; Tod Davis; Daniel A Hirsh; Elizabeth Searles; Javier Tejedor-Sojo; Michael Thompson; Jimeng Sun
Journal: AMIA Annu Symp Proc Date: 2015-11-05

4. Improving condition severity classification with an efficient active learning based framework.

Authors: Nir Nissim; Mary Regina Boland; Nicholas P Tatonetti; Yuval Elovici; George Hripcsak; Yuval Shahar; Robert Moskovitch
Journal: J Biomed Inform Date: 2016-03-22 Impact factor: 6.317

5. omniClassifier: a Desktop Grid Computing System for Big Data Prediction Modeling.

Authors: John H Phan; Sonal Kothari; May D Wang
Journal: ACM BCB Date: 2014-09

6. Enhancing Prediction Models for One-Year Mortality in Patients with Acute Myocardial Infarction and Post Myocardial Infarction Syndrome.

Authors: Seyedeh Neelufar Payrovnaziri; Laura A Barrett; Daniel Bis; Jiang Bian; Zhe He
Journal: Stud Health Technol Inform Date: 2019-08-21

7. Inter-labeler and intra-labeler variability of condition severity classification models using active and passive learning methods.

Authors: Nir Nissim; Yuval Shahar; Yuval Elovici; George Hripcsak; Robert Moskovitch
Journal: Artif Intell Med Date: 2017-04-27 Impact factor: 5.326

8. An electronic medical record system with treatment recommendations based on patient similarity.

Authors: Yu Wang; Yu Tian; Li-Li Tian; Yang-Ming Qian; Jing-Song Li
Journal: J Med Syst Date: 2015-03-12 Impact factor: 4.460

9. Prognosis of Clinical Outcomes with Temporal Patterns and Experiences with One Class Feature Selection.

Authors: Robert Moskovitch; Hyunmi Choi; George Hripcsak; Nicholas Tatonetti
Journal: IEEE/ACM Trans Comput Biol Bioinform Date: 2016-07-14 Impact factor: 3.710

10. Prediction of In-hospital Mortality in Emergency Department Patients With Sepsis: A Local Big Data-Driven, Machine Learning Approach.

Authors: R Andrew Taylor; Joseph R Pare; Arjun K Venkatesh; Hani Mowafi; Edward R Melnick; William Fleischman; M Kennedy Hall
Journal: Acad Emerg Med Date: 2016-02-13 Impact factor: 3.451