Literature DB >> 26483171

Implications of non-stationarity on predictive modeling using EHRs.

Kenneth Jung1, Nigam H Shah2.   

Abstract

The rapidly increasing volume of clinical information captured in Electronic Health Records (EHRs) has led to the application of increasingly sophisticated models for purposes such as disease subtype discovery and predictive modeling. However, increasing adoption of EHRs implies that in the near future, much of the data available for such purposes will be from a time period during which both the practice of medicine and the clinical use of EHRs are in flux due to historic changes in both technology and incentives. In this work, we explore the implications of this phenomenon, called non-stationarity, on predictive modeling. We focus on the problem of predicting delayed wound healing using data available in the EHR during the first week of care in outpatient wound care centers, using a large dataset covering over 150,000 individual wounds and 59,958 patients seen over a period of four years. We manipulate the degree of non-stationarity seen by the model development process by changing the way data is split into training and test sets. We demonstrate that non-stationarity can lead to quite different conclusions regarding the relative merits of different models with respect to predictive power and calibration of their posterior probabilities. Under the non-stationarity exhibited in this dataset, the performance advantage of complex methods such as stacking relative to the best simple classifier disappears. Ignoring non-stationarity can thus lead to sub-optimal model selection in this task.
Copyright © 2015 Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Data mining; Machine learning; Predictive model; Prognostic model; Wound healing

Mesh:

Year:  2015        PMID: 26483171      PMCID: PMC4684770          DOI: 10.1016/j.jbi.2015.10.006

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  13 in total

1.  Evidence generating medicine: redefining the research-practice relationship to complete the evidence cycle.

Authors:  Peter J Embi; Philip R O Payne
Journal:  Med Care       Date:  2013-08       Impact factor: 2.983

2.  Defining a comprehensive verotype using electronic health records for personalized medicine.

Authors:  Mary Regina Boland; George Hripcsak; Yufeng Shen; Wendy K Chung; Chunhua Weng
Journal:  J Am Med Inform Assoc       Date:  2013-09-03       Impact factor: 4.497

3.  Using EHRs to integrate research with patient care: promises and challenges.

Authors:  Chunhua Weng; Paul Appelbaum; George Hripcsak; Ian Kronish; Linda Busacca; Karina W Davidson; J Thomas Bigger
Journal:  J Am Med Inform Assoc       Date:  2012-04-29       Impact factor: 4.497

4.  Regularization Paths for Generalized Linear Models via Coordinate Descent.

Authors:  Jerome Friedman; Trevor Hastie; Rob Tibshirani
Journal:  J Stat Softw       Date:  2010       Impact factor: 6.440

5.  Achieving a nationwide learning health system.

Authors:  Charles P Friedman; Adam K Wong; David Blumenthal
Journal:  Sci Transl Med       Date:  2010-11-10       Impact factor: 17.956

6.  Caveats for the use of operational electronic health record data in comparative effectiveness research.

Authors:  William R Hersh; Mark G Weiner; Peter J Embi; Judith R Logan; Philip R O Payne; Elmer V Bernstam; Harold P Lehmann; George Hripcsak; Timothy H Hartzog; James J Cimino; Joel H Saltz
Journal:  Med Care       Date:  2013-08       Impact factor: 2.983

7.  Big data in health care: using analytics to identify and manage high-risk and high-cost patients.

Authors:  David W Bates; Suchi Saria; Lucila Ohno-Machado; Anand Shah; Gabriel Escobar
Journal:  Health Aff (Millwood)       Date:  2014-07       Impact factor: 6.301

8.  The coming age of data-driven medicine: translational bioinformatics' next frontier.

Authors:  Nigam H Shah; Jessica D Tenenbaum
Journal:  J Am Med Inform Assoc       Date:  2012-06       Impact factor: 4.497

9.  Assessing Google flu trends performance in the United States during the 2009 influenza virus A (H1N1) pandemic.

Authors:  Samantha Cook; Corrie Conrad; Ashley L Fowlkes; Matthew H Mohebbi
Journal:  PLoS One       Date:  2011-08-19       Impact factor: 3.240

10.  Bias associated with mining electronic health records.

Authors:  George Hripcsak; Charles Knirsch; Li Zhou; Adam Wilcox; Genevieve Melton
Journal:  J Biomed Discov Collab       Date:  2011-06-06
View more
  13 in total

1.  Comparing lagged linear correlation, lagged regression, Granger causality, and vector autoregression for uncovering associations in EHR data.

Authors:  Matthew E Levine; David J Albers; George Hripcsak
Journal:  AMIA Annu Symp Proc       Date:  2017-02-10

2.  Scalable Electronic Phenotyping For Studying Patient Comorbidities.

Authors:  Albee Y Ling; Emily Alsentzer; Josephine Chen; Juan M Banda; Suzanne Tamang; Evan Minty
Journal:  AMIA Annu Symp Proc       Date:  2018-12-05

3.  Enhancing Prediction Models for One-Year Mortality in Patients with Acute Myocardial Infarction and Post Myocardial Infarction Syndrome.

Authors:  Seyedeh Neelufar Payrovnaziri; Laura A Barrett; Daniel Bis; Jiang Bian; Zhe He
Journal:  Stud Health Technol Inform       Date:  2019-08-21

4.  Machine learning versus traditional methods for the development of risk stratification scores: a case study using original Canadian Syncope Risk Score data.

Authors:  Lars Grant; Pil Joo; Marie-Joe Nemnom; Venkatesh Thiruganasambandamoorthy
Journal:  Intern Emerg Med       Date:  2021-11-03       Impact factor: 5.472

5.  The use of machine learning for the identification of peripheral artery disease and future mortality risk.

Authors:  Elsie Gyang Ross; Nigam H Shah; Ronald L Dalman; Kevin T Nead; John P Cooke; Nicholas J Leeper
Journal:  J Vasc Surg       Date:  2016-06-03       Impact factor: 4.268

6.  Latent Patient Cluster Discovery for Robust Future Forecasting and New-Patient Generalization.

Authors:  Ting Qian; Aaron J Masino
Journal:  PLoS One       Date:  2016-09-16       Impact factor: 3.240

7.  Electronic phenotyping with APHRODITE and the Observational Health Sciences and Informatics (OHDSI) data network.

Authors:  Juan M Banda; Yoni Halpern; David Sontag; Nigam H Shah
Journal:  AMIA Jt Summits Transl Sci Proc       Date:  2017-07-26

8.  A dataset quantifying polypharmacy in the United States.

Authors:  Katie J Quinn; Nigam H Shah
Journal:  Sci Data       Date:  2017-10-31       Impact factor: 6.444

9.  High-fidelity phenotyping: richness and freedom from bias.

Authors:  George Hripcsak; David J Albers
Journal:  J Am Med Inform Assoc       Date:  2018-03-01       Impact factor: 4.497

10.  Predicting the need for a reduced drug dose, at first prescription.

Authors:  Adrien Coulet; Nigam H Shah; Maxime Wack; Mohammad B Chawki; Nicolas Jay; Michel Dumontier
Journal:  Sci Rep       Date:  2018-10-22       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.