Literature DB >> 30172760

Methodological variations in lagged regression for detecting physiologic drug effects in EHR data.

Matthew E Levine1, David J Albers2, George Hripcsak3.   

Abstract

We studied how lagged linear regression can be used to detect the physiologic effects of drugs from data in the electronic health record (EHR). We systematically examined the effect of methodological variations ((i) time series construction, (ii) temporal parameterization, (iii) intra-subject normalization, (iv) differencing (lagged rates of change achieved by taking differences between consecutive measurements), (v) explanatory variables, and (vi) regression models) on performance of lagged linear methods in this context. We generated two gold standards (one knowledge-base derived, one expert-curated) for expected pairwise relationships between 7 drugs and 4 labs, and evaluated how the 64 unique combinations of methodological perturbations reproduce the gold standards. Our 28 cohorts included patients in the Columbia University Medical Center/NewYork-Presbyterian Hospital clinical database, and ranged from 2820 to 79,514 patients with between 8 and 209 average time points per patient. The most accurate methods achieved AUROC of 0.794 for knowledge-base derived gold standard (95%CI [0.741, 0.847]) and 0.705 for expert-curated gold standard (95% CI [0.629, 0.781]). We observed a mean AUROC of 0.633 (95%CI [0.610, 0.657], expert-curated gold standard) across all methods that re-parameterize time according to sequence and use either a joint autoregressive model with time-series differencing or an independent lag model without differencing. The complement of this set of methods achieved a mean AUROC close to 0.5, indicating the importance of these choices. We conclude that time-series analysis of EHR data will likely rely on some of the beneficial pre-processing and modeling methodologies identified, and will certainly benefit from continued careful analysis of methodological perturbations. This study found that methodological variations, such as pre-processing and representations, have a large effect on results, exposing the importance of thoroughly evaluating these components when comparing machine-learning methods.
Copyright © 2018 Elsevier Inc. All rights reserved.

Entities:  

Mesh:

Substances:

Year:  2018        PMID: 30172760      PMCID: PMC6207533          DOI: 10.1016/j.jbi.2018.08.014

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  28 in total

1.  Comparing lagged linear correlation, lagged regression, Granger causality, and vector autoregression for uncovering associations in EHR data.

Authors:  Matthew E Levine; David J Albers; George Hripcsak
Journal:  AMIA Annu Symp Proc       Date:  2017-02-10

2.  Comprehensive temporal information detection from clinical text: medical events, time, and TLINK identification.

Authors:  Sunghwan Sohn; Kavishwar B Wagholikar; Dingcheng Li; Siddhartha R Jonnalagadda; Cui Tao; Ravikumar Komandur Elayavilli; Hongfang Liu
Journal:  J Am Med Inform Assoc       Date:  2013-04-04       Impact factor: 4.497

3.  Identifying and mitigating biases in EHR laboratory tests.

Authors:  Rimma Pivovarov; David J Albers; Jorge L Sepulveda; Noémie Elhadad
Journal:  J Biomed Inform       Date:  2014-04-13       Impact factor: 6.317

4.  Learning Linear Dynamical Systems from Multivariate Time Series: A Matrix Factorization Based Framework.

Authors:  Zitao Liu; Milos Hauskrecht
Journal:  Proc SIAM Int Conf Data Min       Date:  2016-05

5.  Learning Adaptive Forecasting Models from Irregularly Sampled Multivariate Clinical Data.

Authors:  Zitao Liu; Milos Hauskrecht
Journal:  Proc Conf AAAI Artif Intell       Date:  2016-02

6.  Observational Health Data Sciences and Informatics (OHDSI): Opportunities for Observational Researchers.

Authors:  George Hripcsak; Jon D Duke; Nigam H Shah; Christian G Reich; Vojtech Huser; Martijn J Schuemie; Marc A Suchard; Rae Woong Park; Ian Chi Kei Wong; Peter R Rijnbeek; Johan van der Lei; Nicole Pratt; G Niklas Norén; Yu-Chuan Li; Paul E Stang; David Madigan; Patrick B Ryan
Journal:  Stud Health Technol Inform       Date:  2015

7.  Bridging islands of information to establish an integrated knowledge base of drugs and health outcomes of interest.

Authors:  Richard D Boyce; Patrick B Ryan; G Niklas Norén; Martijn J Schuemie; Christian Reich; Jon Duke; Nicholas P Tatonetti; Gianluca Trifirò; Rave Harpaz; J Marc Overhage; Abraham G Hartzema; Mark Khayter; Erica A Voss; Christophe G Lambert; Vojtech Huser; Michel Dumontier
Journal:  Drug Saf       Date:  2014-08       Impact factor: 5.606

8.  High-fidelity phenotyping: richness and freedom from bias.

Authors:  George Hripcsak; David J Albers
Journal:  J Am Med Inform Assoc       Date:  2018-03-01       Impact factor: 4.497

9.  Next-generation phenotyping of electronic health records.

Authors:  George Hripcsak; David J Albers
Journal:  J Am Med Inform Assoc       Date:  2012-09-06       Impact factor: 4.497

10.  Correlating electronic health record concepts with healthcare process events.

Authors:  George Hripcsak; David J Albers
Journal:  J Am Med Inform Assoc       Date:  2013-08-23       Impact factor: 4.497

View more
  4 in total

1.  sureLDA: A multidisease automated phenotyping method for the electronic health record.

Authors:  Yuri Ahuja; Doudou Zhou; Zeling He; Jiehuan Sun; Victor M Castro; Vivian Gainer; Shawn N Murphy; Chuan Hong; Tianxi Cai
Journal:  J Am Med Inform Assoc       Date:  2020-08-01       Impact factor: 4.497

2.  The parameter Houlihan: A solution to high-throughput identifiability indeterminacy for brutally ill-posed problems.

Authors:  David J Albers; Matthew E Levine; Lena Mamykina; George Hripcsak
Journal:  Math Biosci       Date:  2019-08-24       Impact factor: 2.144

3.  Enabling personalized decision support with patient-generated data and attributable components.

Authors:  Elliot G Mitchell; Esteban G Tabak; Matthew E Levine; Lena Mamykina; David J Albers
Journal:  J Biomed Inform       Date:  2020-12-13       Impact factor: 6.317

4.  PatientExploreR: an extensible application for dynamic visualization of patient clinical history from electronic health records in the OMOP common data model.

Authors:  Benjamin S Glicksberg; Boris Oskotsky; Phyllis M Thangaraj; Nicholas Giangreco; Marcus A Badgeley; Kipp W Johnson; Debajyoti Datta; Vivek A Rudrapatna; Nadav Rappoport; Mark M Shervey; Riccardo Miotto; Theodore C Goldstein; Eugenia Rutenberg; Remi Frazier; Nelson Lee; Sharat Israni; Rick Larsen; Bethany Percha; Li Li; Joel T Dudley; Nicholas P Tatonetti; Atul J Butte
Journal:  Bioinformatics       Date:  2019-11-01       Impact factor: 6.937

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.