Literature DB >> 33140432

Raking and regression calibration: Methods to address bias from correlated covariate and time-to-event error.

Eric J Oh1, Bryan E Shepherd2, Thomas Lumley3, Pamela A Shaw1.   

Abstract

Medical studies that depend on electronic health records (EHR) data are often subject to measurement error, as the data are not collected to support research questions under study. These data errors, if not accounted for in study analyses, can obscure or cause spurious associations between patient exposures and disease risk. Methodology to address covariate measurement error has been well developed; however, time-to-event error has also been shown to cause significant bias, but methods to address it are relatively underdeveloped. More generally, it is possible to observe errors in both the covariate and the time-to-event outcome that are correlated. We propose regression calibration (RC) estimators to simultaneously address correlated error in the covariates and the censored event time. Although RC can perform well in many settings with covariate measurement error, it is biased for nonlinear regression models, such as the Cox model. Thus, we additionally propose raking estimators which are consistent estimators of the parameter defined by the population estimating equation. Raking can improve upon RC in certain settings with failure-time data, require no explicit modeling of the error structure, and can be utilized under outcome-dependent sampling designs. We discuss features of the underlying estimation problem that affect the degree of improvement the raking estimator has over the RC approach. Detailed simulation studies are presented to examine the performance of the proposed estimators under varying levels of signal, error, and censoring. The methodology is illustrated on observational EHR data on HIV outcomes from the Vanderbilt Comprehensive Care Clinic.
© 2020 John Wiley & Sons Ltd.

Entities:  

Keywords:  calibration; electronic health records; measurement error; misclassification; raking; survival analysis

Mesh:

Year:  2020        PMID: 33140432      PMCID: PMC7874496          DOI: 10.1002/sim.8793

Source DB:  PubMed          Journal:  Stat Med        ISSN: 0277-6715            Impact factor:   2.373


  16 in total

1.  Analysis of progression-free survival data using a discrete time survival model that incorporates measurements with and without diagnostic error.

Authors:  Sally Hunsberger; Paul S Albert; Lori Dodd
Journal:  Clin Trials       Date:  2010-11-25       Impact factor: 2.486

2.  Hazard ratio estimation for biomarker-calibrated dietary exposures.

Authors:  Pamela A Shaw; Ross L Prentice
Journal:  Biometrics       Date:  2011-10-17       Impact factor: 2.571

3.  Accounting for data errors discovered from an audit in multiple linear regression.

Authors:  Bryan E Shepherd; Chang Yu
Journal:  Biometrics       Date:  2011-01-31       Impact factor: 2.571

4.  Connections between survey calibration estimators and semiparametric models for incomplete data.

Authors:  Thomas Lumley; Pamela A Shaw; James Y Dai
Journal:  Int Stat Rev       Date:  2011-08       Impact factor: 2.217

5.  Logistic regression when the outcome is measured with uncertainty.

Authors:  L S Magder; J P Hughes
Journal:  Am J Epidemiol       Date:  1997-07-15       Impact factor: 4.897

6.  Regression calibration in failure time regression.

Authors:  C Y Wang; L Hsu; Z D Feng; R L Prentice
Journal:  Biometrics       Date:  1997-03       Impact factor: 2.571

7.  Accounting for misclassified outcomes in binary regression models using multiple imputation with internal validation data.

Authors:  Jessie K Edwards; Stephen R Cole; Melissa A Troester; David B Richardson
Journal:  Am J Epidemiol       Date:  2013-04-04       Impact factor: 4.897

8.  EVALUATING RISK-PREDICTION MODELS USING DATA FROM ELECTRONIC HEALTH RECORDS.

Authors:  L E Wang; Pamela A Shaw; Hansie M Mathelier; Stephen E Kimmel; Benjamin French
Journal:  Ann Appl Stat       Date:  2016-03       Impact factor: 2.083

9.  Race and sex differences in antiretroviral therapy use and mortality among HIV-infected persons in care.

Authors:  Diana C Lemly; Bryan E Shepherd; Todd Hulgan; Peter Rebeiro; Samuel Stinnette; Robert B Blackwell; Sally Bebawy; Asghar Kheshti; Timothy R Sterling; Stephen P Raffanti
Journal:  J Infect Dis       Date:  2009-04-01       Impact factor: 5.226

10.  Measuring the quality of observational study data in an international HIV research network.

Authors:  Stephany N Duda; Bryan E Shepherd; Cynthia S Gadd; Daniel R Masys; Catherine C McGowan
Journal:  PLoS One       Date:  2012-04-06       Impact factor: 3.240

View more
  4 in total

1.  Errors in multiple variables in human immunodeficiency virus (HIV) cohort and electronic health record data: statistical challenges and opportunities.

Authors:  Bryan E Shepherd; Pamela A Shaw
Journal:  Stat Commun Infect Dis       Date:  2020-10-07

2.  An approximate quasi-likelihood approach for error-prone failure time outcomes and exposures.

Authors:  Lillian A Boe; Lesley F Tinker; Pamela A Shaw
Journal:  Stat Med       Date:  2021-06-22       Impact factor: 2.497

3.  Two-Phase Sampling Designs for Data Validation in Settings with Covariate Measurement Error and Continuous Outcome.

Authors:  Gustavo Amorim; Ran Tao; Sarah Lotspeich; Pamela A Shaw; Thomas Lumley; Bryan E Shepherd
Journal:  J R Stat Soc Ser A Stat Soc       Date:  2021-04-15       Impact factor: 2.175

4.  Improved generalized raking estimators to address dependent covariate and failure-time outcome error.

Authors:  Eric J Oh; Bryan E Shepherd; Thomas Lumley; Pamela A Shaw
Journal:  Biom J       Date:  2021-03-11       Impact factor: 1.715

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.