Literature DB >> 21281274

Accounting for data errors discovered from an audit in multiple linear regression.

Bryan E Shepherd1, Chang Yu.   

Abstract

A data coordinating team performed onsite audits and discovered discrepancies between the data sent to the coordinating center and that recorded at sites. We present statistical methods for incorporating audit results into analyses. This can be thought of as a measurement error problem, where the distribution of errors is a mixture with a point mass at 0. If the error rate is nonzero, then even if the mean of the discrepancy between the reported and correct values of a predictor is 0, naive estimates of the association between two continuous variables will be biased. We consider scenarios where there are (1) errors in the predictor, (2) errors in the outcome, and (3) possibly correlated errors in the predictor and outcome. We show how to incorporate the error rate and magnitude, estimated from a random subset (the audited records), to compute unbiased estimates of association and proper confidence intervals. We then extend these results to multiple linear regression where multiple covariates may be incorrect in the database and the rate and magnitude of the errors may depend on study site. We study the finite sample properties of our estimators using simulations, discuss some practical considerations, and illustrate our methods with data from 2815 HIV-infected patients in Latin America, of whom 234 had their data audited using a sequential auditing plan.
© 2011, The International Biometric Society.

Entities:  

Mesh:

Year:  2011        PMID: 21281274      PMCID: PMC3092800          DOI: 10.1111/j.1541-0420.2010.01543.x

Source DB:  PubMed          Journal:  Biometrics        ISSN: 0006-341X            Impact factor:   2.571


  4 in total

Review 1.  Cohort Profile: Caribbean, Central and South America Network for HIV research (CCASAnet) collaboration within the International Epidemiologic Databases to Evaluate AIDS (IeDEA) programme.

Authors:  Catherine C McGowan; Pedro Cahn; Eduardo Gotuzzo; Denis Padgett; Jean W Pape; Marcelo Wolff; Mauro Schechter; Daniel R Masys
Journal:  Int J Epidemiol       Date:  2007-09-10       Impact factor: 7.196

2.  The effects of data entry error: an analysis of partial verification.

Authors:  J P Mullooly
Journal:  Comput Biomed Res       Date:  1990-06

3.  Mortality during the first year of potent antiretroviral therapy in HIV-1-infected patients in 7 sites throughout Latin America and the Caribbean.

Authors:  Suely H Tuboi; Mauro Schechter; Catherine C McGowan; Carina Cesar; Alejandro Krolewiecki; Pedro Cahn; Marcelo Wolff; Jean W Pape; Denis Padgett; Juan Sierra Madero; Eduardo Gotuzzo; Daniel R Masys; Bryan E Shepherd
Journal:  J Acquir Immune Defic Syndr       Date:  2009-08-15       Impact factor: 3.731

4.  CD4 cell count and initiation of antiretroviral therapy: trends in seven UK centres, 1997-2003.

Authors:  W Stöhr; Dt Dunn; K Porter; T Hill; B Gazzard; J Walsh; R Gilson; P Easterbrook; M Fisher; Ma Johnson; Vc Delpech; An Phillips; Ca Sabin
Journal:  HIV Med       Date:  2007-04       Impact factor: 3.180

  4 in total
  11 in total

1.  EVALUATING RISK-PREDICTION MODELS USING DATA FROM ELECTRONIC HEALTH RECORDS.

Authors:  L E Wang; Pamela A Shaw; Hansie M Mathelier; Stephen E Kimmel; Benjamin French
Journal:  Ann Appl Stat       Date:  2016-03       Impact factor: 2.083

2.  Errors in multiple variables in human immunodeficiency virus (HIV) cohort and electronic health record data: statistical challenges and opportunities.

Authors:  Bryan E Shepherd; Pamela A Shaw
Journal:  Stat Commun Infect Dis       Date:  2020-10-07

3.  An approximate quasi-likelihood approach for error-prone failure time outcomes and exposures.

Authors:  Lillian A Boe; Lesley F Tinker; Pamela A Shaw
Journal:  Stat Med       Date:  2021-06-22       Impact factor: 2.497

4.  Using audit information to adjust parameter estimates for data errors in clinical trials.

Authors:  Bryan E Shepherd; Pamela A Shaw; Lori E Dodd
Journal:  Clin Trials       Date:  2012-07-30       Impact factor: 2.486

5.  Considerations for analysis of time-to-event outcomes measured with error: Bias and correction with SIMEX.

Authors:  Eric J Oh; Bryan E Shepherd; Thomas Lumley; Pamela A Shaw
Journal:  Stat Med       Date:  2017-11-29       Impact factor: 2.373

6.  Raking and regression calibration: Methods to address bias from correlated covariate and time-to-event error.

Authors:  Eric J Oh; Bryan E Shepherd; Thomas Lumley; Pamela A Shaw
Journal:  Stat Med       Date:  2020-11-02       Impact factor: 2.373

7.  Efficient semiparametric inference for two-phase studies with outcome and covariate measurement errors.

Authors:  Ran Tao; Sarah C Lotspeich; Gustavo Amorim; Pamela A Shaw; Bryan E Shepherd
Journal:  Stat Med       Date:  2020-11-03       Impact factor: 2.373

8.  Regression calibration to correct correlated errors in outcome and exposure.

Authors:  Pamela A Shaw; Jiwei He; Bryan E Shepherd
Journal:  Stat Med       Date:  2020-10-21       Impact factor: 2.373

9.  Measuring the quality of observational study data in an international HIV research network.

Authors:  Stephany N Duda; Bryan E Shepherd; Cynthia S Gadd; Daniel R Masys; Catherine C McGowan
Journal:  PLoS One       Date:  2012-04-06       Impact factor: 3.240

10.  Self-audits as alternatives to travel-audits for improving data quality in the Caribbean, Central and South America network for HIV epidemiology.

Authors:  Sarah C Lotspeich; Mark J Giganti; Marcelle Maia; Renalice Vieira; Daisy Maria Machado; Regina Célia Succi; Sayonara Ribeiro; Mario Sergio Pereira; Maria Fernanda Rodriguez; Gaetane Julmiste; Marco Tulio Luque; Yanink Caro-Vega; Fernando Mejia; Bryan E Shepherd; Catherine C McGowan; Stephany N Duda
Journal:  J Clin Transl Sci       Date:  2019-12-26
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.