Literature DB >> 34213008

Efficient odds ratio estimation under two-phase sampling using error-prone data from a multi-national HIV research cohort.

Sarah C Lotspeich1, Bryan E Shepherd1, Gustavo G C Amorim1, Pamela A Shaw2, Ran Tao1,3.   

Abstract

Persons living with HIV engage in routine clinical care, generating large amounts of data in observational HIV cohorts. These data are often error-prone, and directly using them in biomedical research could bias estimation and give misleading results. A cost-effective solution is the two-phase design, under which the error-prone variables are observed for all patients during Phase I, and that information is used to select patients for data auditing during Phase II. For example, the Caribbean, Central, and South America network for HIV epidemiology (CCASAnet) selected a random sample from each site for data auditing. Herein, we consider efficient odds ratio estimation with partially audited, error-prone data. We propose a semiparametric approach that uses all information from both phases and accommodates a number of error mechanisms. We allow both the outcome and covariates to be error-prone and these errors to be correlated, and selection of the Phase II sample can depend on Phase I data in an arbitrary manner. We devise a computationally efficient, numerically stable EM algorithm to obtain estimators that are consistent, asymptotically normal, and asymptotically efficient. We demonstrate the advantages of the proposed methods over existing ones through extensive simulations. Finally, we provide applications to the CCASAnet cohort.
© 2021 The International Biometric Society.

Entities:  

Keywords:  case-control sampling; data audits; electronic health records; measurement error; missing data; semiparametric efficiency

Year:  2021        PMID: 34213008      PMCID: PMC8720323          DOI: 10.1111/biom.13512

Source DB:  PubMed          Journal:  Biometrics        ISSN: 0006-341X            Impact factor:   2.571


  20 in total

1.  Connections between survey calibration estimators and semiparametric models for incomplete data.

Authors:  Thomas Lumley; Pamela A Shaw; James Y Dai
Journal:  Int Stat Rev       Date:  2011-08       Impact factor: 2.217

2.  The effects of misclassification on the estimation of relative risk.

Authors:  B A Barron
Journal:  Biometrics       Date:  1977-06       Impact factor: 2.571

3.  Bayesian semiparametric regression in the presence of conditionally heteroscedastic measurement and regression errors.

Authors:  Abhra Sarkar; Bani K Mallick; Raymond J Carroll
Journal:  Biometrics       Date:  2014-06-25       Impact factor: 2.571

4.  Accounting for misclassified outcomes in binary regression models using multiple imputation with internal validation data.

Authors:  Jessie K Edwards; Stephen R Cole; Melissa A Troester; David B Richardson
Journal:  Am J Epidemiol       Date:  2013-04-04       Impact factor: 4.897

5.  Validation data-based adjustments for outcome misclassification in logistic regression: an illustration.

Authors:  Robert H Lyles; Li Tang; Hillary M Superak; Caroline C King; David D Celentano; Yungtai Lo; Jack D Sobel
Journal:  Epidemiology       Date:  2011-07       Impact factor: 4.822

6.  Effects of misclassifications on statistical inferences in epidemiology.

Authors:  D Quade; P A Lachenbruch; F S Whaley; D K McClish; R W Haley
Journal:  Am J Epidemiol       Date:  1980-05       Impact factor: 4.897

7.  Efficient Semiparametric Inference Under Two-Phase Sampling, With Applications to Genetic Association Studies.

Authors:  Ran Tao; Donglin Zeng; Dan-Yu Lin
Journal:  J Am Stat Assoc       Date:  2017-02-28       Impact factor: 5.033

8.  Measuring the quality of observational study data in an international HIV research network.

Authors:  Stephany N Duda; Bryan E Shepherd; Cynthia S Gadd; Daniel R Masys; Catherine C McGowan
Journal:  PLoS One       Date:  2012-04-06       Impact factor: 3.240

9.  Adaptive sampling in two-phase designs: a biomarker study for progression in arthritis.

Authors:  Michael A McIsaac; Richard J Cook
Journal:  Stat Med       Date:  2015-05-07       Impact factor: 2.373

View more
  1 in total

1.  Errors in multiple variables in human immunodeficiency virus (HIV) cohort and electronic health record data: statistical challenges and opportunities.

Authors:  Bryan E Shepherd; Pamela A Shaw
Journal:  Stat Commun Infect Dis       Date:  2020-10-07
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.