Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Errors in multiple variables in human immunodeficiency virus (HIV) cohort and electronic health record data: statistical challenges and opportunities.

Literature DB >> 35880997

Errors in multiple variables in human immunodeficiency virus (HIV) cohort and electronic health record data: statistical challenges and opportunities.

Abstract

Objectives: Observational data derived from patient electronic health records (EHR) data are increasingly used for human immunodeficiency virus/acquired immunodeficiency syndrome (HIV/AIDS) research. There are challenges to using these data, in particular with regards to data quality; some are recognized, some unrecognized, and some recognized but ignored. There are great opportunities for the statistical community to improve inference by incorporating validation subsampling into analyses of EHR data.
Methods: Methods to address measurement error, misclassification, and missing data are relevant, as are sampling designs such as two-phase sampling. However, many of the existing statistical methods for measurement error, for example, only address relatively simple settings, whereas the errors seen in these datasets span multiple variables (both predictors and outcomes), are correlated, and even affect who is included in the study.Results/
Conclusion: We will discuss some preliminary methods in this area with a particular focus on time-to-event outcomes and outline areas of future research.

Entities: Chemical

Keywords: HIV; electronic health records; measurement error; misclassification; two-phase sampling

Year: 2020 PMID： 35880997 PMCID： PMC9204761 DOI： 10.1515/scid-2019-0015

Source DB: PubMed Journal: Stat Commun Infect Dis

Keyword Cloud
References

56 in total

1. Measurement error in the timing of events: effect on survival analyses in randomized clinical trials.

Authors: Edward L Korn; Lori E Dodd; Boris Freidlin
Journal: Clin Trials Date: 2010-09-06 Impact factor: 2.486

2. Connections between survey calibration estimators and semiparametric models for incomplete data.

Authors: Thomas Lumley; Pamela A Shaw; James Y Dai
Journal: Int Stat Rev Date: 2011-08 Impact factor: 2.217

3. Optimal two-phase sampling design for comparing accuracies of two binary classification rules.

Authors: Huiping Xu; Siu L Hui; Shaun Grannis
Journal: Stat Med Date: 2013-09-04 Impact factor: 2.373

4. Accounting for misclassified outcomes in binary regression models using multiple imputation with internal validation data.

Authors: Jessie K Edwards; Stephen R Cole; Melissa A Troester; David B Richardson
Journal: Am J Epidemiol Date: 2013-04-04 Impact factor: 4.897

5. Global identifiability of latent class models with applications to diagnostic test accuracy studies: A Gröbner basis approach.

Authors: Rui Duan; Ming Cao; Yang Ning; Mingfu Zhu; Bin Zhang; Aidan McDermott; Haitao Chu; Xiaohua Zhou; Jason H Moore; Joseph G Ibrahim; Daniel O Scharfstein; Yong Chen
Journal: Biometrics Date: 2019-11-06 Impact factor: 2.571

10. Efficient odds ratio estimation under two-phase sampling using error-prone data from a multi-national HIV research cohort.

Authors: Sarah C Lotspeich; Bryan E Shepherd; Gustavo G C Amorim; Pamela A Shaw; Ran Tao
Journal: Biometrics Date: 2021-07-02 Impact factor: 2.571

Errors in multiple variables in human immunodeficiency virus (HIV) cohort and electronic health record data: statistical challenges and opportunities.

1. Measurement error in the timing of events: effect on survival analyses in randomized clinical trials.

2. Connections between survey calibration estimators and semiparametric models for incomplete data.

3. Optimal two-phase sampling design for comparing accuracies of two binary classification rules.

4. Accounting for misclassified outcomes in binary regression models using multiple imputation with internal validation data.

5. Global identifiability of latent class models with applications to diagnostic test accuracy studies: A Gröbner basis approach.

6. Using EHRs to integrate research with patient care: promises and challenges.

7. Binary regression with differentially misclassified response and exposure variables.

8. Regression calibration to correct correlated errors in outcome and exposure.

9. Measuring the quality of observational study data in an international HIV research network.

10. Efficient odds ratio estimation under two-phase sampling using error-prone data from a multi-national HIV research cohort.