Literature DB >> 34718618

A cost-effective chart review sampling design to account for phenotyping error in electronic health records (EHR) data.

Ziyan Yin1, Jiayi Tong2, Yong Chen2, Rebecca A Hubbard2, Cheng Yong Tang1.   

Abstract

OBJECTIVES: Electronic health records (EHR) are commonly used for the identification of novel risk factors for disease, often referred to as an association study. A major challenge to EHR-based association studies is phenotyping error in EHR-derived outcomes. A manual chart review of phenotypes is necessary for unbiased evaluation of risk factor associations. However, this process is time-consuming and expensive. The objective of this paper is to develop an outcome-dependent sampling approach for designing manual chart review, where EHR-derived phenotypes can be used to guide the selection of charts to be reviewed in order to maximize statistical efficiency in the subsequent estimation of risk factor associations.
MATERIALS AND METHODS: After applying outcome-dependent sampling, an augmented estimator can be constructed by optimally combining the chart-reviewed phenotypes from the selected patients with the error-prone EHR-derived phenotype. We conducted simulation studies to evaluate the proposed method and applied our method to data on colon cancer recurrence in a cohort of patients treated for a primary colon cancer in the Kaiser Permanente Washington (KPW) healthcare system.
RESULTS: Simulations verify the coverage probability of the proposed method and show that, when disease prevalence is less than 30%, the proposed method has smaller variance than an existing method where the validation set for chart review is uniformly sampled. In addition, from design perspective, the proposed method is able to achieve the same statistical power with 50% fewer charts to be validated than the uniform sampling method, thus, leading to a substantial efficiency gain in chart review. These findings were also confirmed by the application of the competing methods to the KPW colon cancer data. DISCUSSION: Our simulation studies and analysis of data from KPW demonstrate that, compared to an existing uniform sampling method, the proposed outcome-dependent method can lead to a more efficient chart review sampling design and unbiased association estimates with higher statistical efficiency.
CONCLUSION: The proposed method not only optimally combines phenotypes from chart review with EHR-derived phenotypes but also suggests an efficient design for conducting chart review, with the goal of improving the efficiency of estimated risk factor associations using EHR data.
© The Author(s) 2021. Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved. For permissions, please email: journals.permissions@oup.com.

Entities:  

Keywords:  association study; augmented estimation; cost-effective chart review; outcome-dependent sampling

Mesh:

Year:  2021        PMID: 34718618      PMCID: PMC8714286          DOI: 10.1093/jamia/ocab222

Source DB:  PubMed          Journal:  J Am Med Inform Assoc        ISSN: 1067-5027            Impact factor:   7.942


  17 in total

1.  The design and analysis of case-control studies with biased sampling.

Authors:  C R Weinberg; S Wacholder
Journal:  Biometrics       Date:  1990-12       Impact factor: 2.571

2.  Robust replication of genotype-phenotype associations across multiple diseases in an electronic medical record.

Authors:  Marylyn D Ritchie; Joshua C Denny; Dana C Crawford; Andrea H Ramirez; Justin B Weiner; Jill M Pulley; Melissa A Basford; Kristin Brown-Gentry; Jeffrey R Balser; Daniel R Masys; Jonathan L Haines; Dan M Roden
Journal:  Am J Hum Genet       Date:  2010-04-01       Impact factor: 11.025

3.  Identification of genomic predictors of atrioventricular conduction: using electronic medical records as a tool for genome science.

Authors:  Joshua C Denny; Marylyn D Ritchie; Dana C Crawford; Jonathan S Schildcrout; Andrea H Ramirez; Jill M Pulley; Melissa A Basford; Daniel R Masys; Jonathan L Haines; Dan M Roden
Journal:  Circulation       Date:  2010-11-01       Impact factor: 29.690

4.  An Empirical Study for Impacts of Measurement Errors on EHR based Association Studies.

Authors:  Rui Duan; Ming Cao; Yonghui Wu; Jing Huang; Joshua C Denny; Hua Xu; Yong Chen
Journal:  AMIA Annu Symp Proc       Date:  2017-02-10

5.  Semi-supervised validation of multiple surrogate outcomes with application to electronic medical records phenotyping.

Authors:  Chuan Hong; Katherine P Liao; Tianxi Cai
Journal:  Biometrics       Date:  2019-03-08       Impact factor: 2.571

6.  Detecting Lung and Colorectal Cancer Recurrence Using Structured Clinical/Administrative Data to Enable Outcomes Research and Population Health Management.

Authors:  Michael J Hassett; Hajime Uno; Angel M Cronin; Nikki M Carroll; Mark C Hornbrook; Debra Ritzwoller
Journal:  Med Care       Date:  2017-12       Impact factor: 2.983

7.  Validation data-based adjustments for outcome misclassification in logistic regression: an illustration.

Authors:  Robert H Lyles; Li Tang; Hillary M Superak; Caroline C King; David D Celentano; Yungtai Lo; Jack D Sobel
Journal:  Epidemiology       Date:  2011-07       Impact factor: 4.822

8.  Prediction of Adverse Events in Patients Undergoing Major Cardiovascular Procedures.

Authors:  Bobak J Mortazavi; Nihar Desai; Jing Zhang; Andreas Coppi; Fred Warner; Harlan M Krumholz; Sahand Negahban
Journal:  IEEE J Biomed Health Inform       Date:  2017-03-01       Impact factor: 5.772

9.  Comparative Effectiveness Research Using Observational Data: Active Comparators to Emulate Target Trials with Inactive Comparators.

Authors:  Anders Huitfeldt; Miguel A Hernan; Mette Kalager; James M Robins
Journal:  EGEMS (Wash DC)       Date:  2016-10-14

10.  Why most published research findings are false.

Authors:  John P A Ioannidis
Journal:  PLoS Med       Date:  2005-08-30       Impact factor: 11.613

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.