Literature DB >> 31617899

An augmented estimation procedure for EHR-based association studies accounting for differential misclassification.

Jiayi Tong1, Jing Huang1, Jessica Chubak2, Xuan Wang3, Jason H Moore1, Rebecca A Hubbard1, Yong Chen1.   

Abstract

OBJECTIVES: The ability to identify novel risk factors for health outcomes is a key strength of electronic health record (EHR)-based research. However, the validity of such studies is limited by error in EHR-derived phenotypes. The objective of this study was to develop a novel procedure for reducing bias in estimated associations between risk factors and phenotypes in EHR data.
MATERIALS AND METHODS: The proposed method combines the strengths of a gold-standard phenotype obtained through manual chart review for a small validation set of patients and an automatically-derived phenotype that is available for all patients but is potentially error-prone (hereafter referred to as the algorithm-derived phenotype). An augmented estimator of associations is obtained by optimally combining these 2 phenotypes. We conducted simulation studies to evaluate the performance of the augmented estimator and conducted an analysis of risk factors for second breast cancer events using data on a cohort from Kaiser Permanente Washington.
RESULTS: The proposed method was shown to reduce bias relative to an estimator using only the algorithm-derived phenotype and reduce variance compared to an estimator using only the validation data. DISCUSSION: Our simulation studies and real data application demonstrate that, compared to the estimator using validation data only, the augmented estimator has lower variance (ie, higher statistical efficiency). Compared to the estimator using error-prone EHR-derived phenotypes, the augmented estimator has smaller bias.
CONCLUSIONS: The proposed estimator can effectively combine an error-prone phenotype with gold-standard data from a limited chart review in order to improve analyses of risk factors using EHR data.
© The Author(s) 2019. Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved. For permissions, please email: journals.permissions@oup.com.

Entities:  

Keywords:  association study; bias reduction; differential misclassification; electronic health records; error in phenotype

Mesh:

Year:  2020        PMID: 31617899      PMCID: PMC7025368          DOI: 10.1093/jamia/ocz180

Source DB:  PubMed          Journal:  J Am Med Inform Assoc        ISSN: 1067-5027            Impact factor:   4.497


  28 in total

1.  Threshold model for misclassified binary responses with applications to animal breeding.

Authors:  R Rekaya; K A Weigel; D Gianola
Journal:  Biometrics       Date:  2001-12       Impact factor: 2.571

2.  Binomial regression with misclassification.

Authors:  Carlos Daniel Paulino; Paulo Soares; John Neuhaus
Journal:  Biometrics       Date:  2003-09       Impact factor: 2.571

3.  Logistic regression when the outcome is measured with uncertainty.

Authors:  L S Magder; J P Hughes
Journal:  Am J Epidemiol       Date:  1997-07-15       Impact factor: 4.897

Review 4.  Methods of integrating data to uncover genotype-phenotype interactions.

Authors:  Marylyn D Ritchie; Emily R Holzinger; Ruowang Li; Sarah A Pendergrass; Dokyoon Kim
Journal:  Nat Rev Genet       Date:  2015-01-13       Impact factor: 53.242

5.  Improving the power of genetic association tests with imperfect phenotype derived from electronic medical records.

Authors:  Jennifer A Sinnott; Wei Dai; Katherine P Liao; Stanley Y Shaw; Ashwin N Ananthakrishnan; Vivian S Gainer; Elizabeth W Karlson; Susanne Churchill; Peter Szolovits; Shawn Murphy; Isaac Kohane; Robert Plenge; Tianxi Cai
Journal:  Hum Genet       Date:  2014-07-26       Impact factor: 4.132

6.  Accounting for misclassified outcomes in binary regression models using multiple imputation with internal validation data.

Authors:  Jessie K Edwards; Stephen R Cole; Melissa A Troester; David B Richardson
Journal:  Am J Epidemiol       Date:  2013-04-04       Impact factor: 4.897

7.  Weighted estimation for confounded binary outcomes subject to misclassification.

Authors:  Christopher A Gravel; Robert W Platt
Journal:  Stat Med       Date:  2017-10-30       Impact factor: 2.373

Review 8.  Mining electronic health records: towards better research applications and clinical care.

Authors:  Peter B Jensen; Lars J Jensen; Søren Brunak
Journal:  Nat Rev Genet       Date:  2012-05-02       Impact factor: 53.242

9.  Sensitivity analysis for misclassification in logistic regression via likelihood methods and predictive value weighting.

Authors:  Robert H Lyles; Ji Lin
Journal:  Stat Med       Date:  2010-09-30       Impact factor: 2.373

10.  Comparative safety of cardiovascular medication use and breast cancer outcomes among women with early stage breast cancer.

Authors:  Denise M Boudreau; Onchee Yu; Jessica Chubak; Heidi S Wirtz; Erin J Aiello Bowles; Monica Fujii; Diana S M Buist
Journal:  Breast Cancer Res Treat       Date:  2014-02-21       Impact factor: 4.872

View more
  1 in total

1.  A cost-effective chart review sampling design to account for phenotyping error in electronic health records (EHR) data.

Authors:  Ziyan Yin; Jiayi Tong; Yong Chen; Rebecca A Hubbard; Cheng Yong Tang
Journal:  J Am Med Inform Assoc       Date:  2021-12-28       Impact factor: 7.942

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.