Literature DB >> 33843085

Evaluation of predictive model performance of an existing model in the presence of missing data.

Pin Li1, Jeremy M G Taylor1,2, Daniel E Spratt2, R Jeffery Karnes3, Matthew J Schipper1,2.   

Abstract

In medical research, the Brier score (BS) and the area under the receiver operating characteristic (ROC) curves (AUC) are two common metrics used to evaluate prediction models of a binary outcome, such as using biomarkers to predict the risk of developing a disease in the future. The assessment of an existing prediction models using data with missing covariate values is challenging. In this article, we propose inverse probability weighted (IPW) and augmented inverse probability weighted (AIPW) estimates of AUC and BS to handle the missing data. An alternative approach uses multiple imputation (MI), which requires a model for the distribution of the missing variable. We evaluated the performance of IPW and AIPW in comparison with MI in simulation studies under missing completely at random, missing at random, and missing not at random scenarios. When there are missing observations in the data, MI and IPW can be used to obtain unbiased estimates of BS and AUC if the imputation model for the missing variable or the model for the missingness is correctly specified. MI is more efficient than IPW. Our simulation results suggest that AIPW can be more efficient than IPW, and also achieves double robustness from miss-specification of either the missingness model or the imputation model. The outcome variable should be included in the model for the missing variable under all scenarios, while it only needs to be included in missingness model if the missingness depends on the outcome. We illustrate these methods using an example from prostate cancer.
© 2021 John Wiley & Sons Ltd.

Entities:  

Keywords:  Brier score; area under the ROC curve; augmented inverse probability weighting; inverse probability weighting; multiple imputation

Mesh:

Year:  2021        PMID: 33843085      PMCID: PMC8985431          DOI: 10.1002/sim.8978

Source DB:  PubMed          Journal:  Stat Med        ISSN: 0277-6715            Impact factor:   2.497


  16 in total

1.  Combining diagnostic test results to increase accuracy.

Authors:  M S Pepe; M L Thompson
Journal:  Biostatistics       Date:  2000-06       Impact factor: 5.899

2.  Missing covariate data in medical research: to impute is better than to ignore.

Authors:  Kristel J M Janssen; A Rogier T Donders; Frank E Harrell; Yvonne Vergouwe; Qingxia Chen; Diederick E Grobbee; Karel G M Moons
Journal:  J Clin Epidemiol       Date:  2010-03-24       Impact factor: 6.437

3.  Using the outcome for imputation of missing predictor values was preferred.

Authors:  Karel G M Moons; Rogier A R T Donders; Theo Stijnen; Frank E Harrell
Journal:  J Clin Epidemiol       Date:  2006-06-19       Impact factor: 6.437

Review 4.  Review of inverse probability weighting for dealing with missing data.

Authors:  Shaun R Seaman; Ian R White
Journal:  Stat Methods Med Res       Date:  2011-01-10       Impact factor: 3.021

5.  Robust estimation of area under ROC curve using auxiliary variables in the presence of missing biomarker values.

Authors:  Qi Long; Xiaoxi Zhang; Brent A Johnson
Journal:  Biometrics       Date:  2010-09-03       Impact factor: 2.571

6.  The University of California, San Francisco Cancer of the Prostate Risk Assessment score: a straightforward and reliable preoperative predictor of disease recurrence after radical prostatectomy.

Authors:  Matthew R Cooperberg; David J Pasta; Eric P Elkin; Mark S Litwin; David M Latini; Janeen Du Chane; Peter R Carroll
Journal:  J Urol       Date:  2005-06       Impact factor: 7.450

7.  Multiple imputation using chained equations: Issues and guidance for practice.

Authors:  Ian R White; Patrick Royston; Angela M Wood
Journal:  Stat Med       Date:  2010-11-30       Impact factor: 2.373

8.  Assessing the performance of prediction models: a framework for traditional and novel measures.

Authors:  Ewout W Steyerberg; Andrew J Vickers; Nancy R Cook; Thomas Gerds; Mithat Gonen; Nancy Obuchowski; Michael J Pencina; Michael W Kattan
Journal:  Epidemiology       Date:  2010-01       Impact factor: 4.822

9.  Dealing with missing predictor values when applying clinical prediction models.

Authors:  Kristel J M Janssen; Yvonne Vergouwe; A Rogier T Donders; Frank E Harrell; Qingxia Chen; Diederick E Grobbee; Karel G M Moons
Journal:  Clin Chem       Date:  2009-03-12       Impact factor: 8.327

Review 10.  The CAPRA Score at 10 Years: Contemporary Perspectives and Analysis of Supporting Studies.

Authors:  Jonathan S Brajtbord; Michael S Leapman; Matthew R Cooperberg
Journal:  Eur Urol       Date:  2016-09-08       Impact factor: 20.096

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.