Literature DB >> 29160558

Simpson's paradox in the integrated discrimination improvement.

J Chipman1, D Braun2,3.   

Abstract

The integrated discrimination improvement (IDI) is commonly used to compare two risk prediction models; it summarizes the extent a new model increases risk in events and decreases risk in non-events. The IDI averages risks across events and non-events and is therefore susceptible to Simpson's paradox. In some settings, adding a predictive covariate to a well calibrated model results in an overall negative (positive) IDI. However, if stratified by that same covariate, the strata-specific IDIs are positive (negative). Meanwhile, the calibration (observed to expected ratio and Hosmer-Lemeshow Goodness of Fit Test), area under the receiver operating characteristic curve, and Brier score improve overall and by stratum. We ran extensive simulations to investigate the impact of an imbalanced covariate upon metrics (IDI, area under the receiver operating characteristic curve, Brier score, and R2), provide an analytic explanation for the paradox in the IDI, and use an investigative metric, a Weighted IDI, to better understand the paradox. In simulations, all instances of the paradox occurred under stratum-specific mis-calibration, yet there were mis-calibrated settings in which the paradox did not occur. The paradox is illustrated on Cancer Genomics Network data by calculating predictions based on two versions of BRCAPRO, a Mendelian risk prediction model for breast and ovarian cancer. In both simulations and the Cancer Genomics Network data, overall model calibration did not guarantee stratum-level calibration. We conclude that the IDI should only assess model performance among a clinically relevant subset when stratum-level calibration is strictly met and recommend calculating additional metrics to confirm the direction and conclusions of the IDI.
Copyright © 2016 John Wiley & Sons, Ltd. Copyright © 2016 John Wiley & Sons, Ltd.

Entities:  

Keywords:  BRCAPRO; integrated discrimination improvement; reclassification; risk prediction; simpson's paradox

Mesh:

Year:  2016        PMID: 29160558      PMCID: PMC5726308          DOI: 10.1002/sim.6862

Source DB:  PubMed          Journal:  Stat Med        ISSN: 0277-6715            Impact factor:   2.373


  28 in total

1.  Sensitivity and specificity can change in opposite directions when new predictive markers are added to risk models.

Authors:  Ben Van Calster; Ewout W Steyerberg; Ralph B D'Agostino; Michael J Pencina
Journal:  Med Decis Making       Date:  2013-12-30       Impact factor: 2.583

2.  Use of reclassification for assessment of improved prediction: an empirical evaluation.

Authors:  Ioanna Tzoulaki; George Liberopoulos; John P A Ioannidis
Journal:  Int J Epidemiol       Date:  2011-02-16       Impact factor: 7.196

3.  Net reclassification improvement and integrated discrimination improvement require calibrated models: relevance from a marker and model perspective.

Authors:  Maarten J G Leening; Ewout W Steyerberg; Ben Van Calster; Ralph B D'Agostino; Michael J Pencina
Journal:  Stat Med       Date:  2014-08-30       Impact factor: 2.373

4.  What to expect from net reclassification improvement with three categories.

Authors:  Karol M Pencina; Michael J Pencina; Ralph B D'Agostino
Journal:  Stat Med       Date:  2014-08-31       Impact factor: 2.373

Review 5.  Net reclassification improvement: computation, interpretation, and controversies: a literature review and clinician's guide.

Authors:  Maarten J G Leening; Moniek M Vedder; Jacqueline C M Witteman; Michael J Pencina; Ewout W Steyerberg
Journal:  Ann Intern Med       Date:  2014-01-21       Impact factor: 25.391

6.  Extensions of net reclassification improvement calculations to measure usefulness of new biomarkers.

Authors:  Michael J Pencina; Ralph B D'Agostino; Ewout W Steyerberg
Journal:  Stat Med       Date:  2010-11-05       Impact factor: 2.373

7.  Evaluating the yield of medical tests.

Authors:  F E Harrell; R M Califf; D B Pryor; K L Lee; R A Rosati
Journal:  JAMA       Date:  1982-05-14       Impact factor: 56.272

8.  Decision curve analysis: a novel method for evaluating prediction models.

Authors:  Andrew J Vickers; Elena B Elkin
Journal:  Med Decis Making       Date:  2006 Nov-Dec       Impact factor: 2.583

9.  A unified inference procedure for a class of measures to assess improvement in risk prediction systems with survival data.

Authors:  Hajime Uno; Lu Tian; Tianxi Cai; Isaac S Kohane; L J Wei
Journal:  Stat Med       Date:  2012-10-05       Impact factor: 2.373

10.  Multiple diseases in carrier probability estimation: accounting for surviving all cancers other than breast and ovary in BRCAPRO.

Authors:  Hormuzd A Katki; Amanda Blackford; Sining Chen; Giovanni Parmigiani
Journal:  Stat Med       Date:  2008-09-30       Impact factor: 2.373

View more
  4 in total

1.  First things first: risk model performance metrics should reflect the clinical application.

Authors:  Kathleen F Kerr; Holly Janes
Journal:  Stat Med       Date:  2017-12-10       Impact factor: 2.373

Review 2.  Clinical risk reclassification at 10 years.

Authors:  Nancy R Cook; Olga V Demler; Nina P Paynter
Journal:  Stat Med       Date:  2017-12-10       Impact factor: 2.373

3.  Quantitative Genomic Dissection of Soybean Yield Components.

Authors:  Alencar Xavier; Katy M Rainey
Journal:  G3 (Bethesda)       Date:  2020-02-06       Impact factor: 3.154

4.  Development and Validation of a Nomogram Based on Nutritional Indicators and Tumor Markers for Prognosis Prediction of Pancreatic Ductal Adenocarcinoma.

Authors:  Haoran Li; Fang Zhou; Zhifei Cao; Yuchen Tang; Yujie Huang; Ye Li; Bin Yi; Jian Yang; Peng Du; Dongming Zhu; Jian Zhou
Journal:  Front Oncol       Date:  2021-05-31       Impact factor: 6.244

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.