Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Computationally efficient confidence intervals for cross-validated area under the ROC curve estimates.

Literature DB >> 26279737

Computationally efficient confidence intervals for cross-validated area under the ROC curve estimates.

Erin LeDell¹, Maya Petersen¹, Mark van der Laan¹.

Abstract

In binary classification problems, the area under the ROC curve (AUC) is commonly used to evaluate the performance of a prediction model. Often, it is combined with cross-validation in order to assess how the results will generalize to an independent data set. In order to evaluate the quality of an estimate for cross-validated AUC, we obtain an estimate of its variance. For massive data sets, the process of generating a single performance estimate can be computationally expensive. Additionally, when using a complex prediction method, the process of cross-validating a predictive model on even a relatively small data set can still require a large amount of computation time. Thus, in many practical settings, the bootstrap is a computationally intractable approach to variance estimation. As an alternative to the bootstrap, we demonstrate a computationally efficient influence curve based approach to obtaining a variance estimate for cross-validated AUC.

Entities: Chemical Disease Species

Keywords: AUC; ROC; binary classification; confidence intervals; cross-validation; influence curve; influence function; machine learning; model selection; variance estimation

Year: 2015 PMID： 26279737 PMCID： PMC4533123 DOI： 10.1214/15-EJS1035

Source DB: PubMed Journal: Electron J Stat ISSN： 1935-7524 Impact factor: 1.125

2 in total

1. ROCR: visualizing classifier performance in R.

Authors: Tobias Sing; Oliver Sander; Niko Beerenwinkel; Thomas Lengauer
Journal: Bioinformatics Date: 2005-08-11 Impact factor: 6.937

2. Regularization Paths for Generalized Linear Models via Coordinate Descent.

Authors: Jerome Friedman; Trevor Hastie; Rob Tibshirani
Journal: J Stat Softw Date: 2010 Impact factor: 6.440

2 in total

67 in total

1. Improved small-sample estimation of nonlinear cross-validated prediction metrics.

Authors: David Benkeser; Maya Petersen; Mark J van der Laan
Journal: J Am Stat Assoc Date: 2019-10-21 Impact factor: 5.033

2. Acute disseminated encephalomyelitis: prognostic value of early follow-up brain MRI.

Authors: Diederik L H Koelman; David C Benkeser; Joshua P Klein; Farrah J Mateen
Journal: J Neurol Date: 2017-07-10 Impact factor: 4.849

3. Radiomics and machine learning of multisequence multiparametric prostate MRI: Towards improved non-invasive prostate cancer characterization.

Authors: Jussi Toivonen; Ileana Montoya Perez; Parisa Movahedi; Harri Merisaari; Marko Pesola; Pekka Taimen; Peter J Boström; Jonne Pohjankukka; Aida Kiviniemi; Tapio Pahikkala; Hannu J Aronen; Ivan Jambor
Journal: PLoS One Date: 2019-07-08 Impact factor: 3.240

4. Super Learner Analysis of Electronic Adherence Data Improves Viral Prediction and May Provide Strategies for Selective HIV RNA Monitoring.

Authors: Maya L Petersen; Erin LeDell; Joshua Schwab; Varada Sarovar; Robert Gross; Nancy Reynolds; Jessica E Haberer; Kathy Goggin; Carol Golin; Julia Arnsten; Marc I Rosen; Robert H Remien; David Etoori; Ira B Wilson; Jane M Simoni; Judith A Erlen; Mark J van der Laan; Honghu Liu; David R Bangsberg
Journal: J Acquir Immune Defic Syndr Date: 2015-05-01 Impact factor: 3.731

5. Early diagnosis of bloodstream infections in the intensive care unit using machine-learning algorithms.

Authors: Michael Roimi; Ami Neuberger; Anat Shrot; Mical Paul; Yuval Geffen; Yaron Bar-Lavie
Journal: Intensive Care Med Date: 2020-01-07 Impact factor: 17.440

6. Using electronic health records to identify candidates for human immunodeficiency virus pre-exposure prophylaxis: An application of super learning to risk prediction when the outcome is rare.

Authors: Susan Gruber; Douglas Krakower; John T Menchaca; Katherine Hsu; Rebecca Hawrusik; Judith C Maro; Noelle M Cocoros; Benjamin A Kruskal; Ira B Wilson; Kenneth H Mayer; Michael Klompas
Journal: Stat Med Date: 2020-06-24 Impact factor: 2.373

Computationally efficient confidence intervals for cross-validated area under the ROC curve estimates.

1. ROCR: visualizing classifier performance in R.

2. Regularization Paths for Generalized Linear Models via Coordinate Descent.

1. Improved small-sample estimation of nonlinear cross-validated prediction metrics.

2. Acute disseminated encephalomyelitis: prognostic value of early follow-up brain MRI.

3. Radiomics and machine learning of multisequence multiparametric prostate MRI: Towards improved non-invasive prostate cancer characterization.

4. Super Learner Analysis of Electronic Adherence Data Improves Viral Prediction and May Provide Strategies for Selective HIV RNA Monitoring.

5. Early diagnosis of bloodstream infections in the intensive care unit using machine-learning algorithms.

6. Using electronic health records to identify candidates for human immunodeficiency virus pre-exposure prophylaxis: An application of super learning to risk prediction when the outcome is rare.

7. Constrained binary classification using ensemble learning: an application to cost-efficient targeted PrEP strategies.

8. Penalized nonparametric scalar-on-function regression via principal coordinates.

9. Prediction of Occult Invasive Disease in Ductal Carcinoma in Situ Using Deep Learning Features.

10. AUC-Maximizing Ensembles through Metalearning.