Literature DB >> 26279737

Computationally efficient confidence intervals for cross-validated area under the ROC curve estimates.

Erin LeDell1, Maya Petersen1, Mark van der Laan1.   

Abstract

In binary classification problems, the area under the ROC curve (AUC) is commonly used to evaluate the performance of a prediction model. Often, it is combined with cross-validation in order to assess how the results will generalize to an independent data set. In order to evaluate the quality of an estimate for cross-validated AUC, we obtain an estimate of its variance. For massive data sets, the process of generating a single performance estimate can be computationally expensive. Additionally, when using a complex prediction method, the process of cross-validating a predictive model on even a relatively small data set can still require a large amount of computation time. Thus, in many practical settings, the bootstrap is a computationally intractable approach to variance estimation. As an alternative to the bootstrap, we demonstrate a computationally efficient influence curve based approach to obtaining a variance estimate for cross-validated AUC.

Entities:  

Keywords:  AUC; ROC; binary classification; confidence intervals; cross-validation; influence curve; influence function; machine learning; model selection; variance estimation

Year:  2015        PMID: 26279737      PMCID: PMC4533123          DOI: 10.1214/15-EJS1035

Source DB:  PubMed          Journal:  Electron J Stat        ISSN: 1935-7524            Impact factor:   1.125


  2 in total

1.  ROCR: visualizing classifier performance in R.

Authors:  Tobias Sing; Oliver Sander; Niko Beerenwinkel; Thomas Lengauer
Journal:  Bioinformatics       Date:  2005-08-11       Impact factor: 6.937

2.  Regularization Paths for Generalized Linear Models via Coordinate Descent.

Authors:  Jerome Friedman; Trevor Hastie; Rob Tibshirani
Journal:  J Stat Softw       Date:  2010       Impact factor: 6.440

  2 in total
  67 in total

1.  Improved small-sample estimation of nonlinear cross-validated prediction metrics.

Authors:  David Benkeser; Maya Petersen; Mark J van der Laan
Journal:  J Am Stat Assoc       Date:  2019-10-21       Impact factor: 5.033

2.  Acute disseminated encephalomyelitis: prognostic value of early follow-up brain MRI.

Authors:  Diederik L H Koelman; David C Benkeser; Joshua P Klein; Farrah J Mateen
Journal:  J Neurol       Date:  2017-07-10       Impact factor: 4.849

3.  Radiomics and machine learning of multisequence multiparametric prostate MRI: Towards improved non-invasive prostate cancer characterization.

Authors:  Jussi Toivonen; Ileana Montoya Perez; Parisa Movahedi; Harri Merisaari; Marko Pesola; Pekka Taimen; Peter J Boström; Jonne Pohjankukka; Aida Kiviniemi; Tapio Pahikkala; Hannu J Aronen; Ivan Jambor
Journal:  PLoS One       Date:  2019-07-08       Impact factor: 3.240

4.  Super Learner Analysis of Electronic Adherence Data Improves Viral Prediction and May Provide Strategies for Selective HIV RNA Monitoring.

Authors:  Maya L Petersen; Erin LeDell; Joshua Schwab; Varada Sarovar; Robert Gross; Nancy Reynolds; Jessica E Haberer; Kathy Goggin; Carol Golin; Julia Arnsten; Marc I Rosen; Robert H Remien; David Etoori; Ira B Wilson; Jane M Simoni; Judith A Erlen; Mark J van der Laan; Honghu Liu; David R Bangsberg
Journal:  J Acquir Immune Defic Syndr       Date:  2015-05-01       Impact factor: 3.731

5.  Early diagnosis of bloodstream infections in the intensive care unit using machine-learning algorithms.

Authors:  Michael Roimi; Ami Neuberger; Anat Shrot; Mical Paul; Yuval Geffen; Yaron Bar-Lavie
Journal:  Intensive Care Med       Date:  2020-01-07       Impact factor: 17.440

6.  Using electronic health records to identify candidates for human immunodeficiency virus pre-exposure prophylaxis: An application of super learning to risk prediction when the outcome is rare.

Authors:  Susan Gruber; Douglas Krakower; John T Menchaca; Katherine Hsu; Rebecca Hawrusik; Judith C Maro; Noelle M Cocoros; Benjamin A Kruskal; Ira B Wilson; Kenneth H Mayer; Michael Klompas
Journal:  Stat Med       Date:  2020-06-24       Impact factor: 2.373

7.  Constrained binary classification using ensemble learning: an application to cost-efficient targeted PrEP strategies.

Authors:  Wenjing Zheng; Laura Balzer; Mark van der Laan; Maya Petersen
Journal:  Stat Med       Date:  2017-04-06       Impact factor: 2.373

8.  Penalized nonparametric scalar-on-function regression via principal coordinates.

Authors:  Philip T Reiss; David L Miller; Pei-Shien Wu; Wen-Yu Hua
Journal:  J Comput Graph Stat       Date:  2016-08-02       Impact factor: 2.302

9.  Prediction of Occult Invasive Disease in Ductal Carcinoma in Situ Using Deep Learning Features.

Authors:  Bibo Shi; Lars J Grimm; Maciej A Mazurowski; Jay A Baker; Jeffrey R Marks; Lorraine M King; Carlo C Maley; E Shelley Hwang; Joseph Y Lo
Journal:  J Am Coll Radiol       Date:  2018-02-02       Impact factor: 5.532

10.  AUC-Maximizing Ensembles through Metalearning.

Authors:  Erin LeDell; Mark J van der Laan; Maya Petersen
Journal:  Int J Biostat       Date:  2016-05-01       Impact factor: 0.968

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.