
An efficient variance estimator of AUC and its applications to binary classification.

Qing Wang, Alexandria Guo.

Abstract

The area under the receiver operating characteristic (ROC) curve, AUC, is one of the most commonly used measures for evaluating the performance of a binary classifier. Because of sampling variation, the model with the largest observed AUC is not necessarily optimal, so it is crucial to assess the variability of the AUC estimate. We extend the proposal of Wang and Lindsay and devise an unbiased variance estimator for the AUC estimate, which has the form of a two-sample U-statistic. The proposal can be easily generalized to estimate the variance of a K-sample U-statistic (K ≥ 2). To make the variance estimator more practical, we employ a computationally efficient partition-resampling scheme. Simulation studies suggest that the proposed AUC variance estimator performs comparably to, or much better than, the jackknife and bootstrap variance estimators, while running about 10 to 30 times faster. In practice, the proposal can be used in the one-standard-error rule for model selection or to construct an asymptotic confidence interval for the AUC in binary classification. In addition to the simulation studies, we illustrate its practical applications using two real datasets from the medical sciences.
© 2020 John Wiley & Sons, Ltd.
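
As a rough illustration of the quantities the abstract refers to, the sketch below computes the AUC as a two-sample U-statistic (kernel 1 if a positive score exceeds a negative score, 0.5 for ties, 0 otherwise), attaches a standard error, and applies the one-standard-error rule. The standard error here uses a DeLong-style structural-components estimate as a stand-in; it is not the partition-resampling estimator proposed in the paper, whose details the abstract does not give. The function names and the `(name, complexity, auc, se)` tuple layout are hypothetical.

```python
import math
from statistics import mean, variance


def auc_kernel(x, y):
    """U-statistic kernel for AUC: 1 if x > y, 0.5 if tied, else 0."""
    if x > y:
        return 1.0
    if x == y:
        return 0.5
    return 0.0


def auc_with_se(pos_scores, neg_scores):
    """Return (AUC, standard error) from classifier scores.

    AUC is the mean of the kernel over all (positive, negative) pairs.
    The variance is a DeLong-style estimate (an assumption made for this
    sketch, NOT the estimator proposed in the paper).
    """
    m, n = len(pos_scores), len(neg_scores)
    if m < 2 or n < 2:
        raise ValueError("need at least two scores in each class")
    rows = [[auc_kernel(x, y) for y in neg_scores] for x in pos_scores]
    v10 = [mean(row) for row in rows]  # one component per positive case
    v01 = [mean(rows[i][j] for i in range(m)) for j in range(n)]  # per negative
    auc = mean(v10)  # rows have equal length, so this is the overall mean
    var = variance(v10) / m + variance(v01) / n
    return auc, math.sqrt(var)


def asymptotic_ci(auc, se, z=1.96):
    """Normal-approximation interval, clipped to [0, 1]."""
    return max(0.0, auc - z * se), min(1.0, auc + z * se)


def one_se_choice(candidates):
    """One-standard-error rule over (name, complexity, auc, se) tuples.

    Picks the least complex model whose AUC is within one standard error
    of the best observed AUC.
    """
    best = max(candidates, key=lambda c: c[2])
    threshold = best[2] - best[3]
    eligible = [c for c in candidates if c[2] >= threshold]
    return min(eligible, key=lambda c: c[1])[0]
```

For example, `auc_with_se([0.9, 0.8, 0.4], [0.5, 0.3, 0.1])` gives an AUC of 8/9, since eight of the nine positive-negative pairs are ordered correctly.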

Keywords:  AUC; ROC; U-statistic; binary classification; one-standard-error rule; variance estimation

Year:  2020        PMID: 32914457     DOI: 10.1002/sim.8725

Source DB:  PubMed          Journal:  Stat Med        ISSN: 0277-6715            Impact factor:   2.373


  2 in total

1.  Detection of changes in literary writing style using N-grams as style markers and supervised machine learning.

Authors:  Germán Ríos-Toledo; Juan Pablo Francisco Posadas-Durán; Grigori Sidorov; Noé Alejandro Castro-Sánchez
Journal:  PLoS One       Date:  2022-07-20       Impact factor: 3.752

2.  MicrobioSee: A Web-Based Visualization Toolkit for Multi-Omics of Microbiology.

Authors:  JinHui Li; Yimeng Sang; Sen Zeng; Shuming Mo; Zufan Zhang; Sheng He; Xinying Li; Guijiao Su; Jianping Liao; Chengjian Jiang
Journal:  Front Genet       Date:  2022-04-08       Impact factor: 4.772

