Literature DB >> 26323286

A comparison of confidence/credible interval methods for the area under the ROC curve for continuous diagnostic tests with small sample size.

Dai Feng1, Giuliana Cortese2, Richard Baumgartner1.   

Abstract

The receiver operating characteristic (ROC) curve is frequently used as a measure of accuracy of continuous markers in diagnostic tests. The area under the ROC curve (AUC) is arguably the most widely used summary index for the ROC curve. Although the small sample size scenario is common in medical tests, a comprehensive study of small sample size properties of various methods for the construction of the confidence/credible interval (CI) for the AUC has been by and large missing in the literature. In this paper, we describe and compare 29 non-parametric and parametric methods for the construction of the CI for the AUC when the number of available observations is small. The methods considered include not only those that have been widely adopted, but also those that have been less frequently mentioned or, to our knowledge, never applied to the AUC context. To compare different methods, we carried out a simulation study with data generated from binormal models with equal and unequal variances and from exponential models with various parameters and with equal and unequal small sample sizes. We found that the larger the true AUC value and the smaller the sample size, the larger the discrepancy among the results of different approaches. When the model is correctly specified, the parametric approaches tend to outperform the non-parametric ones. Moreover, in the non-parametric domain, we found that a method based on the Mann-Whitney statistic is in general superior to the others. We further elucidate potential issues and provide possible solutions to along with general guidance on the CI construction for the AUC when the sample size is small. Finally, we illustrate the utility of different methods through real life examples.

Keywords:  AUC; Bayesian MCMC; Behrens–Fisher problem; Mann–Whitney; Wald statistic; bootstrap; empirical likelihood; higher order asymptotic; jackknife; kernel smoothing; profile likelihood; signed log-likelihood ratio statistic; small sample size

Mesh:

Substances:

Year:  2015        PMID: 26323286     DOI: 10.1177/0962280215602040

Source DB:  PubMed          Journal:  Stat Methods Med Res        ISSN: 0962-2802            Impact factor:   3.021


  4 in total

1.  Confidence intervals of the Mann-Whitney parameter that are compatible with the Wilcoxon-Mann-Whitney test.

Authors:  Michael P Fay; Yaakov Malinovsky
Journal:  Stat Med       Date:  2018-07-08       Impact factor: 2.373

2.  Cerebrospinal Fluid (1,3)-Beta-d-Glucan Testing Is Useful in Diagnosis of Coccidioidal Meningitis.

Authors:  David A Stevens; Yonglong Zhang; Malcolm A Finkelman; Demosthenes Pappagianis; Karl V Clemons; Marife Martinez
Journal:  J Clin Microbiol       Date:  2016-08-24       Impact factor: 5.948

3.  Tournament leave-pair-out cross-validation for receiver operating characteristic analysis.

Authors:  Ileana Montoya Perez; Antti Airola; Peter J Boström; Ivan Jambor; Tapio Pahikkala
Journal:  Stat Methods Med Res       Date:  2018-08-20       Impact factor: 3.021

4.  Association of PET-based stages of amyloid deposition with neuropathological markers of Aβ pathology.

Authors:  Stefan J Teipel; Anna G M Temp; Fedor Levin; Martin Dyrba; Michel J Grothe
Journal:  Ann Clin Transl Neurol       Date:  2020-11-02       Impact factor: 4.511

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.