Literature DB >> 32165781

Monte Carlo studies of bootstrap variability in ROC analysis with data dependency.

Jin Chu Wu1, Alvin F Martin1, Raghu N Kacker1.   

Abstract

ROC analysis involving two large datasets is an important method for analyzing statistics of interest for decision making of a classifier in many disciplines. And data dependency due to multiple use of the same subjects exists ubiquitously in order to generate more samples because of limited resources. Hence, a two-layer data structure is constructed and the nonparametric two-sample two-layer bootstrap is employed to estimate standard errors of statistics of interest derived from two sets of data, such as a weighted sum of two probabilities. In this article, to reduce the bootstrap variance and ensure the accuracy of computation, Monte Carlo studies of bootstrap variability were carried out to determine the appropriate number of bootstrap replications in ROC analysis with data dependency. It is suggested that with a tolerance 0.02 of the coefficient of variation, 2,000 bootstrap replications be appropriate under such circumstances.

Entities:  

Keywords:  Bootstrap replications; Bootstrap variability; Data dependency; Large datasets; ROC analysis; Standard error

Year:  2018        PMID: 32165781      PMCID: PMC7067283          DOI: 10.1080/03610918.2018.1521974

Source DB:  PubMed          Journal:  Commun Stat Simul Comput        ISSN: 0361-0918            Impact factor:   1.118


  4 in total

1.  The Impact of Data Dependence on Speaker Recognition Evaluation.

Authors:  Jin Chu Wu; Alvin F Martin; Craig S Greenberg; Raghu N Kacker
Journal:  IEEE/ACM Trans Audio Speech Lang Process       Date:  2016-09-30

2.  A method of comparing the areas under receiver operating characteristic curves derived from the same cases.

Authors:  J A Hanley; B J McNeil
Journal:  Radiology       Date:  1983-09       Impact factor: 11.105

3.  Measures, Uncertainties, and Significance Test in Operational ROC Analysis.

Authors:  Jin Chu Wu; Alvin F Martin; Raghu N Kacker
Journal:  J Res Natl Inst Stand Technol       Date:  2011-02-01

4.  A novel measure and significance testing in data analysis of cell image segmentation.

Authors:  Jin Chu Wu; Michael Halter; Raghu N Kacker; John T Elliott; Anne L Plant
Journal:  BMC Bioinformatics       Date:  2017-03-14       Impact factor: 3.169

  4 in total
  1 in total

1.  Diagnostic Accuracy with Total Adenosine Deaminase as a Biomarker for Discriminating Pleural Transudates and Exudates in a Population-Based Cohort Study.

Authors:  Bernardo Henrique Ferraz Maranhão; Cyro Teixeira da Silva Junior; Jorge Luiz Barillo; Carmem Lucia Teixeira de Castro; Joeber Bernardo Soares de Souza; Patricia Siqueira Silva; Roberto Stirbulov
Journal:  Dis Markers       Date:  2021-04-10       Impact factor: 3.434

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.