Nancy A Obuchowski1. 1. Department of Quantitative Health Sciences/Wb4, The Cleveland Clinic Foundation, 9500 Euclid Ave, Cleveland, OH 44195, USA. nobuchow@bio.ri.ccf.org
Abstract
RATIONALE AND OBJECTIVES: Investigators often need to assess the accuracies of diagnostic tests when the gold standard is not binary-scale. The objective of this article is to describe nonparametric estimators of diagnostic test accuracy when the gold standard is continuous, ordinal, and nominal scale. MATERIALS AND METHODS: A nonparametric method of estimating and comparing the area under receiver operating characteristic (ROC) curves, proposed by DeLong et al, is extended to situations in which the gold standard is not binary. Two examples illustrate the methods. RESULTS: Measures of diagnostic test accuracy, their variance, and tests for comparing two diagnostic tests' accuracies in paired designs are presented for situations in which the gold standard is continuous, ordinal, and nominal scale. These summary measures of diagnostic test accuracy are analogous in form and interpretation to the area under the ROC curve. CONCLUSION: Dichotomizing the outcomes of a gold standard so that traditional ROC methods can be applied can lead to bias. The methods described here are useful for assessing and comparing summary test accuracy when the gold standard is not binary scale. They have limitations similar to other summary indices.
RATIONALE AND OBJECTIVES: Investigators often need to assess the accuracies of diagnostic tests when the gold standard is not binary-scale. The objective of this article is to describe nonparametric estimators of diagnostic test accuracy when the gold standard is continuous, ordinal, and nominal scale. MATERIALS AND METHODS: A nonparametric method of estimating and comparing the area under receiver operating characteristic (ROC) curves, proposed by DeLong et al, is extended to situations in which the gold standard is not binary. Two examples illustrate the methods. RESULTS: Measures of diagnostic test accuracy, their variance, and tests for comparing two diagnostic tests' accuracies in paired designs are presented for situations in which the gold standard is continuous, ordinal, and nominal scale. These summary measures of diagnostic test accuracy are analogous in form and interpretation to the area under the ROC curve. CONCLUSION: Dichotomizing the outcomes of a gold standard so that traditional ROC methods can be applied can lead to bias. The methods described here are useful for assessing and comparing summary test accuracy when the gold standard is not binary scale. They have limitations similar to other summary indices.
Authors: Ben Van Calster; Yvonne Vergouwe; Caspar W N Looman; Vanya Van Belle; Dirk Timmerman; Ewout W Steyerberg Journal: Eur J Epidemiol Date: 2012-10-07 Impact factor: 8.082
Authors: Si Eun Lee; Kyunghwa Han; Jin Hur; Young Jin Kim; Hye-Jeong Lee; Yoo Jin Hong; Dong Jin Im; Byoung Wook Choi Journal: Eur Radiol Date: 2017-12-08 Impact factor: 5.315
Authors: Samer S El-Kamary; Mona M Mohamed; Maissa El-Raziky; Michelle D Shardell; Olfat G Shaker; Wafaa A ElAkel; Gamal Esmat Journal: Liver Int Date: 2013-03-20 Impact factor: 5.828