| Literature DB >> 28694878 |
Peizhou Liao1, Hao Wu2, Tianwei Yu3.
Abstract
The receiver operating characteristic (ROC) curve is an important tool for the evaluation and comparison of predictive models when the outcome is binary. If the class membership of the outcomes are known, ROC can be constructed for a model, and the ROC with greater area under the curve (AUC) indicates better performance. However in practice, imperfect reference standards often exist, in which class membership of every data point are not fully determined. This situation is especially prevalent in high-throughput biomedical data because obtaining perfect reference standards for all data points is either too costly or technically impractical. To construct ROC curves for these data, the common practice is to either ignore the uncertainties in references, or remove data points with high uncertainties. Such approaches may cause bias to the ROC curves and generate misleading results in method evaluation. Here we present a framework to incorporate membership uncertainties into the construction of ROC curve, termed the expected ROC or "eROC" curve. We develop an efficient procedure for the estimation of eROC curve. The advantages of using eROC are demonstrated using simulated and real data.Entities:
Keywords: High-throughput Data; Imperfect Reference Standards; ROC curve
Year: 2016 PMID: 28694878 PMCID: PMC5501420 DOI: 10.1007/s12561-016-9159-7
Source DB: PubMed Journal: Stat Biosci ISSN: 1867-1764