| Literature DB >> 16542249 |
Margaret Sullivan Pepe1, Tianxi Cai, Gary Longton.
Abstract
No single biomarker for cancer is considered adequately sensitive and specific for cancer screening. It is expected that the results of multiple markers will need to be combined in order to yield adequately accurate classification. Typically, the objective function that is optimized for combining markers is the likelihood function. In this article, we consider an alternative objective function-the area under the empirical receiver operating characteristic curve (AUC). We note that it yields consistent estimates of parameters in a generalized linear model for the risk score but does not require specifying the link function. Like logistic regression, it yields consistent estimation with case-control or cohort data. Simulation studies suggest that AUC-based classification scores have performance comparable with logistic likelihood-based scores when the logistic regression model holds. Analysis of data from a proteomics biomarker study shows that performance can be far superior to logistic regression derived scores when the logistic regression model does not hold. Model fitting by maximizing the AUC rather than the likelihood should be considered when the goal is to derive a marker combination score for classification or prediction.Entities:
Mesh:
Substances:
Year: 2006 PMID: 16542249 DOI: 10.1111/j.1541-0420.2005.00420.x
Source DB: PubMed Journal: Biometrics ISSN: 0006-341X Impact factor: 2.571