Stanley S Levinson1. 1. Laboratory Service, Department of Veterans Affairs Medical Center, Louisville, Kentucky 40206, USA. levinson@louisville.edu
Abstract
BACKGROUND: A useful biomarker should improve clinical management in an economically reasonable way. This should be determined from well-designed outcome studies that show clinical management can be altered on the basis of the biomarker. It is important not to confuse results from testing prior to outcome with outcome studies. CONTENT: This chapter reviews statistical tests used to evaluate studies performed prior to final outcome studies and criteria that assess whether or not a biomarker should be considered for outcome studies at each step. I review how relative risk and odds ratios are related to receiver operator characteristic (ROC) plot analysis. Other statistical techniques such as reclassification and the Hosmer Lemeshow test that have been suggested for evaluation of diagnostic usefulness are considered. Weaknesses of each technique are discussed. SUMMARY: I consider ROC analysis to be a mainstay against which other statistical tests of diagnostic performance should be compared. The importance of expressing data in terms of predictive values is emphasized. Tests showing weak diagnostic associations with a disease are difficult to evaluate for outcome study application, because there is usually great difference in between-study variance so that the true relationship between the biomarker, its diagnostic ability, and predictive capability are unclear.
BACKGROUND: A useful biomarker should improve clinical management in an economically reasonable way. This should be determined from well-designed outcome studies that show clinical management can be altered on the basis of the biomarker. It is important not to confuse results from testing prior to outcome with outcome studies. CONTENT: This chapter reviews statistical tests used to evaluate studies performed prior to final outcome studies and criteria that assess whether or not a biomarker should be considered for outcome studies at each step. I review how relative risk and odds ratios are related to receiver operator characteristic (ROC) plot analysis. Other statistical techniques such as reclassification and the Hosmer Lemeshow test that have been suggested for evaluation of diagnostic usefulness are considered. Weaknesses of each technique are discussed. SUMMARY: I consider ROC analysis to be a mainstay against which other statistical tests of diagnostic performance should be compared. The importance of expressing data in terms of predictive values is emphasized. Tests showing weak diagnostic associations with a disease are difficult to evaluate for outcome study application, because there is usually great difference in between-study variance so that the true relationship between the biomarker, its diagnostic ability, and predictive capability are unclear.
Authors: Ewout W Steyerberg; Michael J Pencina; Hester F Lingsma; Michael W Kattan; Andrew J Vickers; Ben Van Calster Journal: Eur J Clin Invest Date: 2011-07-05 Impact factor: 4.686