| Literature DB >> 19455238 |
Sreelatha Meleth1, Chakrapani Chatla, Venkat R Katkoori, Billie Anderson, James M Hardin, Nirag C Jhala, Al Bartolucci, William E Grizzle, Upender Manne.
Abstract
BACKGROUND: Although a majority of studies in cancer biomarker discovery claim to use proportional hazards regression (PHREG) to the study the ability of a biomarker to predict survival, few studies use the predicted probabilities obtained from the model to test the quality of the model. In this paper, we compared the quality of predictions by a PHREG model to that of a linear discriminant analysis (LDA) in both training and test set settings.Entities:
Keywords: colorectal cancer; linear discriminant analysis; predictive models; proportional hazards regression; survival
Year: 2007 PMID: 19455238 PMCID: PMC2675853
Source DB: PubMed Journal: Cancer Inform ISSN: 1176-9351
Predicted Probabilities for the Combinations of Variables in PHREG and LDA.
| I & II + Low + Bcl-2 negative | 0.69 | 0.72 |
| III & IV + Low + Bcl-2 negative | 0.27 | 0.35 |
| I & II + High + Bcl-2 negative | 0.55 | 0.38 |
| I & II + Low + Bcl-2 positive | 0.78 | 0.86 |
| III & IV + High + Bcl-2 negative | 0.11 | 0.07 |
| I & II + High + Bcl-2 positive | 0.67 | 0.53 |
| III & IV + Low + Bcl-2 positive | 0.41 | 0.47 |
| III & IV + High + Bcl-2 positive | 0.23 | 0.08 |
| 76 | 76 | |
| 73 | 73 | |
| 25.5 | 25.5 | |
a Variables were selected by the backward selection procedure in Proc PHREG in SAS.
b Tumor stage was dichotomized as node negative (Stage I + II) and node positive (Stage III + IV).
c Tumor differentiation dichotomized as low grade and high grade.
d Phenotypic expression of Bcl-2 dichotomized as negative and positive as described in the methods section
e Predicted outcome of survival probability beyond 5 years after surgery for the combination of variables.
f Variables were selected using iterative bootstrapping sampling method described in the methods section.
g Two variable selection procedures were tried (iterative bootstrapping and the backward selection) and the tumor stage was excluded from both the models.
Significant Variables Obtained in a Proportional Hazards Regression Model.
| Tumor stage | 3.60 (2.58 – 5.027) | <0.0001 | |||
| Bcl-2 expression | 0.67 (0.493 – 0.92) | <0.0138 | Bcl-2 expression | 0.58 (0.42 – 0.79) | 0.0006 |
| Tumor differentiation | 1.63 (1.11 – 2.38) | 0.0112 | Tumor differentiation | 2.03 (1.39 – 2.95) | 0.0002 |
HR: hazard ratio; CI: confidence interval
a Variables were selected using the stepwise discriminant analysis as described in the methods section.
b Stage dichotomized as node negative (Stage I +II) and node positive (Stage III + IV).
c Phenotypic expression of Bcl-2 dichotomized as negative and positive.
d Tumor differentiation dichotomized as low grade and high grade.
Figure 1.Accuracy of Prediction in Test Set versus Bin Size in Training Set. The X-axis represents proportion of true deaths and true survivors accurately predicted in each bin in the test data set. The y-axis represents the proportion of individuals in each bin in the training data set. A bin is a particular combination of the variables in the model; e.g. All individuals with tumor stage I or II, negative for Bcl-2 expression, and with well differentiated tumors belong to one bin.