| Literature DB >> 22824262 |
Hung-Chia Chen1, Ralph L Kodell, Kuang Fu Cheng, James J Chen.
Abstract
BACKGROUND: Cancer survival studies are commonly analyzed using survival-time prediction models for cancer prognosis. A number of different performance metrics are used to ascertain the concordance between the predicted risk score of each patient and the actual survival time, but these metrics can sometimes conflict. Alternatively, patients are sometimes divided into two classes according to a survival-time threshold, and binary classifiers are applied to predict each patient's class. Although this approach has several drawbacks, it does provide natural performance metrics such as positive and negative predictive values to enable unambiguous assessments.Entities:
Mesh:
Year: 2012 PMID: 22824262 PMCID: PMC3410808 DOI: 10.1186/1471-2288-12-102
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.615
Performance metrics of the five prediction models for the breast cancer data: Somers’ correlation (D); estimates of the hazard ratio (HR) with 95% confidence limits (CI), and p-value for Cox Models I and II; and Rfor Cox Model I and Brier score (IBS) for Cox Model II
| A | −0.333 | 0.311 | 2.37 | 1.21 | 4.62 | 0.012 | 4.85 | 1.38 | 17.04 | 0.014 | 0.124 |
| B | −0.310 | 0.114 | 1.71 | 0.87 | 3.39 | 0.123 | 1.99 | 0.75 | 5.27 | 0.167 | 0.148 |
| C | −0.099 | 0.009 | 1.08 | 0.75 | 1.56 | 0.669 | 2.83 | 1.01 | 7.94 | 0.048 | 0.136 |
| D | −0.310 | 0.248 | 2.50 | 1.12 | 5.57 | 0.026 | 2.36 | 0.82 | 6.76 | 0.111 | 0.144 |
| E | −0.111 | 0.058 | 1.24 | 0.84 | 1.83 | 0.280 | 2.83 | 1.01 | 7.94 | 0.048 | 0.136 |
Figure 1ROC curves for patients’ survival with AUC measures evaluated at 4, 5, and 6 years metastasis-free times for the five models.
P-values of randomization test based on 10,000 permutations for the three measures: Somers’ correlation (D), p-value of the hazard ratio, and Rfrom fitting the Cox proportional hazards model using the risk scores as independent variable
| A | 0.0369 | 0.0225 | 0.0261 | 0.0369 | 0.0225 | 0.0261 |
| B | 0.0707 | 0.3622 | 0.3802 | 0.0499 | 0.1747 | 0.1788 |
| C | 0.2684 | 0.6027 | 0.6084 | 0.3146 | 0.6761 | 0.6838 |
| D | 0.0487 | 0.0718 | 0.0661 | 0.0387 | 0.0445 | 0.0383 |
| E | 0.2534 | 0.3170 | 0.3255 | 0.2588 | 0.3078 | 0.3166 |
P-values of randomization test based on 10,000 permutations for the AUC measures evaluated at 4, 5, and 6 years metastasis-free times
| A | 0.1134 | 0.0206 | 0.0508 | 0.1123 | 0.0195 | 0.0508 |
| B | 0.2692 | 0.2762 | 0.1293 | 0.2369 | 0.1898 | 0.0785 |
| C | 0.3786 | 0.2523 | 0.1920 | 0.4024 | 0.3083 | 0.2680 |
| D | 0.2243 | 0.2109 | 0.1020 | 0.1687 | 0.1503 | 0.0635 |
| E | 0.3686 | 0.3443 | 0.2991 | 0.3470 | 0.3007 | 0.3227 |
The 97 total patients were randomly split into a training set and a test set
| A | 0.7630 | 0.5032 | 0.5164 |
| B | 0.5408 | 0.4628 | 0.4778 |
| C | 0.2876 | 0.2844 | 0.2964 |
| D | 0.5810 | 0.4286 | 0.4382 |
| E | 0.3302 | 0.2600 | 0.2702 |
The numbers of patients for the training and test sets were 49 and 48, respectively. The values are the proportion that the estimated p-values were less than or equal to 0.05 from a total of 10,000 computations, based on 5,000 randomly splits.
Effect of training and test set sizes on the power for Model A
| 78:19 | 0.3945 | 0.2623 | 0.2819 |
| 65:32 | 0.6000 | 0.4164 | 0.4312 |
| 49:48* | 0.7630 | 0.5032 | 0.5164 |
| 32:65 | 0.7746 | 0.5226 | 0.5294 |
| 25:72 | 0.7042 | 0.5166 | 0.5232 |
| 19:78 | 0.563 | 0.4058 | 0.412 |
* From Table 4.
The 97 total patients were randomly split into a training set and a test set. The numbers of patients for the training set investigated were 78, 65, 32, 25, and 19. The values are the proportion that the estimated p-values were less than or equal to 0.05 from a total of 10,000 computations, based on 10,000 randomly splits.
The numbers of misclassifications for five binary classifiers using the support vector machine (SVM), random forest (RF) and logistic regression (LR) classification algorithms
| | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| SVM | 4 | high | 28 | 10 | 10 | 6 | 7 | 7 | 7 |
| | | low | 50 | 9 | 1 | 2 | 3 | 2 | 3 |
| | 5 | high | 34 | 12 | 9 | 1 | 3 | 2 | 5 |
| | | low | 44 | 7 | 1 | 2 | 2 | 2 | 2 |
| | 6 | high | 43 | 14 | 5 | 2 | 3 | 2 | 2 |
| | | low | 35 | 5 | 1 | 1 | 2 | 1 | 1 |
| RF | 4 | high | 28 | 10 | 10 | 6 | 5 | 5 | 7 |
| | | low | 50 | 9 | 2 | 1 | 3 | 1 | 3 |
| | 5 | high | 34 | 12 | 10 | 1 | 5 | 5 | 6 |
| | | low | 44 | 7 | 1 | 2 | 2 | 0 | 2 |
| | 6 | high | 43 | 14 | 6 | 1 | 5 | 3 | 6 |
| | low | 35 | 5 | 2 | 1 | 2 | 1 | 2 | |
| LR | 4 | high | 28 | 10 | 7 | 5 | 6 | 6 | 6 |
| | | low | 50 | 9 | 1 | 2 | 3 | 4 | 4 |
| | 5 | high | 34 | 12 | 8 | 3 | 6 | 7 | 8 |
| | | low | 44 | 7 | 0 | 4 | 2 | 3 | 1 |
| | 6 | high | 43 | 14 | 6 | 1 | 3 | 3 | 7 |
| low | 35 | 5 | 0 | 3 | 2 | 2 | 1 | ||
The binary classifiers are developed based on the 4-year, 5-year, and 6-year metastasis-free times to define the high and low risk classes.
Figure 2Plot of metastasis-free survival time (vertical axis) of the 19 test patients versus the rank of the estimated risk score (horizontal axis) from the five risk prediction models. The patients were numbered according to the ranks of their survival times. The patients on the left have high estimated risk scores (low ranks) and on the right have low estimated risk scores. Performance of a risk prediction model can be assessed by analyzing the relationship between survival times and risk scores (see text). For example, the horizontal line represents a cutoff at the 5 year metastasis-free time and the vertical line is the median of the training scores. A ROC curve can be constructed by enumerating all 19 vertical cutoffs and AUC can be computed (Figure 1).
Figure 3Loge-rank p-values of the five models for all possible sizes of low-risk group.