| Literature DB >> 23658834 |
Bi-Qing Li1, Tao Huang, Jian Zhang, Ning Zhang, Guo-Hua Huang, Lei Liu, Yu-Dong Cai.
Abstract
Colorectal cancer can be grouped into Dukes A, B, C, and D stages based on its developments. Generally speaking, more advanced patients have poorer prognosis. To integrate progression stage prediction systems with recurrence prediction systems, we proposed an ensemble prognostic model for colorectal cancer. In this model, each patient was assigned a most possible stage and a most possible recurrence status. If a patient was predicted to be recurrence patient in advanced stage, he would be classified into high risk group. The ensemble model considered both progression stages and recurrence status. High risk patients and low risk patients predicted by the ensemble model had a significant different disease free survival (log-rank test p-value, 0.0016) and disease specific survival (log-rank test p-value, 0.0041). The ensemble model can better distinguish the high risk and low risk patients than the stage prediction model and the recurrence prediction model alone. This method could be applied to the studies of other diseases and it could significantly improve the prediction performance by ensembling heterogeneous information.Entities:
Mesh:
Year: 2013 PMID: 23658834 PMCID: PMC3642113 DOI: 10.1371/journal.pone.0063494
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1IFS curve showing the overall prediction accuracies versus gene numbers for CRC stages prediction model.
The IFS curves were drawn based on the data in File S2. The overall prediction accuracy reached the peak when the number of genes was 25. The 25 genes thus obtained were used to compose the optimal gene set for the CRC stage predictor.
Figure 2IFS curve showing the MCC versus gene numbers for CRC recurrence prediction model.
The IFS curves were drawn based on the data in File S4. The MCC reached the peak when the number of genes was 110. The 110 genes thus obtained were used to compose the optimal gene set for the CRC recurrence predictor.
Survival time comparison of four different criteria for evaluating risk.
| DFS logrank p-value | DSS logrank p-value | |
| Proposal 1 | 0.0016 | 0.0041 |
| Proposal 2 | 0.0021 | 0.0055 |
| Proposal 3 | 0.3313 | 0.2556 |
| Proposal 4 | 0.5525 | 0.5606 |
DFS: disease-free survival. DSS: disease specific survival.
Figure 3Survival curve for the first proposal.
(A) Disease free survival (log-rank test, p-value = 0.0016). (B) Disease specific survival (log-rank test, p-value = 0.0041).
Figure 4Protein-protein interaction network between stage related genes and recurrence related genes.
Yellow round rectangles represent stage related genes while red elipses represent recurrence related genes. 20 of the 25 stage related genes and 61 of the 110 recurrence related genes were presented.