| Literature DB >> 33928165 |
Danyang Tong1, Yu Tian1, Qiancheng Ye1, Jun Li2, Kefeng Ding2, Jingsong Li1,3.
Abstract
BACKGROUND: Colon cancer has high morbidity and mortality rates among cancers. Existing clinical staging systems cannot accurately assess the prognostic risk of colon cancer patients. This study was aimed at improving the prognostic performance of the colon cancer clinical staging system through knowledge-based clinical-molecular integrated analysis.Entities:
Mesh:
Year: 2021 PMID: 33928165 PMCID: PMC8051523 DOI: 10.1155/2021/9987819
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Figure 1The overall pipeline of the study. Step 1: calculation of PDS; step 2: identification of molecular prognostic factors; step 3: construction of clinical-molecular integrative prognostic model; step 4: assessment of the integrative prognostic model.
Detailed information of the data used for analysis.
| Characteristic | Discovery set | Validation set | Reference set |
|---|---|---|---|
| TCGA-COAD | CPTAC | Normal samples | |
| Patients, | 374 | 98 | 41 |
| Survival status, | |||
| Alive | 293 (78.3) | 90 (91.8) | 29 (70.7) |
| Dead | 81 (21.7) | 8 (8.2) | 12 (29.3) |
| Agea in years, mean (SD, range) | 66.75 (12.73, 31-90) | 65.43 (11.56, 35-93) | 70.34 (13.23, 40-90) |
| Gender, | |||
| Male | 199 (53.2) | 41 (41.8) | 20 (48.8) |
| Female | 175 (46.8) | 57 (58.2) | 21 (51.2) |
| Overall survival time in months, mean (median, range) | 30.24 (24.27, 0.47-150.07) | 27.96 (30, 1-44) | 27.66 (24.37, 0-101.40) |
| T stage, | Not available | ||
| T1 | 9 (2.4) | 0 (0) | |
| T2 | 65 (17.4) | 12 (12.2) | |
| T3 | 258 (69.0) | 73 (74.5) | |
| T4 | 42 (11.2) | 13 (13.3) | |
| N stage, | Not available | ||
| N0 | 226 (60.4) | 52 (53.1) | |
| N1 | 84 (22.5) | 31 (31.6) | |
| N2 | 64 (17.1) | 15 (15.3) | |
| M stage, n (%) | Not available | ||
| M0 | 315 (84.2) | 91 (92.9) | |
| M1 | 59 (15.8) | 7 (7.1) | |
| Number of genes, | 10877 | 10877 | 10877 |
aThe characteristic “Age” refers to the age at initial diagnosis in the discovery set and reference set but refers to the age at procurement in the validation set. SD: standard deviation.
Description of the pathways identified by the random survival forest.
| KEGG pathway ID | Pathway name | Number of genes involved in the pathway in this study |
|---|---|---|
| hsa00450 | Selenocompound metabolism—Homo sapiens (human) | 13 |
| hsa00532 | Glycosaminoglycan biosynthesis—chondroitin sulfate/dermatan sulfate—Homo sapiens (human) | 16 |
| hsa02010 | ABC transporters—Homo sapiens (human) | 24 |
| hsa04380 | Osteoclast differentiation—Homo sapiens (human) | 105 |
| hsa04614 | Renin-angiotensin system—Homo sapiens (human) | 14 |
| hsa04750 | Inflammatory mediator regulation of TRP channels—Homo sapiens (human) | 65 |
| hsa04911 | Insulin secretion—Homo sapiens (human) | 41 |
| hsa04971 | Gastric acid secretion—Homo sapiens (human) | 40 |
| hsa04975 | Fat digestion and absorption—Homo sapiens (human) | 13 |
| hsa05032 | Morphine addiction—Homo sapiens (human) | 38 |
| hsa05133 | Pertussis—Homo sapiens (human) | 56 |
| hsa05152 | Tuberculosis—Homo sapiens (human) | 128 |
| hsa05167 | Kaposi sarcoma-associated herpesvirus infection—Homo sapiens (human) | 148 |
| hsa05321 | Inflammatory bowel disease (IBD)—Homo sapiens (human) | 34 |
Bias-corrected C-indexes of 27 different clinical-molecular integrated models.
| Covariates used in the model | Bias-corrected Harrell's C-index (±95% CI) |
|---|---|
| T, N, M, hsa00532, hsa04911, hsa05133, hsa05152 | 0.775 ± 0.0038 |
| T, N, M, hsa00532 | 0.773 ± 0.0038 |
| T, N, M, hsa00532, hsa04911, hsa05133 | 0.773 ± 0.0038 |
| T, N, M, hsa02010, hsa05152, hsa05321 | 0.773 ± 0.0038 |
| T, N, M, hsa02010, hsa05167, hsa05321 | 0.772 ± 0.0037 |
| T, N, M, hsa00532, hsa04380, hsa04911, hsa05133 | 0.772 ± 0.0039 |
| T, N, M, hsa00532, hsa05133, hsa05152 | 0.772 ± 0.0040 |
| T, N, M, hsa00532, hsa04380, hsa04971, hsa05133 | 0.771 ± 0.0039 |
| T, N, M, hsa00532, hsa05133, hsa05167 | 0.771 ± 0.0040 |
| T, N, M, hsa00532, hsa04975, hsa05133 | 0.770 ± 0.0041 |
| T, N, M, hsa02010, hsa04911, hsa05133, hsa05152 | 0.768 ± 0.0040 |
| T, N, M, hsa02010, hsa05133, hsa05152 | 0.768 ± 0.0040 |
| T, N, M, hsa00532, hsa04971, hsa05133 | 0.767 ± 0.0038 |
| T, N, M, hsa00532, hsa04380, hsa05133 | 0.766 ± 0.0041 |
| T, N, M, hsa02010, hsa04911, hsa05133 | 0.764 ± 0.0038 |
| T, N, M, hsa02010, hsa05133, hsa05167 | 0.764 ± 0.0040 |
| T, N, M, hsa00532, hsa04750, hsa05133 | 0.764 ± 0.0041 |
| T, N, M, hsa02010, hsa04911 | 0.763 ± 0.0037 |
| T, N, M, hsa02010, hsa04380, hsa04911, hsa05133 | 0.763 ± 0.0040 |
| T, N, M, hsa05133, hsa05152 | 0.761 ± 0.0042 |
| T, N, M, hsa02010, hsa04750, hsa05133 | 0.759 ± 0.0040 |
| T, N, M, hsa00450, hsa04911 | 0.758 ± 0.0036 |
| T, N, M, hsa04380, hsa05133 | 0.758 ± 0.0043 |
| T, N, M, hsa04911, hsa05133 | 0.756 ± 0.0039 |
| T, N, M, hsa00450, hsa04911, hsa05133, hsa05152 | 0.756 ± 0.0040 |
| T, N, M, hsa04380, hsa04911, hsa05133 | 0.754 ± 0.0041 |
| T, N, M, hsa05133, hsa05167 | 0.754 ± 0.0042 |
| T, N, M | 0.746 ± 0.0040 |
CI: confidence interval.
Figure 2Observation of the PDS of the pathway hsa00532 in the discovery set. (a) Density distribution of the PDS of the pathway hsa00532 in the discovery set. (b) KM curve plotted based on two groups of patients in the discovery set divided by the PDS with a threshold of 0.6779. (c) KM curve plotted based on three groups of patients in the discovery set divided by the PDS with thresholds of 0.5 and 0.6779. (d) KM curve plotted based on two groups of stage II patients in the discovery set divided by the PDS with a threshold of 0.6779.
Regression coefficients of the knowledge-based clinical-molecular integrated prognostic model.
| Covariate | Coefficient ± SE | HR | 95% CI |
|
|---|---|---|---|---|
| T stage | ||||
| T2 | −1.62 ± 1.42 | 0.20 | 0.012-3.18 | .25 |
| T3 | 0.38 ± 1.02 | 1.47 | 0.20-10.79 | .71 |
| T4 | 1.22 ± 1.05 | 3.38 | 0.43-26.37 | .25 |
| N stage | ||||
| N1 | −0.01 ± 0.31 | 0.99 | 0.54-1.80 | .96 |
| N2 | 0.67 ± 0.30 | 1.95 | 1.07-3.54 | .03 |
| M stage | ||||
| M1 | 1.02 ± 0.28 | 2.78 | 1.60-4.82 | <.001 |
| hsa00532∗ | 2.86 ± 1.42 | 17.53 | 1.08-283.24 | .04 |
SE: standard error; HR: hazard ratio; CI: confidence interval. ∗Covariate hsa00532 used in the model is the PDS of pathway has00532.
Figure 3Nomogram for predicting the 3-year overall survival of colon cancer patients. To use the nomogram, first, the position of each variable of an individual patient on the corresponding axis should be found. Next, a line to the point axis for the number of points should be drawn upwards to determine the number of points of each variable. Then, the points from all the variables should be added. Finally, a line from the total point axis should be drawn downward to determine the likelihood of 3-year survival probabilities at the lower line of the nomogram.
Figure 4C-index of our pathway-based integrated model and other models for patients in the discovery set (a) and validation set (b).
Figure 5Calibration plot of our pathway-based integrated model (a) and clinical model (b) at the 3-year time point.
Figure 6C-index of our pathway-based integrated model and other models for nonmetastatic patients in the discovery set (a) and the validation set (b).