| Literature DB >> 31422414 |
Liang Song1, Xiao-Yan Wang2, Xiao-Feng He3.
Abstract
BACKGROUND The aim of the study was to identify a multigene prognostic factor in patients with gastric cancer (GC). MATERIAL AND METHODS Random survival forest (RSF) was performed to screen survival-related genes and develop a multigene combination based on the cumulative hazard function of each GC patient in TCGA-STAD and GSE15459. Kaplan-Meier curve and univariate and multivariable Cox proportional hazards regression model were applied to evaluate the prognostic performance of the 5-gene combination. C-index was used to compare the prognostic performance of the 5-gene combination and another 9-gene signature in GC. Gene set enrichment analysis (GSEA) was conducted. RESULTS We obtained 19 survival-related genes through univariate Cox proportional hazards analysis in the training set, 5 of which were identified and were used to develop a 5-gene combination through RSF. Patients in the 5-gene combination low-risk group had better overall survival (OS) than those in the 5-gene combination high-risk group, and the 5-gene combination was demonstrated to be an independent prognostic factor in patients with GC. The 5-gene combination outperformed the 9-gene signature in predicting the OS of GC patients, and it might affect the prognosis of GC patients through E2F signaling, MYC signaling, and G2M checkpoint. CONCLUSIONS We introduce a 5-gene combination that can predict the survival of GC patients and might be an independent prognostic factor in GC.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31422414 PMCID: PMC6713029 DOI: 10.12659/MSM.914815
Source DB: PubMed Journal: Med Sci Monit ISSN: 1234-1010
Figure 1The prognosis role of the 5-gene combination in patients with gastric cancer. (A) Optimal cutoff to classify gastric cancer in to the 5-gene combination low-risk group and high-risk group. (B) The overall survival of gastric patients in the 5-gene combination low-risk group and 5-gene signature high-risk group in the training set. (C) The overall survival of gastric patients in the 5-gene combination low-risk group and 5-gene signature high-risk group in the test set. (D) The overall survival of gastric patients in the 5-gene combination low-risk group and 5-gene signature high-risk group in the validation set.
Figure 2Comparison of the C-index of the 5-gene combination and the 9-gene signature in the training set, test, and validation set.
Genes sets enriched in the 5-gene combination low-risk group.
| Gene set | SIZE | ES | NES | NOM P value | FDR |
|---|---|---|---|---|---|
| E2F signaling | 193 | 0.7650 | 1.9174 | 0.0038 | 0.0408 |
| MYC signaling | 58 | 0.8257 | 1.8950 | <0.0001 | 0.0285 |
| G2M checkpoint | 194 | 0.6679 | 1.8672 | 0.0058 | 0.0282 |
ES – enrichment score; NES – normalized enrichment score; NOM P value – normal P value; FDR q value – false discovery rate
Characteristics of gastric patients in the training set and test.
| No. of samples | Training set | Test set | P value | GSE15459 |
|---|---|---|---|---|
| n=194 | n=194 | n=192 | ||
| Median age in years (range) | 66 (34–86) | 68 (30–90) | P>0.05 | 66.55 (23.4–92.4) |
| Female (%) | 69 (35.57) | 67 (34.54) | P>0.05 | 67 (34.9) |
| Male (%) | 125 (64.43) | 127 (65.46) | P>0.05 | 125 (65.1) |
| Stage I (%) | 24 (12.37) | 27 (13.92) | P>0.05 | 31 (16.15) |
| Stage II (%) | 58 (29.9) | 63 (32.47) | P>0.05 | 29 (15.1) |
| Stage III (%) | 91 (46.91) | 74 (38.14) | P>0.05 | 72 (37.5) |
| stage IV (%) | 19 (9.79) | 19 (9.79) | P>0.05 | 60 (31.25) |
| Grade 1 (%) | 4 (2.06) | 6 (3.09) | P>0.05 | NA |
| Grade 2 (%) | 69 (35.57) | 68 (35.05) | P>0.05 | NA |
| Grade 3 (%) | 118 (60.82) | 114 (58.76) | P>0.05 | NA |
| No. of deaths (%) | 71 (36.6) | 86 (44.33) | P>0.05 | 95 (49.48) |
Genes that were associated with the overall survival of patients with gastric cancer patients.
| Genes | Coefficients | HR | LCI | UCI | p Value | FDR |
|---|---|---|---|---|---|---|
| FLJ16779 | 0.3368 | 1.4005 | 1.2126 | 1.6175 | <0.0001 | 0.0157 |
| FRMD7 | 0.2597 | 1.2966 | 1.1537 | 1.4571 | <0.0001 | 0.0441 |
| CPNE8 | 0.3698 | 1.4474 | 1.2220 | 1.7145 | <0.0001 | 0.0637 |
| APOD | 0.3599 | 1.4332 | 1.2159 | 1.6894 | <0.0001 | 0.0608 |
| PRR20A | 0.2271 | 1.2549 | 1.1338 | 1.3890 | <0.0001 | 0.0398 |
| LOC113230 | −0.3293 | 0.7195 | 0.6170 | 0.8389 | <0.0001 | 0.0906 |
| NRP1 | 0.3764 | 1.4570 | 1.2298 | 1.7262 | <0.0001 | 0.0462 |
| MAGED4B | 0.3286 | 1.3890 | 1.1914 | 1.6193 | <0.0001 | 0.0922 |
| SLC22A16 | 0.3050 | 1.3566 | 1.1763 | 1.5644 | <0.0001 | 0.0940 |
| ZNF804B | 0.2239 | 1.2510 | 1.1322 | 1.3822 | <0.0001 | 0.0371 |
| GABRG1 | 0.2633 | 1.3013 | 1.1667 | 1.4513 | <0.0001 | 0.0077 |
| TBX22 | 0.2275 | 1.2554 | 1.1301 | 1.3946 | <0.0001 | 0.0767 |
| PRTG | 0.3190 | 1.3757 | 1.1851 | 1.5971 | <0.0001 | 0.0948 |
| SLC7A2 | 0.3209 | 1.3783 | 1.1874 | 1.5999 | <0.0001 | 0.0840 |
| CGB5 | 0.3143 | 1.3693 | 1.1959 | 1.5679 | <0.0001 | 0.0184 |
| CGB1 | 0.2726 | 1.3134 | 1.1671 | 1.4781 | <0.0001 | 0.0207 |
| SERPINE1 | 0.3661 | 1.4421 | 1.2303 | 1.6902 | <0.0001 | 0.0213 |
| PCDHB5 | 0.3158 | 1.3714 | 1.1866 | 1.5849 | <0.0001 | 0.0647 |
| GPX3 | 0.3591 | 1.4321 | 1.2136 | 1.6899 | <0.0001 | 0.0719 |
HR – hazards ratio; LCI – lower limit of confidence interval; UCI – upper limit of confidence interval; FDR – false discovery rate.
Univariate and multivariable Cox proportional hazards regression model of the overall survival of gastric cancer patients in the training set.
| Variable | Univariate analysis | Multivariable analysis | ||||||
|---|---|---|---|---|---|---|---|---|
| HR | LCI | UCI | P value | HR | LCI | UCI | P value | |
| 5-gene combination | 1.1136 | 1.0952 | 1.1324 | <0.0001 | 1.1242 | 1.1033 | 1.1456 | <0.0001 |
| Age | 1.0117 | 0.9890 | 1.0350 | 0.3158 | 1.0415 | 1.0147 | 1.0690 | 0.0022 |
| Gender Male | 1.5118 | 0.9007 | 2.5374 | 0.1178 | 0.8170 | 0.4615 | 1.4462 | 0.4878 |
| Gender Female | Reference | Reference | ||||||
| Pathologic stage | 1.1777 | 1.0381 | 1.3362 | 0.0111 | 1.2341 | 1.0807 | 1.4092 | 0.0019 |
| Grade | 1.4874 | 0.9373 | 2.3604 | 0.0919 | 0.9076 | 0.5262 | 1.5654 | 0.7273 |
HR – hazards ratio; LCI – lower limit of confidence interval; UCI – upper limit of confidence interval; FDR – false discovery rate.
Univariate and multivariable Cox proportional hazards regression model of the overall survival of gastric cancer patients in the test set.
| Variable | Univariate analysis | Multivariable analysis | ||||||
|---|---|---|---|---|---|---|---|---|
| HR | LCI | UCI | P value | HR | LCI | UCI | P value | |
| 5-gene combination | 1.0341 | 1.0174 | 1.0511 | 0.0001 | 1.0232 | 1.0044 | 1.0424 | 0.0156 |
| Age | 1.0280 | 1.0068 | 1.0495 | 0.0093 | 1.0489 | 1.0227 | 1.0757 | 0.0002 |
| Gender Male | 0.9718 | 0.6202 | 1.5229 | 0.9007 | 1.0586 | 0.6543 | 1.7127 | 0.8166 |
| Gender Female | Reference | Reference | ||||||
| Pathologic stage | 1.3168 | 1.1666 | 1.4863 | <0.0001 | 1.3224 | 1.1648 | 1.5012 | <0.0001 |
| Grade | 1.4126 | 0.9209 | 2.1669 | 0.1136 | 1.3735 | 0.8688 | 2.1714 | 0.1745 |
HR – hazards ratio; LCI – lower limit of confidence interval; UCI – upper limit of confidence interval; FDR – false discovery rate.
Univariate and multivariable Cox proportional hazards regression model of the overall survival of gastric cancer patients in the validation set.
| Variable | Univariate analysis | Multivariable analysis | ||||||
|---|---|---|---|---|---|---|---|---|
| HR | LCI | UCI | P value | HR | LCI | UCI | P value | |
| 5-gene combination | 1.0618 | 1.0059 | 1.1209 | 0.0299 | 1.0559 | 0.9995 | 1.1155 | 0.0522 |
| Age | 0.9944 | 0.9786 | 1.0105 | 0.4943 | 1.0044 | 0.9882 | 1.0209 | 0.5960 |
| Gender Male | 1.1431 | 0.7414 | 1.7623 | 0.5450 | 0.8669 | 0.5499 | 1.3666 | 0.5385 |
| Gender Female | Reference | Reference | ||||||
| Lauren classification Intestinal | 0.7527 | 0.4956 | 1.1433 | 0.1828 | 0.7522 | 0.4781 | 1.1834 | 0.2180 |
| Lauren classification Mixed | 0.5549 | 0.2357 | 1.3067 | 0.1777 | 0.4796 | 0.2003 | 1.1481 | 0.0990 |
| Lauren classification Diffuse | Reference | Reference | ||||||
| Stage | 1.9697 | 1.5647 | 2.4795 | <0.0001 | 2.0460 | 1.6065 | 2.6057 | <0.0001 |
HR – hazards ratio; LCI – lower limit of confidence interval; UCI – upper limit of confidence interval; FDR – false discovery rate.