| Literature DB >> 33976744 |
Guoguang Wang1, Tian Zhan1, Fan Li1, Jian Shen1, Xiang Gao1, Lei Xu1, Yuan Li2, Jianping Zhang1.
Abstract
Gastric cancer represents a major public health problem. Owing to the great heterogeneity of GC, conventional clinical characteristics are limited in the accurate prediction of individual outcomes and survival. This study aimed to establish a robust gene signature to predict the prognosis of GC based on multiple datasets. Initially, we downloaded raw data from four independent datasets of The Cancer Genome Atlas (TCGA) and Gene Expression Omnibus (GEO), and performed univariate Cox proportional hazards regression analysis to identify prognostic genes associated with overall survival (OS) from each dataset. Thirteen common genes from four datasets were screened as candidate prognostic signatures. Then, a risk score model was developed based on this 13‑gene signature and validated by four independent datasets and the entire cohort. Patients with a high-risk score had poorer OS and recurrence-free survival (RFS). Multivariate regression and stratified analysis revealed that the 13-gene signature was not only an independent predictive factor but also associated with recurrence when adjusting for other clinical factors. Furthermore, in the high-risk group, gene set enrichment analysis (GSEA) showed that the mTOR signaling pathway and MAPK signaling pathway were significantly enriched. The present study provided a robust and reliable gene signature for prognostic prediction of both OS and RFS of patients with GC, which may be useful for delivering individualized management of patients. © The author(s).Entities:
Keywords: Kyoto Encyclopedia of Genes and Genomes pathway; drug resistance; gastric cancer; prognostic signature; recurrence
Year: 2021 PMID: 33976744 PMCID: PMC8100809 DOI: 10.7150/jca.49658
Source DB: PubMed Journal: J Cancer ISSN: 1837-9664 Impact factor: 4.207
Figure 1Study flow diagram.
General information of the 13 genes for constructing the prognostic signature
| Gene stable ID | Gene name | Gene type | Chromosome | Gene start (bp) | Gene end (bp) |
|---|---|---|---|---|---|
| ENSG00000213638 | ADAT3 | Protein coding | 19 | 1905399 | 1913447 |
| ENSG00000157111 | TMEM171 | Protein coding | 5 | 73120569 | 73131809 |
| ENSG00000057019 | DCBLD2 | Protein coding | 3 | 98795941 | 98901695 |
| ENSG00000277443 | MARCKS | Protein coding | 6 | 113857345 | 113863475 |
| ENSG00000115295 | CLIP4 | Protein coding | 2 | 29097705 | 29189643 |
| ENSG00000119326 | CTNNAL1 | Protein coding | 9 | 108942569 | 109013522 |
| ENSG00000150867 | PIP4K2A | Protein coding | 10 | 22534854 | 22714578 |
| ENSG00000205189 | ZBTB10 | Protein coding | 8 | 80485619 | 80526265 |
| ENSG00000099250 | NRP1 | Protein coding | 10 | 33177492 | 33336262 |
| ENSG00000175315 | CST6 | Protein coding | 11 | 66012008 | 66013505 |
| ENSG00000100979 | PLTP | Protein coding | 20 | 45898621 | 45912155 |
| ENSG00000156535 | CD109 | Protein coding | 6 | 73695785 | 73828316 |
| ENSG00000153814 | JAZF1 | Protein coding | 7 | 27830573 | 28180795 |
Univariate regression analysis of the 13 genes and overall survival of GC patients in 4 datasets
| Genes | GSE15459 | GSE62254 | GSE57303 | TCGA | ||||
|---|---|---|---|---|---|---|---|---|
| HR (95% CI) | HR (95% CI) | HR (95% CI) | HR (95% CI) | |||||
| ADAT3 | 0.46 (0.24-0.86) | 1.52E-02 | 0.17 (0.07-0.40) | 4.09E-05 | 0.04 (0.00-0.39) | 5.49E-03 | 0.95 (0.92-0.98) | 1.77E-03 |
| CD109 | 1.39 (1.11-1.75) | 4.73E-03 | 0.64 (0.52-0.80) | 5.85E-05 | 1.59 (1.01-2.50) | 4.68E-02 | 1.01 (1.01-1.02) | 3.28E-05 |
| CLIP4 | 1.34 (1.06-1.69) | 1.49E-02 | 1.37 (1.06-1.77) | 1.78E-02 | 2.00 (1.19-3.35) | 8.93E-03 | 1.04 (1.01-1.07) | 7.57E-03 |
| CST6 | 1.19 (1.05-1.34) | 4.96E-03 | 1.75 (1.18-2.60) | 5.72E-03 | 1.34 (1.02-1.76) | 3.76E-02 | 1.00 (1.00-1.00) | 3.78E-04 |
| CTNNAL1 | 1.35 (1.03-1.77) | 2.90E-02 | 1.79 (1.36-2.37) | 4.09E-05 | 2.06 (1.11-3.83) | 2.22E-02 | 1.02 (1.01-1.03) | 1.89E-03 |
| DCBLD2 | 1.94 (1.31-2.87) | 9.55E-04 | 1.67 (1.36-2.06) | 1.20E-06 | 2.62 (1.45-4.74) | 1.41E-03 | 1.02 (1.00-1.03) | 3.85E-02 |
| JAZF1 | 1.34 (1.04-1.73) | 2.26E-02 | 1.57 (1.28-1.93) | 1.46E-05 | 1.57 (1.00-2.46) | 4.93E-02 | 1.03 (1.00-1.05) | 1.69E-02 |
| MARCKS | 2.25 (1.44-3.53) | 3.92E-04 | 1.22 (1.02-1.45) | 2.65E-02 | 2.44 (1.26-4.71) | 7.85E-03 | 1.01 (1.00-1.01) | 3.20E-05 |
| NRP1 | 2.38 (1.53-3.70) | 1.15E-04 | 1.81 (1.24-2.63) | 2.06E-03 | 2.21 (1.08-4.54) | 3.00E-02 | 1.02 (1.01-1.03) | 9.26E-05 |
| PIP4K2A | 1.54 (1.03-2.30) | 3.43E-02 | 1.73 (1.35-2.20) | 1.03E-05 | 3.00 (1.14-7.88) | 2.55E-02 | 1.02 (1.00-1.03) | 9.27E-03 |
| PLTP | 1.21 (1.00-1.45) | 4.64E-02 | 1.35 (1.12-1.62) | 1.32E-03 | 1.39 (1.01-1.91) | 4.21E-02 | 1.00 (1.00-1.00) | 3.34E-02 |
| TMEM171 | 0.72 (0.53-0.98) | 3.73E-02 | 1.25 (1.03-1.53) | 2.56E-02 | 0.50 (0.27-0.91) | 2.28E-02 | 0.99 (0.98-1.00) | 4.90E-02 |
| ZBTB10 | 1.61 (1.18-2.18) | 2.58E-03 | 1.42 (1.00-2.00) | 4.76E-02 | 1.81 (1.06-3.08) | 2.99E-02 | 1.03 (1.00-1.05) | 3.43E-02 |
HR, hazard ratio; CI, confidence interval.
Figure 2Risk-score analysis of GC patients in the four datasets. In each dataset, the gene expression profiles, and patients' survival status are displayed. The black-dotted line represents the median cut-off, dividing patients into high- and low-risk groups.
Figure 3Kaplan-Meier and ROC curves for the 13-gene signature in the four datasets. Patients with high risk scores had poor outcome in terms of overall survival.
Univariate and multivariate Cox regression analyses of the gene signature and overall survival of GC patients in 4 independent datasets
| Variables | Patients (N) | Univariate analysis | Multivariate analysis | ||
|---|---|---|---|---|---|
| HR (95% CI) | HR (95% CI) | ||||
| Age (≤65/>65) | 45/23 | 0.6 (0.3-1.4) | 2.50E-01 | ||
| Gender (Male/Female) | 52/18 | 0.6 (0.2-1.3) | 1.60E-01 | ||
| Lauren subtype (Diffuse/Intestinal) | 35/20 | 1.1 (0.5-2.3) | 8.70E-01 | ||
| Stage (I+II/III+IV) | 13/57 | 1.5 (0.6-4.0) | 3.70E-01 | ||
| Risk score (Low/High) | 35/35 | 3.0 (1.5-6.1) | 2.10E-03 | 3.0 (1.5-6.1) | 2.07E-03 |
| Age (≤65/>65) | 81/101 | 1.0 (0.6-1.4) | 7.80E-01 | ||
| Gender (Male/Female) | 116/66 | 0.7 (0.5-1.1) | 1.30E-01 | ||
| Lauren subtype (Diffuse/Intestinal) | 73/91 | 0.9 (0.6-1.3) | 5.40E-01 | ||
| Stage (I+II/III+IV) | 59/123 | 6.5 (3.6-12) | 6.40E-10 | 6.2 (3.4-11.2) | 2.69E-09 |
| Risk score (Low/High) | 91/91 | 2.7 (1.7-4.1) | 9.20E-06 | 2.4 (1.5-3.7) | 9.84E-05 |
| Age (≤65/>65) | 171/128 | 1.3 (1.0-1.8) | 7.90E-02 | ||
| Gender (Male/Female) | 198/101 | 1.1 (0.8-1.6) | 5.20E-01 | ||
| Lauren subtype (Diffuse/Intestinal) | 124/140 | 0.6 (0.4-0.8) | 3.40E-03 | 0.9 (0.6-1.2) | 4.12E-01 |
| Stage (I+II/III+IV) | 125/172 | 3.5 (2.4-5.1) | 8.20E-11 | 3.5 (2.3-5.4) | 1.36E-08 |
| Risk score (Low/High) | 150/149 | 2.1 (1.5-2.9) | 1.40E-05 | 1.7 (1.2-2.5) | 4.27E-03 |
| Age (≤65/>65) | 152/178 | 1.6 (1.1-2.2) | 1.10E-02 | 1.7 (1.1-2.7) | 2.55E-02 |
| Gender (Male/Female) | 216/117 | 0.8 (0.5-1.1) | 1.20E-01 | ||
| Stage (I+II/III+IV) | 151/168 | 1.8 (1.2-2.5) | 1.80E-03 | 1.3 (0.8-2.0) | 2.85E-01 |
| Lauren subtype (Diffuse/Intestinal) | 58/70 | 1.3 (0.8-2.2) | 3.30E-01 | ||
| Recurrence (No/Yes) | 209/57 | 3.7 (2.4-5.7) | 2.40E-09 | 3.6 (2.3-5.7) | 3.04E-08 |
| Risk score (Low/High) | 167/166 | 1.9 (1.4-2.7) | 1.70E-04 | 2.0 (1.2-3.3) | 4.89E-03 |
HR, hazard ratio.
Univariate and multivariate Cox regression analyses of the gene signature and overall survival of GC patients in entire cohort
| Variables | Patients (N) | Univariate analysis | Multivariate analysis | ||
|---|---|---|---|---|---|
| HR (95% CI) | HR (95% CI) | ||||
| Age (≤65/>65) | 449/430 | 1.3 (1.0-1.5) | 1.40E-02 | 1.6 (1.3-1.9) | 9.10E-06 |
| Gender (Male/Female) | 582/302 | 0.9 (0.7-1.0) | 1.30E-01 | ||
| Lauren subtype (Diffuse/Intestinal) | 300/326 | 0.8 (0.6-1.0) | 5.50E-02 | ||
| Stage (I+II/III+IV) | 348/520 | 2.9 (2.3-3.7) | 2.60E-20 | 2.9 (2.3-3.6) | 2.00E-16 |
| Risk score (Low/High) | 443/441 | 2.2 (1.8-2.7) | 2.30E-14 | 2.1 (1.7-2.6) | 2.59E-12 |
HR, hazard ratio; CI, confidence interval.
Figure 4The 13-gene signature is associated with cancer recurrence.
Figure 5Kaplan-Meier analysis of overall survival for patients stratified by age, gender, Lauren'subtype, and stage.
Figure 6Oncological KEGG pathways enriched in the high-risk group from 4 independent datasets.
Figure 713-gene signature may be related to chemotherapy resistance in GC.