| Literature DB >> 36241983 |
Zining Liu1, Hua Liu2, Yinkui Wang3, Ziyu Li4.
Abstract
BACKGROUND: Stomach adenocarcinoma (STAD) is a highly heterogeneous disease and is among the leading causes of cancer-related death worldwide. At present, TNM stage remains the most effective prognostic factor for STAD. Exploring the changes in gene expression levels associated with TNM stage development may help oncologists to better understand the commonalities in the progression of STAD and may provide a new way of identifying early-stage STAD so that optimal treatment approaches can be provided.Entities:
Keywords: Bioinformatics analysis; Gastric adenocarcinoma; Prognostic marker; Tumor progression
Mesh:
Substances:
Year: 2022 PMID: 36241983 PMCID: PMC9564244 DOI: 10.1186/s12876-022-02510-8
Source DB: PubMed Journal: BMC Gastroenterol ISSN: 1471-230X Impact factor: 2.847
Fig. 1Flow chart of samples selection
Demographic and clinicopathologic characteristics in training and validation cohorts (GSE62254)
| Variables | Training | Validation | |
|---|---|---|---|
| N | 172 | 86 | |
| Sex | |||
| Female | 61 (35.5) | 24 (27.9) | 0.281 |
| Male | 111 (64.5) | 62 (72.1) | |
| Age (years) | |||
| ≤ 65 | 105 (61.0) | 43 (50.0) | 0.119 |
| > 65 | 67 (39.0) | 43 (50.0) | |
| Signet ring | |||
| No | 144 (83.7) | 77 (89.5) | 0.286 |
| Yes | 28 (16.3) | 9 (10.5) | |
| Perineural invasion | |||
| No | 87 (63.0) | 51 (68.9) | 0.481 |
| Yes | 51 (37.0) | 23 (31.1) | |
| Lymphovascular invasion | |||
| No | 43 (27.2) | 21 (25.9) | 0.953 |
| Yes | 115 (72.8) | 60 (74.1) | |
| T stage | |||
| 2–3 | 113 (65.7) | 57 (66.3) | 0.817 |
| 4a | 50 (29.1) | 26 (30.2) | |
| 4b | 9 ( 5.2) | 3 ( 3.5) | |
| N stage | |||
| 0 | 19 (11.0) | 13 (15.1) | 0.755 |
| 1 | 37 (21.5) | 16 (18.6) | |
| 2 | 45 (26.2) | 22 (25.6) | |
| 3a | 42 (24.4) | 24 (27.9) | |
| 3b | 29 (16.9) | 11 (12.8) | |
| Number of positive lymph nodes | 8.06 ± 9.27 | 7.87 ± 9.96 | 0.879 |
| Lauren | |||
| diffuse | 81 (47.1) | 30 (34.9) | 0.126 |
| intestinal | 79 (45.9) | 51 (59.3) | |
| mixed | 12 ( 7.0) | 5 ( 5.8) | |
| Tumor location (%) | |||
| Cardia/Upper | 19 (11.0) | 7 ( 8.1) | 0.613 |
| Middle | 65 (37.8) | 30 (34.9) | |
| Antrum/Distal | 88 (51.2) | 49 (57.0) | |
| ACRG.sub (%) | |||
| EMT | 25 (14.5) | 10 (11.6) | 0.809 |
| MSI | 41 (23.8) | 24 (27.9) | |
| TP53neg | 59 (34.3) | 31 (36.0) | |
| TP53positive | 47 (27.3) | 21 (24.4) | |
| Stage by AJCC 8th | |||
| Earlier stage (≤ IIa) | 49 (28.5) | 24 (27.9) | 1.000 |
| Later stage (> IIa) | 123 (71.5) | 62 (72.1) |
Demographic and clinicopathologic characteristics in TCGA dataset
| Variables | Overall |
|---|---|
| N | 156 |
| Sex | |
| Female | 43 (27.6) |
| Male | 113 (72.4) |
| Age (years) | |
| ≤ 65 | 52 (33.8) |
| > 65 | 102 (66.2) |
| Signet ring | |
| No | 150 (96.2) |
| Yes | 6 ( 3.8) |
| T stage | |
| 1 | 14 ( 9.0) |
| 2 | 36 (23.1) |
| 3 | 52 (33.3) |
| 4a | 39 (25.0) |
| 4b | 15 ( 9.6) |
| N stage | |
| 0 | 44 (28.2) |
| 1 | 27 (17.3) |
| 2 | 33 (21.2) |
| 3a | 31 (19.9) |
| 3b | 21 (13.5) |
| Lauren | |
| Diffuse | 31 (19.9) |
| Intestinal | 93 (59.6) |
| Not specified | 32 (20.5) |
| Number of positive lymph nodes | 7.15 ± 10.11 |
| Stage by AJCC 8th | |
| Earlier stage (≤ IIa) | 44 (28.2) |
| Later stage (> IIa) | 112 (71.8) |
Fig. 2Differentially expressed stage-related genes for STAD in (A) TCGA and (B) training datasets; Venn diagram showing overlapped (C) upregulated and D down-regulated DEGs between TCGA and training dataset; E the expression heatmap of the 22 overlapped DEGs
Fig. 3A GO analysis of the 22 overlapped DEGs; B KEGG pathway of DEGs. C Net plot of the pathways enriched with DEGs, as identified by KEGG pathway analysis
Fig. 4LASSO logistic analysis via 10‑fold cross‑validation with minimum criteria. A Tuning parameter selection via 10‑fold cross‑validation with minimum criteria in the LASSO model. B LASSO coefficient profiles of 22 candidate DEGs. LASSO. C The expression level of the nine hub genes between the earlier-stage and later-stage groups as identified by LASSO regression
LASSO regression results. Genes selected by the LASSO logistic regression, with the estimated coefficients and odds ratio
| Gene | Coefficient | Odds ratio |
|---|---|---|
| MYOCD | 0.02258333 | 1.0228403 |
| GHRL | 0.14381831 | 1.1546743 |
| SCRG1 | 0.10258011 | 1.1080261 |
| TYRP1 | 0.03295502 | 1.0335041 |
| LYPD6B | 0.30825199 | 1.3610439 |
| THBS4 | 0.27385904 | 1.3150294 |
| TNFRSF17 | 0.04663749 | 1.0477421 |
| SERPINB2 | −0.18898024 | 0.8278029 |
| NEBL | −0.24998835 | 0.7788099 |
Fig. 5Receiver operating characteristic curve based on the training, validation set 1, training-validation and validation set 2 (GSE15459); The ROC performances in validation set 2 before and after batch correction
The AUC performances of the 9 hub genes on other clinicopathologic phenotypes
| Variables | AUC (95% CI) |
|---|---|
| N+ | 0.728 (0.647–0.808) |
| T4 | 0.687 (0.617–0.756) |
| Signet ring | 0.711 (0.617–0.805) |
| Lauren diffuse type | 0.707 (0.643–0.771) |
| Diffuse + Mixed | 0.709 (0.647–0.772) |
| Antrum | 0.602 (0.531–0.673) |
| Cardia | 0.563 (0.441–0.685) |
| Age (> 65 years) | 0.611 (0.543–0.680) |
| Gender (male) | 0.608 (0.533–0.683) |
Fig. 6Gene set enrichment analysis analysis based on the training-validation set A Top five GSEA enrichment analysis results of the KEGG pathways for the later-stage group. B Network plots for GSEA. Network plot showing enriched upregulated pathways (in red) and downregulated pathways (in blue) for gene expression data samples with higher stage. Top 50% significant KEGG were included in this network
(A) Univariate Cox regression in Training-Validation dataset (N = 258). (B) Univariate Cox regression in the whole ACRG cohort (N = 300)
| Variables | Overall survival | Disease free survival | ||
|---|---|---|---|---|
| Hazard ratio | Hazard ratio | |||
| (A) N = 258 | ||||
| Male | 0.98 (0.67–1.45) | 0.940 | 1.01 (0.66–1.54) | 0.977 |
| Age (per 1 year increase) | 1.01 (0.99–1.03) | 0.255 | 1.00 (0.98–1.02) | 0.893 |
| Tumor location | ||||
| Upper | 1.00 | 1.000 | 1.00 | 1.000 |
| Middle | 0.61 (0.34–1.08) | 0.089 | 0.76 (0.40–1.44) | 0.397 |
| Lower | 0.56 (0.32–0.99) | 0.045 | 0.56 (0.30–1.07) | 0.082 |
| Whole | 2.91 (0.66–12.82) | 0.158 | 3.33 (0.74–15.02) | 0.117 |
| Total resection | 0.34 (0.23–0.50) | < 0.001 | 0.31 (0.21–0.48) | < 0.001 |
| T stage | ||||
| T2-3 | 1.00 | 1.000 | 1.00 | 1.000 |
| T4a | 2.48 (1.70–3.61) | < 0.001 | 2.73 (1.81–4.13) | < 0.001 |
| T4b | 1.73 (0.79–3.78) | 0.169 | 1.69 (0.72–3.94) | 0.229 |
| N stage | ||||
| N0 | 1.00 | 1.000 | 1.00 | 1.000 |
| N1 | 2.79 (1.06–7.37) | 0.039 | 2.73 (0.92–8.13) | 0.070 |
| N2 | 2.08 (0.79–5.54) | 0.140 | 1.71 (0.56–5.20) | 0.344 |
| N3a | 4.82 (1.90–12.25) | 0.001 | 5.60 (1.98–15.83) | 0.001 |
| N3b | 10.09 (3.91–26.03) | < 0.001 | 11.81 (4.13–33.77) | < 0.001 |
| T4 stage | 2.35 (1.64–3.38) | < 0.001 | 2.54 (1.71–3.79) | < 0.001 |
| N + stage | 3.87 (1.58–9.48) | 0.003 | 4.04 (1.48–10.99) | 0.006 |
| High stage | 2.18 (1.36–3.50) | 0.001 | 2.35 (1.37–4.02) | 0.002 |
| MLH1 positivity | 1.76 (1.09–2.85) | 0.021 | 1.86 (1.07–3.23) | 0.027 |
| Lauren classification | ||||
| Intestinal | 1.00 | 1.000 | 1.00 | 1.000 |
| Mixed | 2.18 (1.13–4.19) | 0.020 | 1.89 (0.85–4.23) | 0.120 |
| Diffused | 1.59 (1.09–2.33) | 0.017 | 1.49 (0.98–2.25) | 0.062 |
| Poor differentiation | 1.50 (1.03–2.17) | 0.035 | 1.40 (0.93–2.11) | 0.106 |
| ACRG subtype | ||||
| TP53 negative | 1.00 | 1.000 | 1.00 | 1.000 |
| TP53positive | 0.85 (0.53–1.36) | 0.496 | 0.97 (0.58–1.63) | 0.904 |
| MSI | 0.65 (0.39–1.09) | 0.107 | 0.63 (0.34–1.16) | 0.134 |
| EMT | 1.86 (1.14–3.06) | 0.014 | 2.08 (1.23–3.51) | 0.007 |
| Chemotherapy | 0.55 (0.35–0.85) | 0.007 | 0.55 (0.34–0.88) | 0.012 |
| Mesenchymal phenotype | 1.93 (1.32–2.81) | 0.001 | 2.09 (1.38–3.15) | < 0.001 |
| 9-gene score | 1.28 (1.08–1.50) | 0.003 | 1.30 (1.10–1.55) | 0.003 |
| MYOCD | 1.38 (1.23–1.55) | < 0.001 | 1.42 (1.25–1.61) | < 0.001 |
| GHRL | 1.03 (0.93–1.14) | 0.585 | 1.04 (0.94–1.16) | 0.399 |
| SCRG1 | 1.27 (1.18–1.38) | < 0.001 | 1.30 (1.19–1.41) | < 0.001 |
| TYRP1 | 1.20 (1.08–1.33) | 0.001 | 1.22 (1.09–1.36) | 0.001 |
| LYPD6B | 1.00 (0.89–1.13) | 0.982 | 0.96 (0.85–1.09) | 0.557 |
| THBS4 | 1.27 (1.15–1.40) | < 0.001 | 1.29 (1.16–1.44) | < 0.001 |
| TNFRSF17 | 0.95 (0.85–1.05) | 0.298 | 0.95 (0.85–1.06) | 0.388 |
| SERPINB2 | 1.00 (0.90–1.10) | 0.955 | 0.94 (0.84–1.06) | 0.34 |
| NEBL | 1.01 (0.92–1.11) | 0.865 | 1.01 (0.91–1.12) | 0.854 |
| Borrmann type | ||||
| Borrmann I or EGC | 1.00 | 1.000 | 1.00 | 1.000 |
| Borrmann II | 0.71 (0.29–1.73) | 0.455 | 0.62 (0.25–1.54) | 0.302 |
| Borrmann III | 1.49 (0.64–3.44) | 0.352 | 1.24 (0.53–2.90) | 0.615 |
| Borrmann IV | 3.62 (1.48–8.89) | 0.005 | 3.31 (1.33–8.25) | 0.010 |
| (B) N = 300 | ||||
| Male | 0.90 (0.65–1.27) | 0.559 | 0.96 (0.66–1.39) | 0.825 |
| Age (per 1 year increase) | 1.01 (1.00–1.03) | 0.181 | 1.00 (0.99–1.02) | 0.715 |
| Tumor location | ||||
| Upper | 1.00 | 1.000 | 1.00 | 1.000 |
| Middle | 1.09 (0.76–1.56) | 0.631 | 1.21 (0.82–1.78) | 0.330 |
| Lower | 1.66 (1.02–2.70) | 0.041 | 1.62 (0.93–2.83) | 0.087 |
| Whole | 3.27 (1.42–7.56) | 0.006 | 2.21 (0.80–6.11) | 0.127 |
| Subtotal resection | 0.38 (0.27–0.52) | < 0.001 | 0.38 (0.27–0.55) | < 0.001 |
| T stage | ||||
| T2-3 | 1.00 | 1.000 | 1.00 | 1.000 |
| T4a | 2.37 (1.69–3.32) | < 0.001 | 2.52 (1.73–3.67) | < 0.001 |
| T4b | 2.51 (1.46–4.32) | < 0.001 | 2.71 (1.53–4.79) | 0.001 |
| N stage | ||||
| N0 | 1.00 | 1.000 | 1.00 | 1.000 |
| N1 | 1.74 (0.86–3.54) | 0.124 | 1.71 (0.76–3.83) | 0.191 |
| N2 | 3.37 (1.65–6.85) | < 0.001 | 3.94 (1.77–8.79) | 0.001 |
| N3 | 6.94 (3.36–14.33) | < 0.001 | 7.46 (3.31–16.83) | < 0.001 |
| MLH1 positivity | 2.03 (1.28–3.22) | 0.003 | 2.10 (1.25–3.56) | 0.005 |
| Lauren classification | ||||
| Intestinal | 1.00 | 1.000 | 1.00 | 1.000 |
| Diffused | 1.69 (0.68–4.20) | 0.260 | 2.70 (0.97–7.50) | 0.057 |
| Mixed | 1.75 (1.26–2.42) | < 0.001 | 1.63 (1.13–2.34) | 0.009 |
| Poor differentiation | 1.60 (1.14–2.24) | 0.006 | 1.51 (1.05–2.19) | 0.028 |
| ACRG subtype | ||||
| TP53 negative | 1.00 | 1.000 | 1.00 | 1.000 |
| TP53positive | 0.78 (0.52–1.18) | 0.246 | 0.82 (0.52–1.29) | 0.391 |
| MSI | 0.52 (0.32–0.84) | 0.008 | 0.48 (0.27–0.85) | 0.012 |
| EMT | 1.56 (1.02–2.40) | 0.041 | 1.62 (1.03–2.55) | 0.037 |
| Chemotherapy | 0.48 (0.32–0.73) | < 0.001 | 0.49 (0.31–0.76) | 0.001 |
| Mesenchymal phenotype | 1.79 (1.29–2.50) | < 0.001 | 1.92 (1.34–2.76) | < 0.001 |
| 9-gene score | 1.29 (1.12–1.49) | < 0.001 | 1.31 (1.13–1.52) | < 0.001 |
| MYOCD | 1.34 (1.21–1.48) | < 0.001 | 1.37 (1.23–1.52) | < 0.001 |
| GHRL | 1.05 (0.97–1.14) | 0.193 | 1.07 (0.98–1.16) | 0.148 |
| SCRG1 | 1.23 (1.15–1.32) | < 0.001 | 1.24 (1.15–1.34) | < 0.001 |
| TYRP1 | 1.18 (1.08–1.30) | < 0.001 | 1.19 (1.08–1.31) | < 0.001 |
| LYPD6B | 1.02 (0.92–1.13) | 0.760 | 0.99 (0.89–1.11) | 0.896 |
| THBS4 | 1.23 (1.13–1.34) | < 0.001 | 1.24 (1.14–1.36) | < 0.001 |
| TNFRSF17 | 0.95 (0.87–1.04) | 0.240 | 0.94 (0.86–1.04) | 0.218 |
| SERPINB2 | 0.98 (0.89–1.07) | 0.653 | 0.94 (0.84–1.04) | 0.225 |
| NEBL | 1.00 (0.92–1.09) | 0.985 | 1.01 (0.92–1.11) | 0.882 |
| Borrmann type | ||||
| Borrmann I or EGC | 1.00 | 1.000 | 1.00 | 1.000 |
| Borrmann II | 0.87 (0.37–2.08) | 0.757 | 0.83 (0.34–1.99) | 0.670 |
| Borrmann III | 1.70 (0.74–3.91) | 0.209 | 1.42 (0.61–3.27) | 0.417 |
| Borrmann IV | 4.07 (1.70–9.76) | 0.002 | 3.33 (1.37–8.11) | 0.008 |
(A) Multivariate backward stepwise Cox regression on OS Training-Validation (N = 258, including 2 cases with NA entries). (B) Multivariate backward stepwise Cox regression on OS whole ACRG (N = 300, including 3 cases with NA entries)
| Hazard ratio | ||
|---|---|---|
| (A) (N = 258) | ||
| N stage | ||
| 0 | 1.00 | 1.000 |
| 1 | 2.77 (1.04–7.39) | 0.042 |
| 2 | 2.06 (0.77–5.47) | 0.149 |
| 3a | 4.50 (1.75–11.56) | 0.002 |
| 3b | 8.78 (3.36–22.90) | < 0.001 |
| Chemotherapy | 0.48 (0.31–0.75) | 0.001 |
| SCRG1 | 1.21 (1.11–1.32) | < 0.001 |
| (B) (N = 300) | ||
| T stage | ||
| T2-3 | 1.00 | 1.000 |
| T4a | 1.51 (1.03–2.20) | 0.034 |
| T4b | 1.61 (0.91–2.85) | 0.104 |
| N stage (6th AJCC) | ||
| N0 | 1.00 | 1.000 |
| N1 | 1.71 (0.84–3.48) | 0.143 |
| N2 | 2.92 (1.42–6.00) | 0.004 |
| N3 | 5.14 (2.45–10.81) | 0.000 |
| MLH1 positivity | 1.60 (1.00–2.57) | 0.051 |
| Chemotherapy | 0.43 (0.28–0.66) | 0.000 |
| SCRG1 | 1.14 (1.05–1.24) | 0.001 |
Fig. 7Receiver operating characteristic (ROC) curve analysis for stage prediction of our signature and other gene sets appeared in previous studies
The collection of gene signatures of STAD used for comparison
| Study | Genes | AUC | |
|---|---|---|---|
| Ours | MYOCD, GHRL, SCRG1, TYRP1, LYPD6B, THBS4, TNFRSF17, SERPINB2, NEBL | 0.742 | 1.000 |
| Cho et al. | CTNNB1, EXOSC3, TOP2A, TRANK1, LZTR1, CCL5 | 0.657 | 0.003 |
| Hou et al. | TRPC1, SGCE, TNFRSF11A, LRRN1, HLF, CYS1, PPP1R14A, NOVNBEA, CES1, RGN | 0.686 | 0.026 |
| Wang et al. | NR1I2, LGALSL, C1ORF198, CST2, LAMP5, FOXS1, CES1P1, MMP7, COL8A1 | 0.703 | 0.060 |
| Liu et al. | TOP2A, COL1A1, COL1A2, NDC80, COL3A1, CDKN3, CEP55, TPX2, TIMP1 | 0.686 | 0.108 |
| Peng et al. | ACOT7, CES1, IPMK, NES, PBX3, TMEM245, MIR6756, RAB11FIP4, RBPMS2, RPS27L, TPMT, TNFRSF11A | 0.656 | 0.006 |
| Yu et al. | MFAP2, SPP1, COL1A1, BGN, COL11A1, COL10A1, MXRA5, COMP, AGRN | 0.681 | 0.046 |
| Dai et al. | DCLK1, FLRT2, MCC, PRICKLE1, RIMS1, SLC25A15, SLCO2A1, CDO1, GHR, CD109, SELP, UPK1B, CD36 | 0.673 | 0.011 |
| Guan et al. | HBB, C4orf48, MANEAL, CXCL3, TRIM31, TMEM200A, SERPINE1, F5, NOXO1, DKK1 | 0.685 | 0.101 |
| Jiang et al. | AKAP12, ANGPTL1, CYS1, MLLT11, NAV3, NBEA, NOV, PTN, TUSC3, ZSCAN18 | 0.666 | 0.006 |
AUC: area under curve; †P-value stands for Venkatraman permutation test