Literature DB >> 34746312

Discovery and Validation of an Epithelial-Mesenchymal Transition-Based Signature in Gastric Cancer by Genomics and Prognosis Analysis.

Huiyong Xu1, Huilai Wan1, Maoshu Zhu1, Lianghua Feng1, Hui Zhang1, Fengbing Su1.   

Abstract

OBJECTIVE: Epithelial-mesenchymal transition (EMT) exerts a key function in cancer initiation and progression. Herein, we aimed to develop an EMT-based prognostic signature in gastric cancer.
METHODS: The gene expression profiles of gastric cancer were obtained from TCGA dataset as a training set and GSE66229 and GSE84437 datasets as validation sets. By LASSO regression and Cox regression analyses, key prognostic EMT-related genes were screened for developing a risk score (RS) model. Potential small molecular compounds were predicted by the CMap database based on the RS model. GSEA was employed to explore signaling pathways associated with the RS. ESTIMATE and seven algorithms (TIMER, CIBERSORT, CIBERSORT-ABS, QUANTISEQ, MCPCOUNTER, XCELL, and EPIC) were applied to assess the RS and immune microenvironment.
RESULTS: This study developed an EMT-related gene signature comprised of SERPINE1, PCOLCE2, MATN3, and DKK1. High-RS patients displayed poorer survival outcomes than those with low RS. ROC curves demonstrated the robustness of the model in predicting the prognosis. After external validation, the RS model was an independent risk factor for gastric cancer. Several compounds were predicted for gastric cancer treatment based on the RS model. ECM receptor interaction, focal adhesion, pathway in cancer, TGF-beta, and WNT pathways were distinctly activated in high-RS samples. Also, high RS was significantly associated with increased stromal and immune scores and increased infiltration of CD4+ T cell, CD8+ T cell, cancer-associated fibroblast, and macrophage in gastric cancer tissues.
CONCLUSION: Our findings suggested that the EMT-related gene model may robustly predict gastric cancer prognosis, which could improve the efficacy of personalized therapy.
Copyright © 2021 Huiyong Xu et al.

Entities:  

Mesh:

Substances:

Year:  2021        PMID: 34746312      PMCID: PMC8570100          DOI: 10.1155/2021/9026918

Source DB:  PubMed          Journal:  Biomed Res Int            Impact factor:   3.411


1. Introduction

Gastric cancer represents a common aggressive malignancy and a common cause of cancer-related deaths globally due to its rapid progress to advanced stages and badly metastatic characteristics [1]. The incidence and prevalence of gastric cancer vary geographically [2]. Despite the improvement in clinical outcomes by implementing standard D2 lymphadenectomy as well as development of chemotherapy and targeted therapy, the overall survival rate of gastric cancer patients is <30% [3]. As a heterogeneous malignancy [4], survival outcomes may greatly vary even for subjects with similar clinical characteristics and therapy regimens, indicating that traditional clinicopathologic characteristics are inadequate for prognosis prediction and risk stratification [5]. Hence, it is important to develop novel clinical tools for predicting the prognosis of gastric cancer. Epithelial-mesenchymal transition (EMT), a well-characterized embryological process, is a critical molecular step during the process of distant metastases [6-8]. Clinically, EMT is in relation to unfavorable survival outcomes of gastric cancer [9]. During the EMT process, gastric cancer cells lose the expression of cellular adhesion proteins like E-cadherin and tight junction proteins as well as express many mesenchymal markers like N-cadherin, Vimentin, and ZEB1 [10]. The mesenchymal phenotype also may raise resistance to chemotherapy and contribute to a desirable prognosis [11]. Therefore, an in-depth comprehension on the mechanisms of the EMT process in gastric cancer is required for promoting the progress of specific treatment strategies. Because various large datasets are easily accessible, exploring the gene signatures underlying the mechanisms of gastric cancer has flourished [12-14]. Despite the extensive research on the mechanisms of EMT in gastric cancer, the prognostic value of EMT-related genes is still inconclusive. Hence, this study constructed an EMT-based signature for predicting survival outcomes of gastric cancer patients. After external verification, this signature might be a robust prognostic prediction tool and assist clinical strategy.

2. Materials and Methods

2.1. Gene Expression Profiles and Data Processing

RNA-sequencing (RNA-seq) profiles of 32 normal samples and 350 gastric cancer samples were downloaded from The Cancer Genome Atlas (TCGA) via Genomic Data Commons (GDC; https://portal.gdc.cancer.gov/). Also, the matched clinical information was also retrieved. RNA-seq data were converted to transcripts per kilobase million (TPM) values. This dataset was used as the training set. From the Gene Expression Omnibus (GEO; http://www.ncbi.nlm.nih.gov/geo/), microarray expression profiling and clinical information of 400 cases of gastric cancer were retrieved from the GSE66229 dataset on the GPL570 platform ([HG-U133_Plus_2] Affymetrix Human Genome U133 Plus 2.0 Array) [15]. Furthermore, expression profiles and clinical features of 433 gastric cancer were obtained from the GSE84437 dataset on the GPL6947 platform (Illumina HumanHT-12 V3.0 expression beadchip) [16]. The raw microarray data were adjusted by background, normalized, and log transformed. The GSE66229 and GSE84437 datasets were employed as the validation sets. The “HALLMARK_EPITHELIAL_MESENCHYMAL_TRANSITION” gene set was retrieved from the Gene Set Enrichment Analysis (GSEA) database (http://software.broadinstitute.org/gsea/index.jsp) [17] (Supplementary Table 1).

2.2. Differential Expression Analysis

The expression of EMT-related genes in 350 gastric cancer tissue specimens was compared with 32 normal tissues in TCGA dataset using the limma package [18]. The ∣log fold‐change | >1 and adjusted p < 0.05 were set as cutoff criteria. Differentially expressed EMT-related genes were visualized into volcano plots and heatmaps.

2.3. Functional and Pathway Enrichment Analysis

Biological functions of differentially expressed EMT-related genes were analyzed via the clusterProfiler package, containing Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analysis [19]. Terms with false discovery rate (FDR) < 0.05 were significantly enriched.

2.4. Small Molecular Compound Prediction

Differentially expressed genes with ∣log fold‐change | >1 and adjusted p < 0.05 were screened between the high- and low-RS groups. Then, up- and downregulated tags were separately uploaded onto Connectivity Map (CMap) [20]. The match between these genes and small molecular compounds from CMap was evaluated through a connectivity score from −1 to 1. Positive scores denote stimulative effects of compounds on the query signatures. Meanwhile, negative scores implicate inhibitory effects of compounds on the query signatures.

2.5. Generation and Verification of a Risk Score (RS) Model

In TCGA dataset, differentially expressed EMT-related genes with prognostic value were filtered via univariate Cox regression analyses. Genes with p < 0.05 were included for least absolute shrinkage and selection operator (LASSO) Cox regression model analyses using the glmnet package [21]. The penalized Cox regression model with LASSO penalty was employed for achieving shrinkage and variable selection. Tenfold cross-validation was presented for determining the optimal value of the penalty parameter λ. Based on λ value, factors with the matched coefficients were chosen. RS of each patient was determined on the basis of the expression levels of genes and their coefficients. According to the median value, patients were split into the high- and low-RS groups. Kaplan-Meier curves and log-rank test were employed for analyzing the overall survival (OS) difference between the high- and low-RS groups. Receiver operating characteristic (ROC) analysis was conducted for detecting the predictive accuracy of this RS model in the prognosis. Furthermore, the RS model was externally validated in the GSE66229 and GSE84437 datasets.

2.6. Screening Independent Prognostic Factors

Univariate Cox regression analysis was applied for evaluating the significance of the RS model and clinical characteristics in predicting gastric cancer patients' OS. Factors with p < 0.05 were included for multivariable logistic regression analysis, and confounding factors were excluded. The hazard ratio (HR) and 95% confidence interval (CI) were calculated. The results were visualized into a forest plot.

2.7. Subgroup Analysis

To evaluate the predictive sensitivity of the RS model in gastric cancer OS, patients were split into subgroups based on clinical features, as follows: age (>65 and ≤65), gender (female and male), M (M0 and M1), N (N0 and N1-3), T (T1-2 and T3-4), and stage (I-II and III-IV). The survival difference between the high- and low-RS samples was compared in each subgroup.

2.8. Development of a Prognostic Nomogram

RS and traditional clinicopathological characteristics were included in the nomogram through the rms package. To assess the performance of the nomogram in predicting 1-, 3-, and 5-year OS time, nomogram-predicted OS probability was compared with actual survival time by calibration curves. Furthermore, the predictive efficacy of this nomogram was externally verified in the GSE66229 and GSE84437 datasets.

2.9. GSEA

The GSEA method was applied for exploring the potential KEGG pathways activated in high-RS gastric cancer samples. The reference gene set was retrieved from “c2.cp.kegg.v7.1.symbols” file. The significantly enriched pathways were screened with FDR < 0.05.

2.10. Estimation of Immune Score, Stromal Score, and Tumor Purity

The immune score, stromal score, and tumor purity were estimated in gastric cancer tissue specimens via the Estimation of STromal and Immune cells in MAlignant Tumor tissues using Expression data (ESTIMATE) algorithm [22].

2.11. Analysis of Immune Cell Infiltrations

To reveal the associations of the risk score and diverse tumor-infiltrating immune cells, seven algorithms including TIMER, CIBERSORT, CIBERSORT-ABS, QUANTISEQ, MCPCOUNTER, XCELL, and EPIC were applied for quantifying the infiltration levels. Differences in immune-infiltrating cell fractions were estimated between the high- and low-risk groups.

2.12. Statistical Analysis

All statistical analyses were conducted using R software (version 3.6.2; https://www.r-project.org/). Comparisons between groups were carried out with Student's t-test and Wilcoxon rank-sum test. The Spearman correlation test was applied to assess the correlation between immune cells. p values < 0.05 were considered statistically significant.

3. Results

3.1. Identification of Dysregulated EMT-Related Genes and Their Functions in Gastric Cancer

Following the comparison of expression of EMT-related genes between gastric cancer and normal tissues, 79 differentially expressed EMT-related genes with ∣log fold‐change | >1 and adjusted p < 0.05 were identified (Supplementary Table 2). Among them, 67 EMT-related genes were upregulated and 12 were downregulated in gastric cancer (Figures 1(a) and 1(b)). GO enrichment analyses were conducted to elucidate the functional characteristics of these differentially expressed EMT-related genes. Our data showed that these genes were markedly enriched in extracellular matrix (ECM) organization, extracellular structure organization, and collagen fibril organization (Figure 1(c)). Meanwhile, these genes were distinctly related to several key pathways like focal adhesion, ECM-receptor interaction, PI3K-Akt signaling pathway, and proteoglycans in cancer (Figure 1(d)). Hence, it is required to illustrate their clinical implications in gastric cancer.
Figure 1

Identification of dysregulated EMT-related genes, biological functions in gastric cancer. (a) Volcano plot depicting the dysregulated EMT-related genes between gastric cancer and normal tissues. X-axis represents log fold-change, and Y-axis indicates -log10 (adjusted p value). Red and green dots represent up- and downregulated EMT-related genes in gastric cancer, and black dots represent no significant genes. (b) Heatmaps for dysregulated EMT-related genes between tumor and normal tissues. X-axis represents the sample type, and Y-axis depicts dysregulated EMT-related genes. Red and blue show up- and downregulation in gastric cancer, respectively. (c, d) The top ten GO and KEGG terms enriched by dysregulated EMT-related genes.

3.2. Generation of a Prognostic EMT-Related RS Model for Gastric Cancer

By the mRNA expression profiling of TCGA dataset, we screened 35 EMT-related genes associated with OS of gastric cancer with univariable Cox regression analysis (Figure 2(a); Table 1). These genes were further analyzed using LASSO Cox regression model analysis. As a result, we generated a 4-EMT-related gene model for gastric cancer (Figures 2(b) and 2(c)). The RS was determined for each gastric cancer, as follows: RS = 0.127258355254692∗SERPINE1 expression + 0.04303645817321∗PCOLCE2 expression + 0.128510051263955∗MATN3 expression + 0.0116209970037921∗DKK1 expression. Because the median RS was convenient for clinical application, this study set the median value as the cutoff value, and patients were split into the high- and low-RS groups (Figure 2(d)). We compared the survival status between groups. In Figure 2(e), more deaths occurred in the high-RS group. Furthermore, for each patient, high RS was indicative of an unfavorable prognosis (p = 8.321e − 05; Figure 2(f)). However, there was no significant difference in clinical characteristics between the high- and low-RS groups (Table 2). The area under the curve (AUC) of the RS model was 0.763, indicating good performance in predicting patients' OS (Figure 2(g)). Our univariate Cox regression analysis showed that age (p = 0.033), stage (p = 0.002), N (p = 0.022), and RS (p < 0.001) were distinctly associated with a poor prognosis (Figure 2(h)). Under multivariate Cox regression analysis, age (p = 0.004), stage (p = 0.005), and RS (p < 0.001) were independent risk factors for the gastric cancer prognosis (Figure 2(i)).
Figure 2

Generation of a prognostic EMT-related gene model for gastric cancer in TCGA dataset. (a) Univariate Cox regression analysis for prognosis-related EMT-related genes in gastric cancer. (b) Selecting the optimal parameter (λ) in the LASSO model using 10-fold cross-verification. (c) LASSO coefficient profiles of prognosis-related EMT genes. (d) Distribution of RS in gastric cancer patients and determination of the cutoff value of high-RS (red) and low-RS (green) groups according to RS median. (e) Distribution of survival status (dead: red and alive: green) in high- and low-RS groups. (f) Kaplan-Meier OS curves for the high- and low-RS groups. (g) The time-dependent ROC for the RS model. (h) Univariate and (i) multivariate Cox regression analyses of RS and other clinical features.

Table 1

Prognosis-related EMT-related gene signatures for gastric cancer by univariate Cox regression analysis.

IDHRHR.95LHR.95H p valueIDHRHR.95LHR.95H p value
CTHRC11.2004091.067681.3496370.002248THBS21.1196981.0159751.2340090.022637
INHBA1.1771761.0339141.3402890.013751SFRP11.0909711.0029151.1867590.042586
COL1A11.129361.0135761.258370.027503COL5A11.1422091.0062161.2965820.039804
BGN1.1796471.0392631.3389950.010597LOX1.252521.0902021.4390050.001475
COL4A11.2154421.0276671.4375270.022685PCOLCE21.2495041.0857891.4379030.001879
TIMP11.1863591.008041.3962220.039751CDH111.2085691.0524991.3877810.007247
COL5A21.1930861.039621.3692080.011969SFRP41.0789541.0054061.1578820.034891
THY11.2045121.0399181.3951580.013062MATN31.2787411.1319431.4445777.75E-05
FAP1.1675081.0316611.3212440.014135NID21.2353691.0575391.4431030.007689
COL3A11.1504731.0272651.2884580.015291MYL91.0937981.0050171.1904210.037909
CALU1.2602931.0011951.5864440.048823FN11.1245771.0182541.2420030.020507
ADAM121.1833441.0442761.3409310.008311PRRX11.1408971.0118251.2864340.031407
COL1A21.1518051.0243021.295180.018221LUM1.195841.0544131.3562370.005352
SPARC1.2632891.092891.4602560.00157DCN1.1593581.0313131.3033010.013275
SERPINE11.240281.1170381.3771195.51E-05FBLN11.1102471.0173391.211640.019002
PDGFRB1.1894391.0287261.3752580.019162MFAP51.1177421.0107261.2360890.030178
VCAN1.230741.0793191.4034030.001938ACTA21.1194721.0162731.233150.02219
DKK11.0676241.0027751.1366670.040693
Table 2

Clinical characteristics of high- and low-RS gastric cancer patients in TCGA dataset.

CharacteristicsHigh risk (N = 175)Low risk (N = 175)Total (N = 350) p value
Age<6581691500.2348
≥6594106200
StageStage I2128490.619
Stage II5556111
Stage III7976155
Stage IV201535
TT1313160.0757
T2393574
T37883161
T4524395
TX314
MM01551573120.9404
M1121123
MX8715
NN049551040.8117
N1454893
N2363672
N3403171
NX5510
GenderFemale60641240.7374
Male115111226
GradeG14590.9717
G26263125
G3104103207
GX549

3.3. Subgroup Analysis of the Prognostic Value of the EMT-Related RS Model

SERPINE1, PCOLCE2, MATN3, and DKK1 expression was compared between the high- and low-RS groups. In Figure 3(a), there were increased expression levels in the high- than low-RS groups. To assess whether the EMT-related RS model could sensitively predict gastric cancer patients' prognosis, we carried out subgroup analysis. Our data showed that high RS was predictive of undesirable survival outcomes compared with low RS in each subgroup including age ≥ 65 (p = 0.002; Figure 3(b)) and age < 65 (p = 0.009; Figure 3(c)), female (p = 0.024; Figure 3(d)) and male (p = 0.002; Figure 3(e)), M0 (p < 0.001; Figure 3(f)) and M1 (p = 0.590; Figure 3(g)), N0 (p = 0.001; Figure 3(h)) and N1-3 (p = 0.005; Figure 3(i)), T1-2 (p = 0.003Figure 3(j)) and T3-4 (p = 0.006; Figure 3(k)), stage I-II (p < 0.001; Figure 3(l)) and stage III-IV (p = 0.042; Figure 3(m)).
Figure 3

Subgroup analysis of the prognostic value of the EMT-related RS model. (a) Heatmap of the expression of SERPINE1, PCOLCE2, MATN3, and DKK1 in high- and low-RS groups. Red and green show up- and downregulation. Kaplan-Meier curves between high- and low-RS gastric cancer patients in different subgroups including (b) age ≥ 65 and (c) age < 65; (d) female and (e) male; (f) M0 and (g) M1; (h) N0 and (i) N1-3; (j) T1-2 and (k) T3-4; (l) stage I-II and (m) stage III-IV.

3.4. External Validation of the EMT-Related RS Model

The predictive efficacy of the EMT-related RS model was externally verified in the GSE66229 and GSE84437 datasets. With the same formula, we calculated the RS of each patient. In the GSE66229 dataset, patients were split into the high- and low-RS groups based on the median value (Figure 4(a)). As expected, more deaths were found in the high-RS group (Figure 4(b)). The clinical features between groups were compared, and we found that high RS was in relation to late stage, T, and M (Table 3). Furthermore, high-RS patients exhibited more undesirable survival outcomes (p = 7.802e − 07; Figure 4(c)). AUC of the RS model was 0.675 (Figure 4(d)). Similarly, we split patients in the GSE84437 dataset into the high- and low-RS groups (Figure 4(e)). There were more patients with dead status in the high-RS group (Figure 4(f)). In Figure 4(g), high RS was distinctly related to poor prognosis (p = 5.333e − 03). And AUC of the model was 0.637 (Figure 4(h)). Consistent with TCGA dataset, increased SERPINE1, PCOLCE2, MATN3, and DKK1 expression was detected in the high-RS group than the low-RS group in GSE66229 (Figure 5(a)) and GSE84437 (Figure 5(b)) datasets. Following univariate (Figure 5(c)) and multivariate (Figure 5(d)) Cox regression analyses, the RS model was markedly correlated with gastric cancer prognosis in the GSE66229 dataset. Consistently, in the GSE84437 dataset, the RS model was also a risk factor for prognosis according to univariate (Figure 5(e)) and multivariate (Figure 5(f)) Cox regression analyses. Collectively, the EMT-related RS model displayed good generalizability in clinical practice.
Figure 4

External validation of the EMT-related RS model in GSE66229 and GSE84437 datasets. (a) Distribution of RS in gastric cancer samples and determination of the cutoff value of high-RS (red) and low-RS (green) groups according to RS median in the GSE66229 dataset. (b) Distribution of survival status (red: dead and green: alive) in high- and low-RS groups in GSE66229 dataset. (c) Kaplan-Meier OS curves of high- and low-RS groups in GSE66229 dataset. (d) ROC curves of the RS model in GSE66229 dataset. (e) Distribution of RS in gastric cancer samples and determination of the cutoff value of high-RS (red) and low-RS (green) groups according to RS median in GSE84437 dataset. (f) Distribution of survival status (red: dead and green: alive) in high- and low-RS groups in GSE84437 dataset. (g) Kaplan-Meier OS curves of high- and low-RS groups in GSE84437 dataset. (h) ROC curves of the RS model in GSE84437 dataset.

Table 3

Clinical characteristics of gastric cancer patients in the GSE66229 dataset.

CharacteristicsHigh risk (N = 150)Low risk (N = 150)Total (N = 300) p value
Age<6587741610.1647
≥656376139
StageStage I921300.0073
Stage II405696
Stage III554095
Stage IV453277
NA112
TT275111186<0.0001
T3603191
T414721
NA112
MM01311422730.0437
M119827
NN01424380.1309
N16269131
N2473380
N3272451
GenderFemale53481010.6251
Male97102199
Figure 5

External validation of the independency of the EMT-related RS model in predicting prognosis in GSE66229 and GSE84437 datasets. (a, b) Heatmap of the expression of SERPINE1, PCOLCE2, MATN3, and DKK1 in high- and low-RS groups in (a) GSE66229 and (b) GSE84437 datasets. Red and green indicate up- and downregulation. (c) Univariate and (d) multivariate Cox regression analyses of the RS model and other clinicopathological characteristics in GSE66229 dataset. (e) Univariate and (f) multivariate Cox regression analyses of the RS model and other clinicopathological characteristics in GSE84437 dataset.

3.5. Development of a Prognostic Nomogram Based on the EMT-Related RS Model

Independent risk factors were included in the prognostic nomogram for gastric cancer. In TCGA dataset, the nomogram including age, stage, and RS was constructed for predicting patients' survival duration (Figure 6(a)). The calibration curves confirmed that the nomogram-predicted 1-, 3-, and 5-year survival probabilities were in accord with observed survival duration (Figures 6(b)–6(d)). Similarly, the nomogram was developed in the GSE66229 dataset (Figure 6(e)). The well predictive efficacy was verified by the calibration curves (Figures 6(f)–6(h)). Meanwhile, the nomogram was validated in the GSE84437 dataset (Figures 6(i)–6(l)).
Figure 6

Discovery and verification of a prognostic nomogram based on the EMT-related RS model. (a) Establishment of a prognostic nomogram in TCGA dataset. (b–d) The calibration curves for the relationships between the nomogram-predicted and actual 1-, 3-, and 5-year survival probabilities. (e) Validation of the prognostic nomogram in GSE66229 dataset and (f–h) the calibration curves for the relationships between the nomogram-predicted and actual 1-, 3-, and 5-year survival probabilities. (i) Validation of the prognostic nomogram in GSE84437 dataset and (j–l) the calibration curves for the relationships between the nomogram-predicted and actual 1-, 3-, and 5-year survival probabilities.

3.6. Prediction of Underlying Small Molecular Compounds for Gastric Cancer Based on Dysregulated EMT-Related Genes

Totally, 209 differentially expressed genes were identified between the high- and low-RS groups (Supplementary Table 3). Based on them, underlying compounds were predicted by the CMap database, as listed in Table 4. The mechanism of action analysis was then conducted to investigate the shared mechanisms among the compounds. In Figure 7(a), estrogen receptor agonist was shared by dienestrol and diethylstilbestrol.
Table 4

Potential small compounds for treating gastric cancer based on dysregulated EMT-related genes.

RankCMap nameMean n Enrichment p SpecificityPercent nonnull
1Puromycin0.69440.9290.000040.0562100
2Trolox C0.46140.890.00014075
3Cloxacillin-0.4874-0.8690.0006075
4Indoprofen-0.3074-0.8150.002130.033350
5Diethylstilbestrol-0.3386-0.6630.004070.008250
6Caffeic acid0.39830.8530.00605066
7Benzamil-0.3026-0.6290.0081050
8STOCK1N-35874-0.6132-0.9160.014470.0331100
9Fasudil-0.4692-0.9040.018630100
10Amrinone0.5140.6880.019750.014775
1151558770.41940.6750.024410.131375
12Eticlopride-0.2794-0.6730.02570.075850
13Meropenem0.30940.6680.027110.016350
1416-Phenyltetranorprostaglandin E2-0.4864-0.6670.027650.047675
15Thapsigargin-0.4963-0.7570.029340.219466
16Pronetalol0.26540.6570.031910.008950
17Chloropyrazine-0.3284-0.6390.040480.064950
18Naltrexone-0.4185-0.5760.041330.089960
19Oxolamine-0.3554-0.6360.042550.150
20Oxybenzone-0.3134-0.6350.043350.126850
21Carisoprodol-0.3654-0.6330.044060.024850
22Piperine-0.3934-0.6270.047820.011850
Figure 7

Screening potential small molecular compounds and activated pathways associated with RS model in gastric cancer. (a) Candidate small molecular compounds that were predicted by the CMAP database based on differentially expressed EMT-related genes. X-axis shows mechanism of action, and y-axis represents small compounds. (b–d) Activated pathways in high-RS gastric cancer samples in (b) TCGA, (c) GSE66229, and (d) GSE84437 datasets.

3.7. Identification of the EMT-Related Gene Model Associated Signaling Pathways

In TCGA dataset, ECM receptor interaction (NES = 2.24, FDR = 0.004), focal adhesion (NES = 2.13, FDR = 0.007), pathway in cancer (NES = 2.06, FDR = 0.011), TGF-beta signaling pathway (NES = 2.01, FDR = 0.011), and Wnt signaling pathway (NES = 1.79, FDR = 0.033) were markedly activated in high-RS gastric cancer specimens (Figure 7(b)). The above activated pathways were confirmed in the GSE66229 (Figure 7(c)) and GSE84437 (Figure 7(d)) datasets.

3.8. Associations between the EMT-Related RS Model and Immune Microenvironment of Gastric Cancer

Using the ESTIMATE algorithm, we estimated the stromal score, immune score, and tumor purity of gastric cancer tissues from TCGA dataset and analyzed their relationships with the RS. Our data showed that high RS was distinctly related to increased stromal and immune scores as well as lowered tumor purity in gastric cancer (Figure 8(a)). Seven algorithms including TIMER, CIBERSORT, CIBERSORT-ABS, QUANTISEQ, MCPCOUNTER, XCELL, and EPIC were employed to estimate the immune cell infiltrations in each sample. We compared the differences in immune cell infiltrations between the high- and low-RS groups. In Figure 8(b), higher infiltration levels of CD4+ T cell, CD8+ T cell, cancer-associated fibroblast, and macrophage were found in the high-RS group than the low-RS group.
Figure 8

The relationships between the EMT-related RS model and immune microenvironment of gastric cancer. (a) Violin plots of stromal score, immune score, and tumor purity in high- and low-RS groups. (b) Heatmap showing infiltration levels of immune cells in high- and low-RS groups using seven algorithms including TIMER, CIBERSORT, CIBERSORT-ABS, QUANTISEQ, MCPCOUNTER, XCELL, and EPIC. ∗p < 0.05; ∗∗∗∗p < 0.0001.

4. Discussion

EMT-based gene signatures have been developed in bladder cancer [23], glioma [24], and colorectal cancer [25]. EMT is determined to be closely associated with gastric cancer progression and prognosis. Increased motility and invasiveness mediated by the EMT process are key during the initiation of cancer metastasis. However, no studies have reported the prognostic value of EMT-based signatures in gastric cancer. Here, we developed an EMT-related RS model that was comprised of SERPINE1, PCOLCE2, MATN3, and DKK1 in gastric cancer via the LASSO method, which may classify gastric cancer patients into the high- and low-risk categories. This LASSO method has been widely applied for analyzing high-dimensional data, which may screen feature signatures with robust prognostic potential and weak correlations among them to avoid overfitting [26]. Alterations in gene expression are in relation to the carcinogenic process. Here, we screened 67 upregulated and 12 downregulated EMT-related genes in gastric cancer. These genes were distinctly enriched in ECM organization, extracellular structure organization, and collagen fibril organization as well as several cancer-related pathways like focal adhesion, ECM-receptor interaction, PI3K-Akt signaling pathway, and proteoglycans in cancer, highlighting their critical implications in gastric cancer pathogenesis. By the LASSO method, we generated an EMT-based signature containing SERPINE1, PCOLCE2, MATN3, and DKK1. After validation, this signature was independently predictive of survival outcomes. Previously, SERPINE1 upregulation was found in gastric cancer and in relation to unfavorable prognoses [27]. Furthermore, it was tightly correlated to the EMT process in gastric cancer [28]. As an oncogene, it may facilitate tumor cell proliferation, migration, and invasion in gastric cancer through mediating the EMT process [29]. The roles of SERPINE1 on angiogenesis and metastasis in gastric cancer were also found [30]. MATN3 was aberrantly methylated and dysregulated in gastric cancer and related to an undesirable prognosis [31]. DKK1, as an inhibitor of Wnt signaling, was also in relation to survival outcomes of gastric cancer [32]. Nevertheless, more research should be conducted for investigating the roles of PCOLCE2 in gastric cancer progression. To facilitate personalized prediction of the patient's prognosis, we generated the nomogram by incorporating the RS model and traditional clinicopathological characteristics. These model-predicted survival probabilities were highly consistent with actual survival probabilities. Several small molecular compounds were predicted for treating gastric cancer based on the RS model such as puromycin, trolox C, cloxacillin, indoprofen, diethylstilbestrol, and caffeic acid. In our future studies, we will verify the therapeutic effects of these compounds on antigastric cancer by experiments. Our GSEA demonstrated that ECM receptor interaction, focal adhesion, pathway in cancer, TGF-beta signaling pathway, and Wnt signaling pathway were markedly activated in high-RS gastric cancer, indicating that this model was in relation to these pathways. The immune microenvironment exerts a key role in tumor progression. Our further analysis found tight associations between this model and immune microenvironment. This indicated that EMT might participate in reshaping the immune microenvironment of gastric cancer, which will be validated in our future research.

5. Conclusion

Collectively, our study established an EMT-based signature that may robustly predict gastric cancer prognosis and improve the efficacy of personalized therapy. The predictive performance will be verified in a larger cohort of gastric cancer.
  32 in total

1.  Identification of functional lncRNAs in gastric cancer by integrative analysis of GEO and TCGA data.

Authors:  Xianqin Zhang; Wanfeng Zhang; Yuyou Jiang; Kun Liu; Longke Ran; Fangzhou Song
Journal:  J Cell Biochem       Date:  2019-05-28       Impact factor: 4.429

Review 2.  Regulatory networks defining EMT during cancer initiation and progression.

Authors:  Bram De Craene; Geert Berx
Journal:  Nat Rev Cancer       Date:  2013-02       Impact factor: 60.716

3.  SERPINE1 as a cancer-promoting gene in gastric adenocarcinoma: facilitates tumour cell proliferation, migration, and invasion by regulating EMT.

Authors:  Jun-Dong Yang; Lin Ma; Zhen Zhu
Journal:  J Chemother       Date:  2019-11-14       Impact factor: 1.714

Review 4.  EMT Transition States during Tumor Progression and Metastasis.

Authors:  Ievgenia Pastushenko; Cédric Blanpain
Journal:  Trends Cell Biol       Date:  2018-12-26       Impact factor: 20.808

Review 5.  Context-dependent EMT programs in cancer metastasis.

Authors:  Nicole M Aiello; Yibin Kang
Journal:  J Exp Med       Date:  2019-04-11       Impact factor: 14.307

6.  Tumor-associated neutrophils induce EMT by IL-17a to promote migration and invasion in gastric cancer cells.

Authors:  Sen Li; Xiliang Cong; Hongyu Gao; Xiuwen Lan; Zhiguo Li; Wenpeng Wang; Shubin Song; Yimin Wang; Chunfeng Li; Hongfeng Zhang; Yuzhou Zhao; Yingwei Xue
Journal:  J Exp Clin Cancer Res       Date:  2019-01-07

7.  Statistical predictions with glmnet.

Authors:  Solveig Engebretsen; Jon Bohlin
Journal:  Clin Epigenetics       Date:  2019-08-23       Impact factor: 6.551

8.  Single-cell dissection of intratumoral heterogeneity and lineage diversity in metastatic gastric adenocarcinoma.

Authors:  Ruiping Wang; Minghao Dang; Kazuto Harada; Guangchun Han; Fang Wang; Melissa Pool Pizzi; Meina Zhao; Ghia Tatlonghari; Shaojun Zhang; Dapeng Hao; Yang Lu; Shuangtao Zhao; Brian D Badgwell; Mariela Blum Murphy; Namita Shanbhag; Jeannelyn S Estrella; Sinchita Roy-Chowdhuri; Ahmed Adel Fouad Abdelhakeem; Yuanxin Wang; Guang Peng; Samir Hanash; George A Calin; Xingzhi Song; Yanshuo Chu; Jianhua Zhang; Mingyao Li; Ken Chen; Alexander J Lazar; Andrew Futreal; Shumei Song; Jaffer A Ajani; Linghua Wang
Journal:  Nat Med       Date:  2021-01-04       Impact factor: 53.440

9.  Deconvolution of diffuse gastric cancer and the suppression of CD34 on the BALB/c nude mice model.

Authors:  Seon-Jin Yoon; Jungmin Park; Youngmin Shin; Yuna Choi; Sahng Wook Park; Seok-Gu Kang; Hye Young Son; Yong-Min Huh
Journal:  BMC Cancer       Date:  2020-04-15       Impact factor: 4.430

10.  An EMT-related gene signature for the prognosis of human bladder cancer.

Authors:  Rui Cao; Lushun Yuan; Bo Ma; Gang Wang; Wei Qiu; Ye Tian
Journal:  J Cell Mol Med       Date:  2019-10-28       Impact factor: 5.310

View more
  4 in total

Review 1.  Cancer-Associated Fibroblasts: Mechanisms of Tumor Progression and Novel Therapeutic Targets.

Authors:  Ralf-Peter Czekay; Dong-Joo Cheon; Rohan Samarakoon; Stacie M Kutz; Paul J Higgins
Journal:  Cancers (Basel)       Date:  2022-02-27       Impact factor: 6.639

2.  Metastasis Related Epithelial-Mesenchymal Transition Signature Predicts Prognosis and Response to Immunotherapy in Gastric Cancer.

Authors:  Junquan Song; Rongyuan Wei; Shiying Huo; Jianpeng Gao; Xiaowen Liu
Journal:  Front Immunol       Date:  2022-06-13       Impact factor: 8.786

3.  Identifying Differential Expression Genes and Prognostic Signature Based on Subventricular Zone Involved Glioblastoma.

Authors:  Qing Yuan; Fu-Xing Zuo; Hong-Qing Cai; Hai-Peng Qian; Jing-Hai Wan
Journal:  Front Genet       Date:  2022-07-08       Impact factor: 4.772

4.  Identification of candidate biomarkers associated with gastric cancer prognosis based on an integrated bioinformatics analysis.

Authors:  Yong Liu; Da-Xiu Wang; Xiao-Jing Wan; Xian-Hong Meng
Journal:  J Gastrointest Oncol       Date:  2022-08
  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.