Literature DB >> 33343628

Identification of EMT-Related Gene Signatures to Predict the Prognosis of Patients With Endometrial Cancer.

Luya Cai1, Chuan Hu2, Shanshan Yu3, Lixiao Liu1, Jinduo Zhao1, Ye Zhao1, Fan Lin4, Xuedan Du3, Qiongjie Yu3, Qinqin Xiao5.   

Abstract

BACKGROUND: Endometrial cancer (EC) is one of the most common gynecological cancers. Epithelial-mesenchymal transition (EMT) is believed to be significantly associated with the malignant progression of tumors. However, there is no relevant study on the relationship between EMT-related gene (ERG) signatures and the prognosis of EC patients.
METHODS: We extracted the mRNA expression profiles of 543 tumor and 23 normal tissues from The Cancer Genome Atlas database. Then, we selected differentially expressed ERGs (DEERGs) among these mRNAs. Next, univariate and multivariate Cox regression analyses were performed to select the ERGs with predictive ability for the prognosis of EC patients. In addition, risk score models were constructed based on the selected genes to predict patients' overall survival (OS), progression-free survival (PFS), and disease-free survival (DFS). Finally, nomograms were constructed to estimate the OS and PFS of EC patients, and pan-cancer analysis was performed to further analyze the functions of a certain gene.
RESULTS: Six OS-, ten PFS-, and five DFS-related ERGs were obtained. By constructing the prognostic risk score model, we found that the OS, PFS, and DFS of the high-risk group were notably poorer. Last, we found that AQP5 appeared in all three gene signatures, and through pan-cancer analysis, it was also found to play an important role in immunity in lower grade glioma (LGG), which may contribute to the poor prognosis of LGG patients.
CONCLUSIONS: We constructed ERG signatures to predict the prognosis of EC patients using bioinformatics methods. Our findings provide a thorough understanding of the effect of EMT in patients with EC and provide new targets and ideas for individualized treatment, which has important clinical significance.
Copyright © 2020 Cai, Hu, Yu, Liu, Zhao, Zhao, Lin, Du, Yu and Xiao.

Entities:  

Keywords:  EMT; endometrial cancer; gene signature; nomogram; prognosis

Year:  2020        PMID: 33343628      PMCID: PMC7738567          DOI: 10.3389/fgene.2020.582274

Source DB:  PubMed          Journal:  Front Genet        ISSN: 1664-8021            Impact factor:   4.599


Introduction

Endometrial cancer (EC) is one of the most common malignant tumors in women. According to global cancer statistics, in 2018, there were approximately 382,000 new cases of EC worldwide, with nearly 90,000 deaths (Bray et al., 2018). The incidence rate of EC has increased year by year, especially in developed countries, and EC has become the most prevalent cancer among gynecological malignancies (Miller et al., 2019; Siegel et al., 2020). Patients with early stage EC usually have a good prognosis; however, for patients with advanced and recurrent EC, treatment options are extremely limited, and side effects are more serious, with a 5-year survival rate of only 10–30% (Morice et al., 2016; Clarke et al., 2019). Therefore, it is necessary to identify efficient biomarkers and therapeutic targets to predict and improve the prognosis of patients with EC. Epithelial–mesenchymal transition (EMT) is a biological process that transforms epithelial cells into stromal cells and is involved in embryogenesis, wound healing, and cancer progression (Nieto et al., 2016; Dongre and Weinberg, 2019; Yang J. et al., 2020). In tumors, EMT makes tumor cells more mobile and invasive and promotes cancer progression and metastasis, making them resistant to antitumor drugs (Marcucci et al., 2016). At present, some studies have found that molecular markers related to EMT were significantly related to the unfavorable clinical outcomes of patients with EC. For example, the lncRNA LINC01123 can promote the invasion and metastasis of EC by promoting the EMT pathway, which leads to disease progression and poor prognosis (Yang Y. et al., 2020). QKI, a circRNA regulator, was observed to have a mechanism that promotes EMT in EC, leading to poor clinical outcomes (Dou et al., 2020). Moreover, E-cadherin, an EMT-related protein, has decreased expression in EC, which represents an EMT-positive status that is significantly related to poor overall survival (OS) and progression-free survival (PFS) in patients with EC (Tanaka et al., 2013). However, single gene biomarkers cannot achieve a good prediction effect, and some studies have suggested that gene signatures may be a better choice for predicting patient outcomes. By mining public databases, a few scholars have studied the relationship between the EMT-related gene (ERG) signature and the prognosis of patients with cancer, such as glioma (Tao et al., 2020), gastric adenocarcinoma (Zhang et al., 2020), and head and neck squamous cell carcinoma (Kisoda et al., 2020). However, there is no bioinformatics research on this in EC. Therefore, in this study, we analyzed the relationship between the ERG signature and the prognosis of EC patients through The Cancer Genome Atlas (TCGA) database and constructed nomograms integrating clinical characteristics to estimate EC patients’ OS and PFS more conveniently. These findings help us better assess the prognosis of patients and provide new insights for the individualized treatment of patients with EC (Figure 1).
FIGURE 1

Flow chart of the bioinformatic analysis.

Flow chart of the bioinformatic analysis.

Materials and Methods

Patient Clinical Information and mRNA Expression Dataset

We downloaded the ERG list from the EMT gene database[1] (Supplementary Table 1) and extracted the mRNA expression profiles of 543 EC samples and 23 normal samples from TCGA database[2]. The clinical information of the patients included age, grade, AJCC stage, histologic type, OS, PFS, and disease-free survival (DFS) (Supplementary Table 2).

Identification and Analysis of Differentially Expressed ERGs (DEERGs)

We analyzed the expression levels of 1184 ERGs between EC and normal tissues with the limma package to screen out differentially expressed ERGs (DEERGs) and visualized them by volcano plots and heatmaps. False discovery rate <0.05 and | logFC| > 1 were defined as the significance threshold. Next, Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analyses were carried out on the basis of these DEERGs. Moreover, we used the STRING database to construct a protein–protein interaction (PPI) network of the selected genes to visualize the relationships between the DEERGs; 0.4 was defined as the minimum required interaction score, and genes with node degree >15 were selected as hub genes.

Construction and Analysis of the Prognostic Signature

To identify the prognostic value of the DEERGs in EC patients, univariate Cox regression analysis was performed to confirm OS-, PFS-, and DFS-related DEERGs. Then, LASSO analysis was used to avoid collinearity and multivariable Cox analysis was used to construct the ERG-based prognostic signatures. The linear combination of the expression values of the selected genes weighted by their respective regression coefficient from the multivariate Cox regression analysis was used to establish the prognostic risk score model as follows: risk score = Σ (βn × expression of gene n). Then, we divided the EC patients into high- and low-risk groups based on the median risk score. Kaplan–Meier (K-M) survival curves and the log-rank test were adopted to compare the prognostic differences between these two groups. In addition, we calculated receiver operating characteristic (ROC) curves to evaluate the discrimination of the prognostic model.

Establishment of EMT-Clinical Nomograms

Nomograms can predict the survival rate of individual tumor patients based on the values of multiple variables. We combined the risk score with clinical characteristics and constructed nomograms to calculate the OS and PFS of patients with EC more conveniently, making the gene signature more practical. We performed univariate Cox regression analyses on the clinical data of the patients from the TCGA database to investigate which variables are significantly related to OS and PFS. Next, stepwise multivariate analysis was performed based on the variables we obtained. Finally, combining clinical characteristics and the ERG-based risk signature, we established a nomogram by using the rms package.

Pan-Cancer Analysis of a Gene

According to the results of multivariable Cox analysis, we tried to determine whether there was a certain gene that affected the OS, PFS, and DFS of EC patients at the same time. If such a gene existed, we would analyze this gene through the UCSC database to observe its expression level in different tumor tissues. K-M plotter was used to analyze the relationship between this gene and the survival of various cancers, and log-rank P values and 95% CIs of the HR were calculated. In addition, we downloaded the scores of six immune-infiltrating cells in 33 cancers from the TIMER database and analyzed the correlation between the gene expression and the scores of these immune cells. We collected more than 40 common immune checkpoint genes and analyzed the relationship between gene expression and immune checkpoint gene expression.

Results

Preliminary Screening of ERGs

We obtained 1,184 ERGs from the EMT gene database and the mRNA expression profiles from the TCGA database, including 543 tumor samples and 23 normal samples (Figures 2A,B). By comparing tumor and normal tissue samples, we screened out 157 DEERGs. To further analyze the biological functions and significant pathways of these DEERGs, we carried out GO analysis and KEGG pathway enrichment analysis on these genes. As a result, GO analysis revealed that the primary functions of the genes regarding biological processes (BPs) were gland development, epithelial cell proliferation, and cell growth. For the cellular component (CC) category, extracellular matrix and collagen-containing extracellular matrix were the main enriched GO terms. For the molecular function (MF) category, proximal promoter sequence-specific DNA binding, receptor ligand activity, and receptor regulator activity were the most enriched (Figure 3A). For KEGG pathway enrichment analysis, microRNAs in cancer, proteoglycans in cancer, and the PI3K-Akt signaling pathway were most often enriched by the DEERGs (Figure 3B). The PPI network of these DEERGs is shown in Figure 3C. The hub genes were EZH2, IL6, SPP1, CDKN2A, TWIST1, EGF, FGF2, RUNX2, WNT3A, TNFSF11, and BDNF, which are displayed in red.
FIGURE 2

Differentially expressed EMT-related genes (ERGs) between endometrial cancer (EC) and normal tissues. (A) The volcano plot for the 1,184 ERGs from the TCGA-STAD cohort. (B) Heatmap for screened ERGs between 543 tumor samples and 23 normal samples.

FIGURE 3

GO and KEGG pathway enrichment analysis and PPI network of DEERGs. (A) Functional annotation of DEERGs by GO enrichment analysis. (B) Functional annotation of DEERGs by KEGG pathway enrichment analysis. (C) PPI network analysis of 157 DEERGs.

Differentially expressed EMT-related genes (ERGs) between endometrial cancer (EC) and normal tissues. (A) The volcano plot for the 1,184 ERGs from the TCGA-STAD cohort. (B) Heatmap for screened ERGs between 543 tumor samples and 23 normal samples. GO and KEGG pathway enrichment analysis and PPI network of DEERGs. (A) Functional annotation of DEERGs by GO enrichment analysis. (B) Functional annotation of DEERGs by KEGG pathway enrichment analysis. (C) PPI network analysis of 157 DEERGs.

Identification of DEERGs Associated With Prognosis

We analyzed the 157 DEERGs selected previously to screen out prognosis-related genes by univariate Cox regression analysis and found 41, 34, and 21 ERGs (P < 0.05) significantly associated with OS, PFS, and DFS, respectively (Figure 4). Next, we performed LASSO analysis (Supplementary Figure 1) and multivariable Cox analysis to further identify ERGs related to prognosis and obtained their respective coefficients. Finally, six ERGs (SIM2, PDCD1, AQP5, CDKN2A, ONECUT2, and SIX1), ten ERGs (MARVELD3, MYBL2, PKP3, PDCD1, BDNF, AQP5, CAV1, ESRP1, CDKN2A, and SIX1), and five ERGs (HIC1, MST1R, AQP5, HOXB9, and E2F1) that were associated with OS, PFS, and DFS, respectively, were obtained (Tables 1–3).
FIGURE 4

Forest plots of the prognosis-related ERGs by univariate Cox analysis. (A) OS-related ERGs. (B) PFS-related ERGs. (C) DFS-related ERGs. Green: protective factor, red: risk factor.

TABLE 1

Information on six prognostic ERGs significantly related to OS in patients with EC, which were used to constructed the final prediction model.

mRNAB (Cox)HRP
SIM20.25421.28940.0208
PDCD1−0.28170.75450.0330
AQP5−0.13250.87590.0086
CDKN2A0.15651.16950.0078
ONECUT20.35621.42790.0365
SIX10.22571.25320.0048
Forest plots of the prognosis-related ERGs by univariate Cox analysis. (A) OS-related ERGs. (B) PFS-related ERGs. (C) DFS-related ERGs. Green: protective factor, red: risk factor. Information on six prognostic ERGs significantly related to OS in patients with EC, which were used to constructed the final prediction model. Information on ten prognostic ERGs significantly related to PFS in patients with EC, which were used to constructed the final prediction model. Information on five prognostic ERGs significantly related to DFS in patients with EC, which were used to constructed the final prediction model.

Constructing Three ERG Signatures to Predict Patient Prognosis

Using the linear combination of the expression value of the selected genes and their respective regression coefficient from multiple Cox regression analysis, we established three risk scoring models to predict EC patients’ prognosis (OS, PFS, and DFS). Based on the prognosis risk score, the EC patients were divided into high- and low-risk groups by using the median risk score as the cutoff value (Figures 5A–C), and the respective survival status of the EC patients was obtained (Figures 5D–F). The K-M analysis showed that compared with those of the high-risk group, the OS, PFS, and DFS of the low-risk group were markedly better (P < 0.0001; Figures 5G–I). The AUCs for 1-, 3-, and 5-year OS, PFS, and DFS were all in the range of 0.630–0.735 (Figures 5J–L), suggesting that these three ERG signatures have excellent diagnostic significance for prognosis prediction. In addition, we generated heatmaps to exhibit the expression profiles of these three groups of ERGs, visualizing the expression of the ERGs in the high- and low-risk groups (Figures 5M–O).
FIGURE 5

The ERG signatures associated with risk score predicts OS, PFS, and DFS in patients with EC. (A–C) The risk score distribution of EC patients. (D–F) Survival status and survival times of EC patients. (G–I) K-M curve to test the predictive effect of the gene signature. (J–L) ROC curve analysis of risk score with survival time at 1, 2, and 3 years to evaluate the sensitivity and specificity of the gene signature. (M–O) A heatmap of the expression profile of ERGs, which constructed gene signatures. Red: low-risk group, blue: high-risk group. Color from yellow to blue indicates gene expression from low to high.

The ERG signatures associated with risk score predicts OS, PFS, and DFS in patients with EC. (A–C) The risk score distribution of EC patients. (D–F) Survival status and survival times of EC patients. (G–I) K-M curve to test the predictive effect of the gene signature. (J–L) ROC curve analysis of risk score with survival time at 1, 2, and 3 years to evaluate the sensitivity and specificity of the gene signature. (M–O) A heatmap of the expression profile of ERGs, which constructed gene signatures. Red: low-risk group, blue: high-risk group. Color from yellow to blue indicates gene expression from low to high.

Developed Nomograms Integrating Clinical Characteristics Showed That Several Clinical Characteristics Were Significantly Related to the Risk Score

In univariate Cox regression analyses, we found that risk score, age, AJCC stage, grade, histologic type, and margin status were significantly associated with the OS and PFS of EC patients, and race, hypertension, and surgical approach can also significantly influence patients’ PFS (Table 4). Then, we further performed multivariate Cox regression analyses and the results showed that risk score (HR 2.41; 95% CI 1.32 to 4.40; P = 0.004), age (HR 1.03; 95% CI 1.00 to 1.05; P = 0.032), AJCC stage (HR 2.57; 95% CI 1.46 to 4.54; P = 0.001), and margin status (HR 1.94; 95% CI 1.04 to 3.64; P = 0.039) were independently related to OS (Table 5). Based on these independent survival indicators, we constructed a nomogram prediction model to predict EC patients’ OS (Figure 6A). In addition, another nomogram was constructed to predict the PFS of patients with EC (Figure 6E), in which the risk score (HR 4.12; 95% CI 2.38 to 7.12; P < 0.001), AJCC stage (HR 1.76; 95% CI 1.13 to 2.73; P = 0.012), and surgical approach (HR 0.62; 95% CI 0.40 to 0.96; P = 0.033) were integrated as independent risk factors (Table 5). The calibration plot showed that in these two nomograms, the predicted values of OS or PFS at 1, 2, and 3 years for EC patients have a good correlation with the actual values (Figures 6B–D, F–H).
TABLE 4

Univariate Cox regression analyses for identifying clinical characteristics related to EC patients’ OS and PFS.

Clinical featureUnivariate analysis (OS)
Univariate analysis (PFS)
HR95%CI of HRP valueHR95%CI of HRP value
Risk score3.6642.280–5.888<0.0013.7072.570–5.348<0.001
Age, y1.0341.014–1.054<0.0011.0211.006–1.0360.006
Race
Black1 (reference)1 (reference)
Other0.5400.201–1.4500.2220.3500.136–0.9020.030
White0.8760.525–1.4610.6120.9130.612–1.3630.658
AJCC stage
I1 (reference)1 (reference)
II-IV3.6672.377–5.657<0.0012.2311.616–3.078<0.001
Grade
Low1 (reference)1 (reference)
High3.4131.988–5.859<0.0011.8771.306–2.696<0.001
Histologic_type
EEA1 (reference)1 (reference)
MSE2.8801.230–6.7440.0151.9770.959–4.0780.065
SEA2.8951.878–4.461<0.0012.0491.450–2.894<0.001
Hypertension
NO1 (reference)1 (reference)
YES0.2820.047–1.6990.1670.2030.053–0.7700.019
Radiotherapy
NO1 (reference)1 (reference)
YES0.4480.071–2.8320.3940.6850.170–2.7610.594
Margin_status
R01 (reference)1 (reference)
R1/23.2611.852–5.744<0.0012.1921.319–3.6430.002
Surgical_approach
Minimally Invasive1 (reference)1 (reference)
Open0.7530.489–1.1600.1980.6700.480–0.9360.019
TABLE 5

Multivariate Cox regression analyses for identifying independent clinical characteristics related to EC patients’ OS and PFS.

Clinical featureMultivariate analysis (OS)
Multivariate analysis (PFS)
HR95%CI of HRP valueHR95%CI of HRP value
Risk score2.4131.324–4.3980.0044.1202.383–7.122<0.001
Age, y1.0301.002–1.0520.0320.9990.978–1.0200.910
Race
Black1 (reference)
Other0.5110.164–1.5900.246
White1.4000.822–2.5590.246
AJCC stage
I1 (reference)1 (reference)
II-IV2.5721.459–4.5360.0011.7561.130–2.7290.012
Grade
Low1 (reference)1 (reference)
High1.4640.739–2.9000.2740.6380.367–1.1100.112
Histologic_type
EEA1 (reference)1 (reference)
MSE1.2370.426–3.5920.6961.3180.497–3.4950.579
SEA0.8750.485–1.5810.6591.1930.700–2.0350.516
Margin_status
R01 (reference)1 (reference)
R1/21.9411.035–3.6410.0391.6910.946–3.0200.076
Surgical_approach
Minimally Invasive1 (reference)
Open0.6220.402–0.9630.033
FIGURE 6

The establishment of two nomograms which can predict the prognosis probability of EC patients. (A) OS nomogram was constructed combined with risk score and three clinical characteristic (margin status, age, and AJCC stage). (B–D) The calibration plot of OS nomogram for 1-, 2-, and 3-year survival. (E) PFS nomogram was constructed combined with risk score and two clinical characteristic (surgical approach and AJCC stage). (F–H) The calibration plot of PFS nomogram for 1-, 2-, and 3-year PFS.

Univariate Cox regression analyses for identifying clinical characteristics related to EC patients’ OS and PFS. Multivariate Cox regression analyses for identifying independent clinical characteristics related to EC patients’ OS and PFS. The establishment of two nomograms which can predict the prognosis probability of EC patients. (A) OS nomogram was constructed combined with risk score and three clinical characteristic (margin status, age, and AJCC stage). (B–D) The calibration plot of OS nomogram for 1-, 2-, and 3-year survival. (E) PFS nomogram was constructed combined with risk score and two clinical characteristic (surgical approach and AJCC stage). (F–H) The calibration plot of PFS nomogram for 1-, 2-, and 3-year PFS. In addition, we compared the relationship between the risk score and three clinicopathological features: grade, AJCC stage, and histological type, which were significantly related to both OS, PFS, and DFS. Patients in the high grade, AJCC stage II–IV, and serous endometrial adenocarcinoma groups had higher risk scores (P < 0.0001) in these three risk score models (Figures 7A–I). These results showed that the risk score is closely related to clinical characteristics.
FIGURE 7

The relationship between risk score and clinical characteristics. (A–C) Distribution of OS risk scores in different grades, AJCC stages, and histological types. (D–F) Distribution of PFS risk scores in different grades, AJCC stages, and histological types. (G–I) Distribution of DFS risk scores in different grades, AJCC stages, and histological types.

The relationship between risk score and clinical characteristics. (A–C) Distribution of OS risk scores in different grades, AJCC stages, and histological types. (D–F) Distribution of PFS risk scores in different grades, AJCC stages, and histological types. (G–I) Distribution of DFS risk scores in different grades, AJCC stages, and histological types.

Prognostic Potential of AQP5 and Its Relationship With Immunity Across Cancer Types

Based on the aforementioned results, we found that AQP5 is the only ERG that appeared in all three ERG signatures. Therefore, we further analyzed AQP5 to determine whether it has a certain effect on prognosis in other cancer types. First, we performed gene expression correlation analysis using the samples from the UCSC database and found that in most cancer types, there were differences in the expression level, which could be higher or lower in tumor tissues, except for in ACC, BLCA, and LAML (Figure 8A). Subsequently, we compared the relationship between the expression level of AQP5 and prognosis in 33 kinds of cancers from the TCGA database. The results showed that AQP5 expression in lower grade glioma (LGG), LUAD, and SKCM was significantly different for the OS of patients (Figures 8B–D), and in LGG, SKCM, and UCEC, it was significantly different for the disease-specific survival (DSS) of patients (Figures 8E–G). Moreover, in LGG and SKCM, the high expression of AQP5 was associated with worse OS and DSS (Figures 8B,D–F), whereas in LUAD, the high expression of AQP5 represented better OS with P = 0.0096 (Figure 8C) and in UCEC, the high expression of AQP5 represented better DSS with P < 0.001 (Figure 8G).
FIGURE 8

Gene expression correlation analysis and prognostic potential of AQP5. (A) Expression difference of AQP5 in 27 cancer types through UCSC database. (B–D) Analysis of the relationship between AQP5 expression and OS of patients in LGG, LUAD, and SKCM by K-M survival curves. (E–G) Analysis of the relationship between AQP5 expression and DSS of patients in LGG, SKCM, and UCEC by K-M survival curves. OS, overall survival; DSS, disease-specific survival. “–,” not significant, *P < 0.05, **P < 0.01, and ***P < 0.001. P < 0.05 was considered to be statistically significant.

Gene expression correlation analysis and prognostic potential of AQP5. (A) Expression difference of AQP5 in 27 cancer types through UCSC database. (B–D) Analysis of the relationship between AQP5 expression and OS of patients in LGG, LUAD, and SKCM by K-M survival curves. (E–G) Analysis of the relationship between AQP5 expression and DSS of patients in LGG, SKCM, and UCEC by K-M survival curves. OS, overall survival; DSS, disease-specific survival. “–,” not significant, *P < 0.05, **P < 0.01, and ***P < 0.001. P < 0.05 was considered to be statistically significant. In addition, we analyzed the relationship between the expression of AQP5 and the immune response in 33 cancer types. As a result, we found that the expression of AQP5 in LGG, PRAD, and THCA was significantly correlated with the level of immune infiltration (Figures 9A–C). Interestingly, in LGG, the higher expression level of AQP5 was related to poorer prognosis and higher immune infiltration levels of B cells (R = 0.15, P = 0.00059), CD8+ cells (R = 0.305, P = 3.36e-15), DC cells (R = 0.143, P = 0.00101), macrophages (R = 0.116, P = 0.00687), and neutrophils (R = 0.226, P = 1.64e-07) (Figure 9A). Moreover, in the analysis of the relationship between the expression of AQP5 and immune checkpoint gene expression, we found that in LGG, the AQP5 expression level had a significant positive correlation with the expression of most immune checkpoint genes (Figure 9D). These findings strongly suggest that AQP5 plays a prominent role in immunity in LGG, which may contribute to the poor prognosis of patients.
FIGURE 9

The relationship between AQP5 and immunity across cancer types. (A–C) Analysis of the relationship between AQP5 expression and the level of immune infiltration in LGG, PRAD, and THCA by TIMER database. (D) Relationship between AQP5 expression and the immune checkpoint gene expression.

The relationship between AQP5 and immunity across cancer types. (A–C) Analysis of the relationship between AQP5 expression and the level of immune infiltration in LGG, PRAD, and THCA by TIMER database. (D) Relationship between AQP5 expression and the immune checkpoint gene expression.

Discussion

In recent decades, increasing evidence has shown that EMT is closely related to the progression, metastasis, and drug resistance of cancers (Nieto et al., 2016). In tumors, EMT is activated, and the most important sign of this is the downregulation of E-cadherin. E-cadherin is a protein that spans the cell membrane and closely binds adjacent cells; it is an important molecule for maintaining the characteristics of epithelial cells. Its loss or downregulation promotes the distant dissemination of cancer cells (Thiery et al., 2009; Lamouille et al., 2014). In addition, related transcription factors, such as SNAIL, TWIST, and zinc finger E-box binding (ZEB), also play significant roles in the biological process of EMT, promoting cell invasion, migration, proliferation, and angiogenesis (Dongre and Weinberg, 2019). At present, many studies have indicated that EMT status is closely associated with the survival of cancer patients, and some ERG signatures have been constructed to predict the survival of patients with cancer. For example, in non-small cell lung cancer, E-cadherin was found to interact with epidermal growth factor receptor, playing a pivotal role in prognosis and progression (Witta et al., 2006). An EMT-related seven-gene signature was used to predict the survival of patients with glioma (Tao et al., 2020). Furthermore, Tan et al. (2014), found that EMT status was related to OS and DFS in patients with ovarian cancer, and EMT scoring was performed. However, studies on the relationship between ERGs and the prognosis of patients with EC are still very limited. Because a single gene biomarker cannot provide a strong prediction effect, a more accurate and reliable gene signature was used to predict the clinical outcomes of patients. In this study, we used the ERG signature for the first time to predict the survival of patients with EC and obtained a good prediction effect. First, we screened out the DEERGs between 543 EC samples and 23 normal samples from the TCGA database and selected OS-related, PFS-related, and DFS-related DEERGs with predictive ability for the prognosis of patients with EC through univariate and multivariate Cox regression analyses (P < 0.05). Subsequently, by constructing the prognostic risk score models, we found that there were significant differences in OS, PFS, and DFS between the high- and low-risk groups, and the OS, PFS, and DFS of the high-risk group were evidently worse with P < 0.0001. In addition, we constructed two nomograms integrating clinical characteristics that provide a more convenient way to estimate the OS and PFS of patients with EC. Among them, AJCC stage was an independent risk factor used to construct both of the nomograms. Of course, the independent prognostic indicators we obtained are to fully consider the existing data, and the possible influencing factors have been investigated one by one. However, there is no denying that many may be confused by our conclusion that clinical data may not be included in the research, but we believe that with the increasing improvement of database data, as well as our research that is gradually thorough, the future will be more objective to front the dialectical point of view to look at the impact of these clinical indices for prognosis. Moreover, according to our analysis, we found that AQP5 is the only ERG related to EC patients’ OS, PFS, and DFS. AQP5 is a kind of transmembrane water channel protein that plays pivotal roles in regulating the water balance in cells and in maintaining cell function and has been found in a variety of tumors. Many studies have proven that AQP5 is closely associated with the migration and proliferation of cancer cells, and it can become a prognostic marker and potential drug target. For example, in colorectal cancer, the overexpression of AQP5 can promote the invasion and migration of cancer cells by activating EMT (Chen et al., 2017). There are also many related studies in ovarian cancer, and the expression of AQP5 in ovarian cancer is significantly increased, which is consistent with our previous gene expression correlation analysis using samples from the UCSC database, and is associated with the formation of ascites (Yang et al., 2006; Tiwari et al., 2014). In addition, the downregulation of AQP5 expression can inhibit ovarian cancer development (Yan et al., 2014). Furthermore, AQP5 also plays a key role in cervical cancer; its high expression is positively correlated with the expression of Ki-67, and both are significantly correlated with lymph node metastasis and poor prognosis (Zhang et al., 2012). On the basis of our pan-cancer analysis, in LGG and SKCM, the high expression of AQP5 was related to worse OS and DSS, which is consistent with the hypothesis that AQP5 may promote the proliferation and metastasis of tumor cells. However, in LUAD and UCEC, the high expression of AQP5 was associated with a better prognosis. What is more, in our study, AQP5 is found to be a protective factor for EC patients. Therefore, we suggested that AQP5 may play different roles in different cancer types with different mechanisms, and that in certain cancers may promote the proliferation of tumor cells and in certain cancers may inhibit the growth of tumor cells, but the specific mechanism is not clear. In our study, although AQP5 plays a protective role in patients with EC, there are no relevant studies that have reported its specific molecular mechanism in EC, and further in-depth studies are needed. In addition to AQP5, we also found that only PDCD1, SIX1, and ONECUT2 are ERGs that were repeatedly found in the gene signatures related to OS and PFS in EC patients. PDCD1 is a gene that encodes programmed cell death protein 1 (PD-1). PD-1 is an important immunosuppressive molecule, and its ligands are PD-L1 and PD-L2. The high expression of PD-1 in EC is positively correlated with high levels of microsatellite instability (MSI) and better OS (Kim et al., 2018; Sungu et al., 2019), which are consistent with our multivariable Cox analysis that PDCD1 is a protective factor with hazard ratio (HR) <1. Oncofetal protein sine oculis-related homeobox 1 (SIX1) is a transcription factor that plays a key role in the proliferation and development of tumor cells and is a development EMT regulator (Christensen et al., 2008; Micalizzi et al., 2010; Wu et al., 2015). In EC, SIX1 also plays an important role, and as a disease biomarker (Suen et al., 2016), its overexpression can promote the growth of tumor cells through ERK- and AKT-mediated pathways (Xin et al., 2016). One cut homeobox 2 (ONECUT2) is a transcription factor related to tumor cell proliferation, which is closely associated with the EMT process of cancer cells (Sun et al., 2014; Wang et al., 2020). Some studies have found that its expression is significantly related to the progression of ovarian cancer (Lu et al., 2018), prostate cancer (Joglekar et al., 2020), lung adenocarcinoma (Ma et al., 2019), etc. However, there are few studies that have studied the role of this gene in EC. It is undeniable that this study does have some limitations. On the one hand, the prediction models constructed by the genes we selected need to be independently validated before they can be used to evaluate the prognosis of patients with EC. Unfortunately, the GEO database lacks an EC dataset with OS, PFS, or DFS, so the stability of the prognostic models has not been verified. On the other hand, there is a limitation that we did not integrate multi-omics data which would improve identification accuracy and prediction performance of prognostic models. Last but not least, in this paper, Cox regression is used to screen variables and establish a prediction model, which is a statistical method widely used in survival analysis. However, in recent years, with the development of science and technology, many better algorithms are gradually developed (Wu and Ma, 2015; Wu et al., 2018; Ren et al., 2019). We also hope that these methods applied to the prognosis of tumor-related research in the future. In summary, we first identified the relationship between the ERG signature and the prognosis of patients with EC using bioinformatics methods and found that patients in the high-risk group had significantly lower OS, PFS, and DFS rates than those in the low-risk group. Furthermore, we found that the ERG AQP5, which is related to OS, PFS, and DFS in patients with EC, has a close relationship with the prognosis of LGG patients.

Conclusion

In brief, we constructed ERG signatures to predict the prognosis of patients with EC and built nomograms to estimate the prognosis of EC patients more accurately. In addition, AQP5 was the ERG that affected EC patients’ OS, PFS, and DFS, and we further explored its role in other cancer types through pan-cancer analysis. These findings provide new insights into the role of EMT in EC, guiding individualized treatment for patients with EC.

Data Availability Statement

The original contributions presented in the study are included in the article/Supplementary Material, further inquiries can be directed to the corresponding author/s.

Author Contributions

QX conceived and designed the study with LC and CH. LC and SY drafted the manuscript and analyzed the data with LL and CH. JZ and YZ handled the picture and manuscript format. FL, XD, and QY reviewed the data. All authors have read and approved the final published manuscript.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
TABLE 2

Information on ten prognostic ERGs significantly related to PFS in patients with EC, which were used to constructed the final prediction model.

mRNAB (Cox)HRP
MARVELD3−0.28240.75400.0764
MYBL20.13961.14980.1024
PKP3−0.22290.80020.0054
PDCD1−0.16110.85120.1016
BDNF−0.42110.65630.0568
AQP5−0.10740.89820.0098
CAV1−0.16040.85180.1018
ESRP10.33901.40350.0014
CDKN2A0.08691.09080.0715
SIX10.12891.13750.0527
TABLE 3

Information on five prognostic ERGs significantly related to DFS in patients with EC, which were used to constructed the final prediction model.

mRNAB (Cox)HRP
HIC1−0.55540.57380.0181
MST1R−0.2080.81220.0601
AQP5−0.11220.89390.0295
HOXB90.11141.11790.0538
E2F10.24031.27170.0332
  39 in total

1.  Epithelial-mesenchymal transition: a new target in anticancer drug discovery.

Authors:  Fabrizio Marcucci; Giorgio Stassi; Ruggero De Maria
Journal:  Nat Rev Drug Discov       Date:  2016-01-29       Impact factor: 84.694

2.  ONECUT2 is a novel target for treatment of castration-resistant prostate cancer.

Authors:  Tejashree Joglekar; Jianqing Lin; Maho Shibata
Journal:  Expert Opin Ther Targets       Date:  2020-02-03       Impact factor: 6.902

3.  Robust network-based regularization and variable selection for high-dimensional genomic data in cancer prognosis.

Authors:  Jie Ren; Yinhao Du; Shaoyu Li; Shuangge Ma; Yu Jiang; Cen Wu
Journal:  Genet Epidemiol       Date:  2019-02-11       Impact factor: 2.135

4.  Prognostic implication of programmed cell death 1 protein and its ligand expressions in endometrial cancer.

Authors:  Jisup Kim; Sinae Kim; Hye Sun Lee; Wookyeom Yang; Hanbyoul Cho; Doo Byung Chay; Seong Jin Cho; Soonwon Hong; Jae-Hoon Kim
Journal:  Gynecol Oncol       Date:  2018-03-21       Impact factor: 5.482

5.  STAT1-induced upregulation of lncRNA LINC01123 predicts poor prognosis and promotes the progression of endometrial cancer through miR-516b/KIF4A.

Authors:  Yuguang Yang; Jin Wu; Hongfeng Zhou; Wenming Liu; Jincai Wang; Qingyuan Zhang
Journal:  Cell Cycle       Date:  2020-05-13       Impact factor: 4.534

6.  Proteogenomic Characterization of Endometrial Carcinoma.

Authors:  Yongchao Dou; Emily A Kawaler; Daniel Cui Zhou; Marina A Gritsenko; Chen Huang; Lili Blumenberg; Alla Karpova; Vladislav A Petyuk; Sara R Savage; Shankha Satpathy; Wenke Liu; Yige Wu; Chia-Feng Tsai; Bo Wen; Zhi Li; Song Cao; Jamie Moon; Zhiao Shi; MacIntosh Cornwell; Matthew A Wyczalkowski; Rosalie K Chu; Suhas Vasaikar; Hua Zhou; Qingsong Gao; Ronald J Moore; Kai Li; Sunantha Sethuraman; Matthew E Monroe; Rui Zhao; David Heiman; Karsten Krug; Karl Clauser; Ramani Kothadia; Yosef Maruvka; Alexander R Pico; Amanda E Oliphant; Emily L Hoskins; Samuel L Pugh; Sean J I Beecroft; David W Adams; Jonathan C Jarman; Andy Kong; Hui-Yin Chang; Boris Reva; Yuxing Liao; Dmitry Rykunov; Antonio Colaprico; Xi Steven Chen; Andrzej Czekański; Marcin Jędryka; Rafał Matkowski; Maciej Wiznerowicz; Tara Hiltke; Emily Boja; Christopher R Kinsinger; Mehdi Mesri; Ana I Robles; Henry Rodriguez; David Mutch; Katherine Fuh; Matthew J Ellis; Deborah DeLair; Mathangi Thiagarajan; D R Mani; Gad Getz; Michael Noble; Alexey I Nesvizhskii; Pei Wang; Matthew L Anderson; Douglas A Levine; Richard D Smith; Samuel H Payne; Kelly V Ruggles; Karin D Rodland; Li Ding; Bing Zhang; Tao Liu; David Fenyö
Journal:  Cell       Date:  2020-02-13       Impact factor: 41.582

7.  Up-regulated ONECUT2 and down-regulated SST promote gastric cell migration, invasion, epithelial-mesenchymal transition and tumor growth in gastric cancer.

Authors:  G-H Wang; Y-M Zhou; Z Yu; J-P Deng; S-F Liu; C-Z Wei; Y Feng; M Mao; Z Wang
Journal:  Eur Rev Med Pharmacol Sci       Date:  2020-09       Impact factor: 3.507

Review 8.  A selective review of robust variable selection with applications in bioinformatics.

Authors:  Cen Wu; Shuangge Ma
Journal:  Brief Bioinform       Date:  2014-12-05       Impact factor: 13.994

9.  Blockade of ONECUT2 expression in ovarian cancer inhibited tumor cell proliferation, migration, invasion and angiogenesis.

Authors:  Tongyi Lu; Binhua Wu; Yunfei Yu; Wenhui Zhu; Simin Zhang; Yinmei Zhang; Jiaying Guo; Ning Deng
Journal:  Cancer Sci       Date:  2018-06-13       Impact factor: 6.716

Review 10.  New insights into the mechanisms of epithelial-mesenchymal transition and implications for cancer.

Authors:  Anushka Dongre; Robert A Weinberg
Journal:  Nat Rev Mol Cell Biol       Date:  2019-02       Impact factor: 94.444

View more
  6 in total

1.  Epithelial-Mesenchymal Transition-Based Gene Signature and Distinct Molecular Subtypes for Predicting Clinical Outcomes in Breast Cancer.

Authors:  Lili Hou; Shuang Hou; Lei Yin; Shuai Zhao; Xiaohua Li
Journal:  Int J Gen Med       Date:  2022-03-30

2.  Identification of the Ferroptosis-Associated Gene Signature to Predict the Prognostic Status of Endometrial Carcinoma Patients.

Authors:  Jinlong Qin; Xiaowen Shao; Lei Wu; Hongling Du
Journal:  Comput Math Methods Med       Date:  2021-09-07       Impact factor: 2.809

3.  An EMT-Related Gene Signature for Predicting Response to Adjuvant Chemotherapy in Pancreatic Ductal Adenocarcinoma.

Authors:  Zengyu Feng; Kexian Li; Jianyao Lou; Yulian Wu; Chenghong Peng
Journal:  Front Cell Dev Biol       Date:  2021-04-30

4.  Identification of the EMT-Related Genes Signature for Predicting Occurrence and Progression in Thyroid Cancer.

Authors:  Qiang Li; Sheng Jiang; Tienan Feng; Tengteng Zhu; Biyun Qian
Journal:  Onco Targets Ther       Date:  2021-05-12       Impact factor: 4.147

5.  Establishing a Prognostic Signature Based on Epithelial-Mesenchymal Transition-Related Genes for Endometrial Cancer Patients.

Authors:  Jinhui Liu; Guoliang Cui; Shuning Shen; Feng Gao; Hongjun Zhu; Yinghua Xu
Journal:  Front Immunol       Date:  2022-01-14       Impact factor: 7.561

6.  Endothelial cell-specific molecule 1 (ESM1) promoted by transcription factor SPI1 acts as an oncogene to modulate the malignant phenotype of endometrial cancer.

Authors:  Yu He; Lu Lin; Yurong Ou; Xiaowen Hu; Chi Xu; Caizhi Wang
Journal:  Open Med (Wars)       Date:  2022-08-26
  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.