Literature DB >> 36246561

Five EMT-Related Gene Signatures Predict Acute Myeloid Leukemia Patient Outcome.

Jing Qi1, Jiawei Yan1, Muhammad Idrees2, Saeedah Musaed Almutairi3, Rabab Ahmed Rasheed4, Usama Ahmed Hussein5, Mostafa A Abdel-Maksoud3, Ran Wang1, Jun Huang1, Chen Huang1, Nana Wang1, Dongping Huang1, Yuan Hui6, Chen Li7.   

Abstract

Background: The epithelial mesenchymal transition (EMT) gene has been shown to be significantly associated with the prognosis of solid tumors; however, there is a lack of models for the EMT gene to predict the prognosis of AML patients.
Methods: First, we downloaded clinical data and raw transcriptome sequencing data from the TCGA database of acute myeloid leukemia (AML) patients. All currently confirmed EMT-related genes were obtained from the dbEMT 2.0 database, and 30% of the TCGA data were randomly selected as the test set. Univariate Cox regression analysis, random forest, and lasso regression were used to optimize the number of genes for model construction, and multivariate Cox regression was used for model construction. Area under the ROC curve was used to assess the efficacy of the model application, and the internal validation set was used to assess the stability of the model.
Results: A total of 173 AML samples were downloaded, and a total of 1184 EMT-related genes were downloaded. The results of univariate batch Cox regression analysis suggested that 212 genes were associated with patient prognosis, random forest and lasso regression yielded 18 and 8 prognosis-related EMT genes, respectively, and the results of multifactorial COX regression model suggested that 5 genes, CBR1, HS3ST3B1, LIMA1, MIR573, and PTP4A3, were considered as independent risk factors affecting patient prognosis. The model ROC results suggested that the area under the curve was 0.868 and the internal validation results showed that the area under the curve was 0.815.
Conclusion: During this study, we constructed a signature model of five EMT-related genes to predict overall survival in patients with AML; it will provide a useful tool for clinical decision making.
Copyright © 2022 Jing Qi et al.

Entities:  

Mesh:

Substances:

Year:  2022        PMID: 36246561      PMCID: PMC9568336          DOI: 10.1155/2022/7826393

Source DB:  PubMed          Journal:  Dis Markers        ISSN: 0278-0240            Impact factor:   3.464


1. Introduction

Acute myeloid leukemia (AML) is the most common type of acute leukemia in adults, characterized by a low remission rate, high relapse rate, high disease-specific mortality, and poor prognosis. The incidence of AML increases with age, and more than 20,000 cases are diagnosed per year in the United States, and over 50% of patients died from this disease [1, 2]. Although advances in immunology, cytogenetics, and molecular biology have laid the groundwork for stratified and precise treatment of AML, up to 50% of patients with normal karyotype have a wide range of clinical outcomes [3]. Thus, it is crucial to develop more risk standards and predictive models for predicting the prognosis and directing treatments of AML. AML is a highly heterogeneous group of diseases with uncontrolled proliferation and differentiation of abnormally clonal myeloid stem cells. The application of next-generation sequencing (NGS) technology and bioinformatic analysis has provided systemically studies of genome and transcriptome data to unravel the mutational spectrum, epigenetic landscape, and RNA interaction network of these clonal leukemia cells [4], which help to construct different models to predict prognosis and discover potential biomarkers of AML [5, 6]. Epithelial to mesenchymal transition (EMT) is a dynamic process with the transition of epithelial cells to mesenchymal cell phenotype, which has played important roles in embryonic development and wound healing, and this process is also thought to be involved in cancer progression and therapy resistance [7, 8]. The overexpression of EMT markers and EMT transcription factors (TFs) has been proved to correlate with tumor aggressiveness and poor prognosis [9, 10]. In addition, recent studies have shown that cancer cells with the EMT process may contribute to immune escape and drug resistance, thereby reducing the effect of immunotherapy and chemotherapy [11-13]. As in hematological malignancies, previous studies already indicated a correlation between some EMT markers and poor prognosis. For example, the upregulation of vimentin, one of the EMT markers, was found associated with poor clinical outcome in AML patients [14], and downregulation of ZEB1 in AML cells can inhibit the invasive ability [15]. Taken together, all these indicate that EMT markers and EMT-TFs involve in the progression of AML, and EMT-related signatures could be used as potential target for predicting prognosis. However, more of its specific biological function still needs to be explored.

2. Materials and Methods

2.1. Data Acquisition and Preprocessing

A total of 173 AML samples were obtained from the The Cancer Genome Atlas (TCGA) database, a landmark cancer genomic program, which contains more than 20,000 primary cancer and matched normal samples spanning 33 cancer types. The corresponding transcriptome sequencing data of the AML dataset were downloaded and normalized to FPKM format. EMT-related genes were obtained from the dbEMT2.0 database, which contains a total of 1184 experimentally confirmed EMT-related genes. Then, we extracted the expression profiles of EMT-related genes from the normalized matrix based on the obtained EMT-related gene names. Finally, the expression profiles were combined with clinical information to generate a new matrix, and 30% of the data were randomly extracted from this matrix and set as the test set. For clinical data, it is necessary that the enrolled patients have a complete follow-up time, those samples with missing survival time and survival status are excluded from the cohort, and overall patient survival is defined as the endpoint event.

2.2. Batch Univariate COX Regression Screening for Prognosis-Associated EMT Genes

Not all EMT-associated genes affect patient survival; therefore, further screening of EMT-associated genes that affect patient prognosis is necessary. We included 1184 EMT-related genes from the EMT database in a univariate COX regression model with p < 0.05 as a filtering condition in order to screen for risk factors that affect the prognosis of AML patients.

2.3. Machine Learning to Screen Prognosis-Associated EMT Genes

Randomized survival forest and lasso regression are machine learning algorithms that are often used for dimensionality reduction analysis. The prognostic genes obtained from the above analyses were included in the random survival forest, which was performed by the R package “random forest”, and the importance threshold of the variables was set to 0.45. Variables above this threshold were included in the lasso regression for further dimensionality reduction.

2.4. Multivariate Cox Regression and Model Construction

We first included the prognostic factors obtained from the lasso regression into the multivariate Cox regression to screen the independent risk factors affecting the prognosis of AML patients and then constructed a multigene prognostic model based on the coefficients of the regression model.

2.5. Model Efficacy Assessment and Internal Validation

We assessed whether there was a difference in the prognosis of patients in the high- and low-risk groups using the log rank test and then assessed the applied efficacy of the model using the area under the ROC curve. In addition, to validate the stability of the model, 30% of the randomly selected data from the original data were used as the test set for this evaluation.

3. Results

3.1. Training Set Prognosis-Related EMT Gene Screening and Model Construction

The results of the univariate batch COX regression analysis suggested that 212 EMT-related genes were associated with prognosis in AML patients. Top 20 prognosis-related genes are presented in Table 1. These 212 genes were included in the random survival forest model, and a total of 18 prognosis genes were selected when the gene importance was set greater than 0.45 (Figures 1(a)–1(c)), and these 18 genes were subsequently included in the lasso regression model for dimensionality reduction analysis, and a total of 8 genes were selected (Figures 2(a) and 2(b)). Further, we included these 5 genes into the multifactorial COX regression model, and a total of 5 genes were selected, and they were considered as independent risk factors affecting the prognosis of patients (Table 2). These 5 genes were CBR1, HS3ST3B1, LIMA1, MIR573, and PTP4A3. Five EMT-associated genes were further modeled for signature based on COX regression coefficients.
Table 1

Top 20 candidate genes of univariate Cox regression analysis results.

Candidate genesUnivariate Cox regression
HR95% CI P value
LowHigh
PTP4A31.0217262231.0143217631.0291847346.96E-09
CBR11.038209741.0250678271.0515201397.96E-09
ROR18.1791476583.37247869619.836583843.33E-06
ETS21.0053077941.002954531.007666589.55E-06
HIP11.0140755231.0076661621.0205256531.56E-05
PLA2G4A1.0212765971.0115314331.0311156461.68E-05
SRC1.0411052981.0218835721.0606885872.27E-05
KRT72.3056245081.54946943.430790163.80E-05
HOXB71.0177361461.0086655581.0268883020.000118629
PEBP49.2389643162.97620797428.680274490.000119556
UCP21.0012281921.000590241.0018665510.000160359
CDK51.0252600061.0119904181.038703590.000174577
CCL221.5211623351.2196396321.8972283180.000198028
RNF81.1478679461.0668002931.2350960440.000223934
LIMA11.0737899951.0334939771.1156571590.00026414
SPRR2A18025860.72044.9528081.58894E+110.000312518
BMP21.3636577521.1504709441.6163489170.000348845
LYPD31.5935863591.2319653572.0613546240.000387329
STIM21.048188121.0212920871.0757924680.000387406
BAG31.0304825291.0133561911.0478983120.000445426
Figure 1

Random survival forest select candidate EMT-related prognosis genes. The error estimate probability (a), the bar plot of genes (b), and candidate important genes (importance >0.45) (c).

Figure 2

Lasso regression model select candidate EMT-related prognosis genes. Lambda takes the minimum value; a total of eight candidate genes are selected (a), and (b) demonstrates the prognostic value of these eight genes.

Table 2

Multivariate Cox regression analysis of candidate genes.

Candidate genesMultivariate Cox regression
CoefHR95% CI P value
LowHigh
CBR10.02861.02901.01471.04366.62E-05
HS3ST3B1−0.04580.95520.91310.99930.0466
LIMA10.04151.04231.00781.07810.0160
MIR573−0.01340.98670.97161.00200.0888
PTP4A30.01451.01461.00641.02280.0004

3.2. Performance of EMT-Associated Signature

We first calculated the risk score for each patient based on this model. To evaluate the performance of the signature model, patients were divided into high and low groups according to the median value of risk score expression, and the results suggested that the disease-specific survival rate of high-risk patients was significantly lower than that of low-risk patients, and the comparison between groups was statistically different (p < 0.001) (Figures 3(a)–3(c)), and the ROC results suggested that the predictive efficacy of the model was likewise. The area under the curve was 0.868 (Figure 3(d)). In addition, to verify the stability of the model, 30% of the total sample was selected for the internal validation of the test set. The results suggested that the same between-group survival differences existed in the test set (Figures 4(a)–4(c)). In addition, the results suggest that the model has strong stability with an area under the ROC curve of 0.815 (Figure 4(d)). This result suggests that the model has a strong stability.
Figure 3

Construct model in training data set, based on the Cox regulation model, a five EMT-related gene signature was constructed: the risk score and the survival status distribution (a) and the heat map of five genes in high- and low-risk group (b). The survival curve show high-risk score patients with a worse outcome, compared with low-risk score patients (c). The area under the receiver operating characteristic of model was 0.868 (d).

Figure 4

Validate model in test data set, a five EMT-related gene signature was validated: the risk score and the survival status distribution (a) and the heat map of five genes in high- and low-risk group (b). The survival curve show high-risk score patients with a worse outcome, compared with low-risk score patients (c). The area under the receiver operating characteristic of model was 0.815 (d).

4. Discussion

AML is a deadly and highly heterogeneous disease due to extensive genomic changes and molecular mutations, which have been incorporated in the updated 2017 European LeukemiaNet (ELN) risk stratification guidelines [16]. Breakthroughs in NGS technology have not only explored the molecular mechanisms of this disease but also bring the AML into the era of small molecule inhibitor therapy. More studies are devoted to exploring new prognostic models based on the genetic and molecular profiling to uncover more potential therapeutic targets [4-6]. In the present study, we constructed a predictive model based on the EMT-related signature to provide a visual predictive tool for AML, which might lay the foundation for exploring the role of EMT in hematological malignancies. Epithelial cells provide intercellular adhesion by cell-cell cohesion and are essential for maintaining the integrity and barrier function of multicellular structures. However, epithelial cells transform into mesenchymal cells to acquire more complex structures and functions of organs during embryonic development and wound healing, which is termed EMT [17, 18]. The quiescent epithelial cells in adults reactivated and primed for the EMT under various internal and external changes, which facilitate tumor cells to invade the extracellular matrix and evade the immune elimination [19]. The downregulation of the cell adhesion protein E-cadherin and cytoskeletal rearrangements, including downregulation of keratin and upregulation of vimentin, are the main features of EMT, which cause ultimately tumor progression and metastasis. Several EMT-TFs have been well identified to coordinate the process, such as SNAIL/SNAI1, SLUG/SNAI2, and TFs of the TWIST and ZEB families [20]. Given that EMT is associated with tumor invasiveness and metastasis, as well as its molecular properties, some EMT-related signatures have been developed to predict the prognosis of cancers and the response to immunotherapy. A recent study reported an EMT-related gene signature for the prognosis of human bladder cancer [21], and Chae et al. [22] analyzed the immune landscape of NSCLC (nonsmall cell lung cancer) patients based on EMT scores to predict the response of patients to immunotherapy. Although some previous studies have shown the role of EMT makers and EMT-TFs in AML, no EMT signature has been applied to predict the prognosis of AML [14, 15]. As shown in our study, five EMT-related genes (CBR1, HS3ST3B1, LIMA1, MIR573, PTP4A3) were selected by random forest algorithm as the prognostic in TCGA-LAML cohort as a training set. Then, AML patients were divided into high-risk and low-risk groups based on the EMT-related signature risk score. The results demonstrated that patients in the low-risk group have longer OS than in the high-risk group, which were also validated in internal datasets. Carbonyl reductase 1 (CBR1) belongs to the short dehydrogenase (SDR) family, which could promote AML cell resistance to daunorubicin and be a risk gene in AML patients [23]. However, it is still unclear whether CBR1 can lead to progression and drug resistance through EMT in AML. A previous study has shown that heparan sulfate D-glucosamine 3-O-sulfotransferase 3B1 (HS3ST3B1) participates in the biosynthetic steps of heparan sulfate (HS) and positively contributed to acute AML progression by induction of VEGF expression, which also involves in the regulation TGF-beta-mediated EMT in NSCLC [24, 25]. LIMA1 (LIM domain and actin binding 1), also known as epithelial protein lost in neoplasm (EPLIN), has been known to play differential roles in the progression and metastasis of certain cancers [26, 27]. Downregulation or phosphorylation of EPLIN can alter the expression of some EMT elements such as E-cadherin and ZEB1 via Wnt-catenin signaling pathway, thus promotes the EMT process. While the exact mechanism of LIMA1 in AML remains unknown [27]. The role of MIR573 in EMT of tumors is still controversial. Wang et al. [28]. revealed that MIR573 can inhibit TGFβ1-induced EMT in prostate cancer, while another study indicated MIR573 associated with the EMT in cervical cancer cell growth and metastasis [29]. As so far, the expression of MIR573 has been confirmed in AML cell line (HL-60) and thought as a regulator in responsiveness to inorganic substances [30]. Protein tyrosine phosphatase of regenerating liver 3 (PRL-3), encoded by PTP4A3 gene, has been proved to promote EMT through PI3K/AKT pathway and Src-ERK1/2 pathways in a variety of tumors [31, 32], which is also a hazard factor with poor survival in AML [33]. All these hint the prognostic role of EMT-related gene signature in AML. Furthermore, given that the general condition of the patients is also included in the risk stratification of the disease in addition to the genomic profile [16], a predictive model was constructed based on the EMT-related genes, which demonstrated powerful predictivity.

5. Conclusion

During this study, we constructed a signature model of five EMT-related genes to predict overall survival in patients with AML; it will provide a useful tool for clinical decision making. However, our study still has some limitations. First, more datasets need to be included for better validation. Second, further function experiments regarding of the core genes are required to clarify the role of EMT-related genes in AML.
  33 in total

Review 1.  EMT, cancer stem cells and drug resistance: an emerging axis of evil in the war on cancer.

Authors:  A Singh; J Settleman
Journal:  Oncogene       Date:  2010-06-07       Impact factor: 9.867

Review 2.  Acute Myeloid Leukemia.

Authors:  Hartmut Döhner; Daniel J Weisdorf; Clara D Bloomfield
Journal:  N Engl J Med       Date:  2015-09-17       Impact factor: 91.245

3.  Long non-coding RNA TTN-AS1 promotes cell growth and metastasis in cervical cancer via miR-573/E2F3.

Authors:  Ping Chen; Rui Wang; Qingfen Yue; Mengmeng Hao
Journal:  Biochem Biophys Res Commun       Date:  2018-08-19       Impact factor: 3.575

Review 4.  Diagnosis and management of AML in adults: 2017 ELN recommendations from an international expert panel.

Authors:  Hartmut Döhner; Elihu Estey; David Grimwade; Sergio Amadori; Frederick R Appelbaum; Thomas Büchner; Hervé Dombret; Benjamin L Ebert; Pierre Fenaux; Richard A Larson; Ross L Levine; Francesco Lo-Coco; Tomoki Naoe; Dietger Niederwieser; Gert J Ossenkoppele; Miguel Sanz; Jorge Sierra; Martin S Tallman; Hwei-Fang Tien; Andrew H Wei; Bob Löwenberg; Clara D Bloomfield
Journal:  Blood       Date:  2016-11-28       Impact factor: 22.113

Review 5.  Emerging evidence of epithelial-to-mesenchymal transition in lung carcinogenesis.

Authors:  Mitsuo Sato; David S Shames; Yoshinori Hasegawa
Journal:  Respirology       Date:  2012-10       Impact factor: 6.424

6.  EPLIN downregulation promotes epithelial-mesenchymal transition in prostate cancer cells and correlates with clinical lymph node metastasis.

Authors:  S Zhang; X Wang; A O Osunkoya; S Iqbal; Y Wang; Z Chen; S Müller; Z Chen; S Josson; I M Coleman; P S Nelson; Y A Wang; R Wang; D M Shin; F F Marshall; O Kucuk; L W K Chung; H E Zhau; D Wu
Journal:  Oncogene       Date:  2011-05-30       Impact factor: 9.867

7.  Retroviral integration mutagenesis in mice and comparative analysis in human AML identify reduced PTP4A3 expression as a prognostic indicator.

Authors:  Renée Beekman; Marijke Valkhof; Stefan J Erkeland; Erdogan Taskesen; Veronika Rockova; Justine K Peeters; Peter J M Valk; Bob Löwenberg; Ivo P Touw
Journal:  PLoS One       Date:  2011-10-20       Impact factor: 3.240

8.  Characterization of miRNomes in acute and chronic myeloid leukemia cell lines.

Authors:  Qian Xiong; Yadong Yang; Hai Wang; Jie Li; Shaobin Wang; Yanming Li; Yaran Yang; Kan Cai; Xiuyan Ruan; Jiangwei Yan; Songnian Hu; Xiangdong Fang
Journal:  Genomics Proteomics Bioinformatics       Date:  2014-04-19       Impact factor: 7.691

9.  MiR-573 inhibits prostate cancer metastasis by regulating epithelial-mesenchymal transition.

Authors:  Lin Wang; Guanhua Song; Weiwei Tan; Mei Qi; Lili Zhang; Jonathan Chan; Jindan Yu; Jinxiang Han; Bo Han
Journal:  Oncotarget       Date:  2015-11-03

Review 10.  'Acute myeloid leukemia: a comprehensive review and 2016 update'.

Authors:  I De Kouchkovsky; M Abdul-Hay
Journal:  Blood Cancer J       Date:  2016-07-01       Impact factor: 11.037

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.