Literature DB >> 35836470

Epithelial to Mesenchymal Transition Relevant Subtypes with Distinct Prognosis and Responses to Chemo- or Immunotherapies in Osteosarcoma.

Yang Zhou1, Gai Li1, Hu Li1, Fuchong Lai1, Pingguo Duan1, Ming Cheng1.   

Abstract

Objective: Currently, clinical classification of osteosarcoma cannot accurately predict the survival outcomes and responses to chemo- or immunotherapies. Our goal was to classify osteosarcoma patients into clinical/biological subtypes based on EMT molecules.
Methods: This study retrospectively curated the RNA expression profiling of osteosarcoma patients from the TARGET and GSE21257 cohorts. Consensus clustering analyses were conducted in accordance with the expression profiling of prognostic EMT genes derived from univariate analyses. Immunological features were evaluated through immune cell infiltration, immune checkpoint expression, and activity of cancer immunity cycle. Drug sensitivity was estimated with the GDSC database. WGCNA approach was adopted to determine the EMT-derived genes. Following univariate analyses, a multivariate cox regression model was developed and externally verified. Predictive independency was evaluated with uni- and multivariate analyses. GSEA was presented to uncover relevant molecular mechanisms.
Results: Prognostic EMT genes across osteosarcoma patients were stratified into distinct subtypes, namely, subtypes A and B. Patients in subtype B presented desirable prognosis, high immune activation, and enhanced sensitivity to cisplatin. Meanwhile, patients in subtype A were more sensitive to gemcitabine. In total, 86 EMT-derived hub genes were determined, and an EMT score was conducted for osteosarcoma prognosis. Following external verification, this EMT score was reliably and independently predictive of patients' survival outcomes. Additionally, it was positively linked to steroid biosynthesis.
Conclusion: Overall, our findings proposed EMT-relevant molecular subtypes and signatures for predicting prognosis and therapeutic responses, contributing to personalized treatment and clinical implication for osteosarcoma.
Copyright © 2022 Yang Zhou et al.

Entities:  

Mesh:

Substances:

Year:  2022        PMID: 35836470      PMCID: PMC9274235          DOI: 10.1155/2022/1377565

Source DB:  PubMed          Journal:  J Immunol Res        ISSN: 2314-7156            Impact factor:   4.493


1. Introduction

Osteosarcoma represents the most prevalent primary bone sarcomas, which originates from mesenchymal stem cell populations, with an annual incidence of 3-5 cases per million individuals [1-3]. It most often affects children, adolescents, and young adults, with aberrant bone growth areas [4]. The difficulty in biology research is in relation to the complexity of the osteosarcoma genome as well as remarkable biological discrepancy among osteosarcoma subtypes [5]. Hence, an in-depth understanding of osteosarcoma biology highlights the heterogeneity as well as uncovers the molecular abnormality that defines patient subpopulations. The standard treatment of osteosarcoma comprises surgical resection and chemotherapy [6]. Patients are usually resistant to conventional chemotherapy; meanwhile, high-dose chemotherapeutic agents cause severe side effects [7]. Hence, to improve the survival duration of osteosarcoma patients has been proven to be challenging. Epithelial-mesenchymal transition (EMT) is a key process by which epithelial cells acquire mesenchymal characteristics [8]. Growing evidences demonstrate that EMT exerts a crucial role in stemness [9], metabolic reprogramming [10], immune evasion [11], and therapeutic resistance [12] for cancer cells. Osteosarcoma cells that have experienced EMT process will lose the cellular polarity and acquire aggressive and metastatic features. This process has been regarded as a crucial event for osteosarcoma metastases [13]. Previously, Yiqi et al. proposed an EMT-relevant model of osteosarcoma as a prognostic indicator through integrated multicohorts [14], indicative of the crucial prognostic implication of EMT during osteosarcoma progression. Limited evidences demonstrate that chemotherapy (cisplatin, etc.) resistance contributes to EMT activation in osteosarcoma cells [15, 16]. Recent research has presented that EMT genes are remarkably linked to immunity in osteosarcoma [17]. Tumor-associated macrophages trigger EMT in osteosarcoma through activation of COX-2/STAT3 signaling [18]. Thus, extensive research of EMT on clinical outcomes and therapeutic responses of osteosarcoma is urgently required. In our research, integrated genomic analyses were conducted for characterizing distinct EMT-relevant subtypes with prognostic and prediction implications in osteosarcoma.

2. Materials and Methods

2.1. Acquisition of Gene Expression Profiling

This study acquired the TARGET data matrix (https://ocg.cancer.gov/programs/target/data-matrix) comprising gene expression data and clinical data of 84 osteosarcoma patients, as the discovery cohort. Genome-wide gene expression profiling and prognostic information of 53 osteosarcoma patients were curated from the GSE21257 cohort in the Gene Expression Omnibus (GEO) repository (https://www.ncbi.nlm.nih.gov/gds/) [19], as the testing cohort. This cohort was on the basis of the platform of GPL10295 Illumina human-6 v2.0 expression beadchip. The gene expression series matrix files were directly retrieved, and the probe IDs were mapped to the gene symbols in accordance with the matched annotation files. Thereafter, expression values of multiple probes mapping to the same gene were averaged while probes mapping to multiple genes were eliminated. The gene set of EMT was curated from the Molecular Signatures Database (MSigDB), which was listed in Supplementary Table 1 [20].

2.2. Identification of Molecular Subtypes

Univariate analyses were presented for assessment of the association of EMT genes with osteosarcoma prognosis. Genes with P < 0.05 were determined for the subsequent analyses. K-means-based consensus clustering was carried out utilizing the ConsensusClusterPlus package on the basis of the transcriptome files of prognostic EMT genes [21]. Cumulative distribution function (CDF), delta area, and consensus matrix diagrams were conducted in accordance with default parameters. The number of clusters was determined in accordance with the following criteria: (i) the consistency within the clusters was relatively high; (ii) the coefficients of variation were relatively low; (iii) the area under the CDF curve was not remarkably elevated. The area under the CDF curves was utilized for defining the clustering number. Principal component analyses (PCA) were conducted for verifying the accuracy of this classification.

2.3. Gene Set Variation Analyses (GSVA)

Fifty hallmark pathways were acquired from the MSigDB, and the activity of above pathways was estimated with GSVA package in osteosarcoma specimens [22]. Limma package was adopted to compare the discrepancy in activity of hallmark pathways between subtypes [23].

2.4. Analyses of Immunological Features

The infiltration of tumor-infiltrating immune cells was estimated across osteosarcoma tissues utilizing the single-sample gene set enrichment analyses (ssGSEA) [24] and TIMER2 database [25]. The RNA expression of immune checkpoint molecules and HLA molecules was determined across osteosarcoma specimens. Cancer immunity cycle comprises release of cancer cell antigens (step 1), cancer antigen presentation (step 2), priming and activation (step 3), trafficking of immune cells to tumors (step 4), infiltration of immune cells into tumors (step 5), recognition of cancer cells by T cells (step 6), and killing of cancer cells (step 7) [26]. The activity of all steps was determined with ssGSEA approach in accordance with the transcriptome profiling [27]. The gene set of each step within the cancer immunity cycle was listed in Supplementary Table 2.

2.5. Estimation of Drug Sensitivity

Through adopting the pRRophetic algorithm [28], the half-maximal inhibitory concentration (IC50) values of cisplatin and gemcitabine were determined in osteosarcoma in accordance with the Genomics of Drug Sensitivity in Cancer (GDSC; http://www.cancerrxgene.org/) [29].

2.6. Weighted Gene Coexpression Network Analyses (WGNCA)

The data matrix of mRNA expression profiling in the TARGET cohort was analyzed utilizing the WGCNA package [30]. The first 25% genes with the largest variance were determined. For selecting a standard scale-free network, the sample hierarchical clustering approach was utilized for detecting and removing outlier specimens, followed by selection of the appropriate soft thresholding analyses. Thereafter, the adjacency matrix and topological overlap matrix (TOM) were established as well as the matched dissimilarity (1-TOM) was determined. Dynamic tree cutting methods were utilized for completing the gene tree and module clustering. Thereafter, the module characteristic genes were clustered as well as the highly similar modules were merged. The dissimilarity of the module eigengenes (ME) was determined with the moduleEigengenes function. The interactions of ME with EMT subtypes were analyzed with Pearson's correlation. Module membership (MM) represents the relevance of the expression profile to ME, while gene significance (GS) represents the correlation between the expression profiles and clinical traits. In this study, MM was calculated with Pearson's correlation of the expression profiling and ME. Meanwhile, GS was measured for evaluating the genes with EMT subtypes. Thereafter, the correlation between MM and gene signature was evaluated, and EMT-derived genes were determined.

2.7. Analysis of Hub Genes

Protein–protein interactions (PPIs) of genes in the lightcyan module were assessed through the Search Tool for the Retrieval of Interacting Genes/Proteins (STRING) online tool (http://string-db.org/) [31]. Hub genes in this PPI network were determined utilizing the Molecular Complex Detection (MCODE), a plug-in of Cytoscape software [32].

2.8. Functional Enrichment Analyses

Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analyses were presented utilizing the clusterProfiler package [33]. The parameter utilized in this package was default as well as the threshold for identifying the GO functions and KEGG pathways was set as P value < 0.05.

2.9. Gene Set Enrichment Analyses (GSEA)

For uncovering the difference in functional phenotypes between high- and low-risk subpopulations, GSEA was carried out. The “c5.all.v7.0.symbols.gmt” was utilized as a reference gene set. GSEA was implemented utilizing GSEA software. A nominal P < 0.05 was regarded as a significant enrichment.

2.10. Statistical Analyses

Statistical analyses were implemented utilizing R software (version 4.0.2). Kaplan-Meier survival curves were drawn among osteosarcoma cases, and survival difference was determined with log-rank test. Receiver operator characteristic (ROC) curves were presented with timeROC package. Comparison between two subgroups was conducted with student's t or Wilcoxon test. Two-sided P value < 0.05 was indicative of statistical significance.

3. Results

3.1. Characterization of Two EMT Molecular Subtypes in Osteosarcoma

Figure 1 illustrates the flowchart of this study. We curated 200 EMT genes from the MSigDB and analyzed their prognostic implication in osteosarcoma through univariate cox regression analyses. As a result, 40 EMT genes were remarkably linked to osteosarcoma prognosis (Figure 2(a); Table 1), in which 28 served as protective factors of osteosarcoma prognosis while the others served as risk factors. The consensus clustering of prognostic EMT genes was conducted to determine distinct EMT subtypes for osteosarcoma. For two categories, the area under the CDF curves began to stabilize (Figures 2(b) and 2(c)). Meanwhile, the consensus matrix was conducted for identifying the optimal number of subtypes. In Figure 2(d), the consensus matrix displayed a well-defined block structure when k = 2. Overall, two EMT molecular subtypes were conducted, namely, subtypes A and B. PCA results also fit into two clusters (Figure 2(e)). Survival analyses showed that subtype A presented poorer prognosis in comparison to subtype B (Figure 2(f)), indicative of the discrepancy in survival outcomes between subtypes.
Figure 1

The flowchart of this study.

Figure 2

Characterization of two EMT molecular subtypes across osteosarcoma patients in the TARGET cohort. (a) Forest plots depict the univariate cox regression analyses results of EMT genes that were significantly linked to osteosarcoma prognosis with P < 0.05. Red indicates risk factor while blue represents protective factor. (b) CDF curves when k = 2 to 9. (c) The relative variations of the area under the CDF curves that k from 2 to 9. (d) The consensus clustering matrix at k = 2. (e) PCA plots visualizing the discrepancy in two clusters in accordance with the expression profiling of prognostic EMT genes. (f) Kaplan-Meier survival analyses for patients in two clusters.

Table 1

Prognostic EMT genes of osteosarcoma in the TARGET cohort.

EMT genesHRHR.95LHR.95H P value
ACTA20.600070.4783060.7528311.02E-05
CADM11.5633521.032822.3664040.034631
CALD10.6878340.5038440.9390130.018464
CD590.5782010.3404290.9820430.042661
CDH110.6463050.4264960.9793990.039579
COL5A21.7960281.214622.6557420.003344
COL8A20.8300520.6952620.9909720.039373
CXCL120.7012030.5278380.9315080.014302
DAB20.5528340.3568920.8563510.007943
DCN0.6879720.5328250.8882940.004125
DKK11.2174181.0173581.4568190.031726
EDIL30.5332630.3876030.7336610.000112
FAP0.7518640.6010250.9405570.012547
FAS0.6982160.498360.9782180.0368
FBLN10.8250410.6884570.9887210.037269
FUCA10.5383190.3517180.823920.004346
GADD45B1.5853841.0890542.3079140.016163
GLIPR10.5961840.3996290.8894130.011271
GPX72.2373571.2297064.07070.008362
ID21.6760291.0164652.7635730.042973
ITGB50.6562430.4655170.9251110.016205
LGALS10.6283150.4374040.9025520.01191
LOX1.7786161.2122372.6096180.003241
LOXL10.7381460.5958540.9144180.005456
MAGEE10.5051290.303880.8396590.008439
MYL90.6546790.4689670.9139330.012821
NID20.7655640.5957520.9837790.036818
PCOLCE21.3512951.0500021.7390420.019333
PMEPA10.6185420.4397160.8700920.005794
POSTN0.8067680.6519840.9982980.048195
SERPINE21.6475941.2203422.2244290.001114
SERPINH12.0617951.1992873.5446060.008863
TAGLN0.6419220.4640910.8878950.007398
TNFRSF11B1.4706171.1425421.8928980.002748
TNFRSF12A0.6625690.4700710.9338970.018749
TPM10.5051790.3246430.7861110.002473
TPM40.5972360.3742920.9529740.030618
VCAM10.713910.552340.9227420.010049
VEGFA1.4406341.1234441.8473790.00401
WNT5A0.649680.4529960.931760.019071

3.2. Immunological Features and Drug Sensitivity of Two EMT Molecular Subtypes

The activity of the 50 hallmark pathways was estimated across osteosarcoma tissues utilizing the GSVA algorithm. We noted that the remarkable activation of stromal pathways (EMT, Wnt β-catenin signaling, etc.), immune pathways (IL6 JAK STAT3 signaling, etc.), and metabolism pathways (bile acid metabolism, heme metabolism, xenobiotic metabolism, etc.) in EMT subtype B than subtype A (Figure 3(a) and Supplementary Table 3). In Figure 3(b), ssGSEA approach showed that EMT subtype B presented remarkably enhanced immune cell infiltrations containing central memory CD4 and CD8 T cells, effector memory CD4 T cells, memory B cells, regulatory T cells, type 1 and 2 helper cells, CD55bright natural killer cells, macrophages, MDSCs, natural killer cells, and natural killer T cells than subtype A. Meanwhile, TIMER2 approach showed the lower infiltrations of B cells and the higher infiltration of CD4 T cells compared with subtype A (Supplementary Figure 1). Additionally, we noted that immune checkpoint molecules ADORA2A, CD44, PDCD1LG2, TNFRSF14, and TNFRSF8 were prominently activated in subtype B than A (Figure 3(c)). Nevertheless, we did not investigate the significant discrepancy in HLA molecule expression between subtypes (Figure 3(d)). The activity of cancer immunity cycle was also estimated in each osteosarcoma specimen. Compared with subtype A, release of cancer cell antigen, B cell/monocyte/Th17 cell recruiting, recognition of tumor cells through T cells, and killer of tumor cells presented remarkably enhanced activity in subtype B (Figure 3(e)). Overall, EMT subtype B possessed the features of immune activation. Also, subtype B displayed increased sensitivity to cisplatin while subtype A was more sensitive to gemcitabine (Figure 3(f)).
Figure 3

Immunological features and drug sensitivity of two EMT molecular subtypes. (a) Heat map depicts the activity of the 50 hallmark pathways in EMT subtypes A and B. (b) Boxplot shows the infiltrations of tumor-infiltrating immune populations in two subtypes. (c) Boxplot depicts the RNA expression of known immune checkpoint molecules in two subtypes. (d) Comparison of the RNA expression of HLA molecules in two subtypes. (e) Comparing the activities of all steps within cancer immunity cycle between subtypes. (f) Boxplot displays the sensitivity to cisplatin and gemcitabine in two subtypes. ∗P value < 0.05; ∗∗P value < 0.01; and ∗∗∗P value < 0.001.

3.3. Establishment of EMT Subtype-Relevant Coexpression Modules

Considering the sensitivity of WGCNA to batch effects, this study firstly preprocessed the data of osteosarcoma individuals in EMT molecular subtypes A and B from the TARGET cohort. The first 25% genes with the largest variance were determined for subsequent analyses. Thereafter, the hclust function was adopted for confirming the batch effect removal as well as investigating whether there was any outlier. In Figure 4(a), no outlier sample was found. Because of the premise of WGCNA approach required to hypothesize that the network met the scale-free criteria, this study further screened the optimal soft thresholding value for making a coexpression network was more subject to a scale-free network. Through calculation of the scale-free topology fitting index, R2 value was up to 0.85 (Figure 4(b)), which validated the feasibility of WGCNA. A hierarchical clustering tree was retrieved through adopting hierarchical clustering analyses. Thereafter, in accordance with the dynamic tree cut approach, 29 distinct coexpression modules were assigned (Figure 4(c)). Figure 4(d) demonstrated that the lightcyan module presented the strongest correlation to EMT molecular subtypes. The genes in this module deserved more exploration.
Figure 4

Establishment of EMT subtype-relevant coexpression modules in the TARGET cohort. (a) Hierarchical clustering dendrogram of osteosarcoma individuals as well as the matched EMT molecular subtypes. (b) Analyses of network topology and average network connectivity under diverse soft-threshold powers (β). The red line is indicative of a correlation coefficient of 0.85. (c) Clustering dendrogram with dissimilarity in accordance with topological overlapping, along with designed module colors. (d) The interaction network between modules and EMT molecular subtypes. Each column is indicative of EMT molecular subtype while each row is indicative of a coexpression module. The number in the rectangle displays the correlation coefficient and P value.

3.4. Exploration of EMT-Derived Genes and Their Relevant Biological Implication

Our further analyses uncovered that the genes in the lightcyan module were remarkably linked to two EMT molecular subtypes (Figures 5(a) and 5(b)), indicative of the genes in the lightcyan module as EMT-derived genes. Through MCODE analyses, we determined 86 EMT-derived hub genes, as depicted in Figure 5(c). Their relevant biological implication was explored in-depth. In Figure 5(d), the EMT-derived hub genes were remarkably linked to endoplasmic reticulum. Additionally, they were significantly enriched in ribosome, oxidative phosphorylation, chemical carcinogenesis-reactive oxygen species, thermogenesis, and HIF-1 signaling pathway (Figure 5(e)).
Figure 5

Exploration of EMT-derived genes and their relevant biological implication. (a) Scatter plots depict the interactions of module membership and gene significance for EMT subtype A. (b) Scatter plots display the interactions of module membership and gene significance for EMT subtype B. (c) The PPI network uncovers the EMT-derived hub genes through MCODE analyses. (d, e) GO and KEGG pathway enrichment analyses of EMT-derived hub genes.

3.5. Development of an EMT-Derived Prognostic Model for Osteosarcoma

Our univariate cox regression analyses were indicative that 33 EMT-derived genes displayed significant interactions with osteosarcoma prognosis (Table 2). Above genes were input the multivariate cox regression model. In accordance with the expression and coefficient of candidate genes (Table 3), we developed an EMT-derived prognostic model for osteosarcoma, comprising RPS9, RPS23, EIF4A1, RPL12, RPL36, RPL37A, RPL34, EEF1B2, RPS8, RPS28, RPL10, RPS24, RPL35A, RPL11, RPL21, RPS27A, RPS12, and RPL13A. We noted the remarkable discrepancy in their expressions in high- than low-risk subpopulations (Figure 6(a)). Figure 6(b) displayed the EMT score distribution across osteosarcoma patients. As EMT score elevated, dead cases were gradually increased (Figure 6(c)). Survival analyses demonstrated that high-risk cases were indicative of more unfavorable clinical outcomes in comparison to low-risk cases (Figure 6(d)). ROC curves were presented to assess the predictive performance of EMT score in osteosarcoma prognosis. In Figure 6(e), area under the curves (AUCs) at one-, three-, and five-year survival were separately 0.723, 0.858, and 0.860, proving that EMT score was capable of prediction of osteosarcoma prognosis. Compared with previously published two EMT models from Yiqi et al. [14] and Peng et al. [17], our EMT-derived prognostic model had higher C-index (0.828), indicating the prediction superiority of this model (Figure 6(f)).
Table 2

Prognostic EMT-derived genes in the TARGET cohort.

GenesHazard ratio95% lower confidence interval95% upper confidence interval P value
RPS92.4585411.153835.238570.01977
NACA2.0628091.0721553.9688120.030111
RPL311.6802771.00252.8162920.048903
RPS231.5821191.0165512.4623480.042083
EIF4A11.9379611.081813.4716740.026127
RPL121.8255211.0147963.2839390.044538
RPL361.8027131.146672.8340990.010684
RPL37A1.9253171.2444032.9788140.003262
RPL301.7284991.0133952.9482150.044557
EEF1D2.4048851.2893224.485670.005799
RPL341.5919331.0083512.5132620.045971
EEF1B21.8697241.1098063.1499810.018701
RPS192.0107871.0323953.9163940.040005
RPS82.6686521.4691294.847570.001268
RPS281.7820711.1612612.7347660.008189
RPL101.8545121.0610423.2413540.030163
RPL171.6413531.0153592.6532890.043159
RPS241.7249821.0735012.7718320.024256
RPL35A1.9945411.1179853.5583610.01941
RPS291.7495571.0445432.9304180.033539
RPS272.2318721.3116563.7976810.003074
RPS62.0213361.3496113.027390.000639
RPL111.7084681.0021662.9125510.049078
RPL10A2.0632781.0765173.9545290.029103
RPL212.1998991.2239213.954140.008404
RPS32.0924331.2474173.5098730.005147
RPS27A1.9137861.1135513.2890980.018812
RPS71.7524511.1184472.7458470.014344
RPS121.8091681.0449983.1321470.034248
PABPC11.6181951.0053082.6047280.047506
RPL32.2182721.1199944.3935310.022315
RPL71.7770431.0661452.9619650.027407
RPL13A2.19361.1391544.2240850.01879
Table 3

Multivariate cox regression models of prognostic EMT-derived genes in the TARGET cohort.

GenesCoefficientHazard ratio95% lower confidence interval95% upper confidence interval P value
RPS94.02323155.881364.964366629.02810.001125
RPS23-2.227290.1078210.0291770.3984440.000838
EIF4A11.2763263.583451.4124459.0914070.007211
RPL12-1.65770.1905770.0470670.7716660.020167
RPL361.6289095.0983091.26764820.504710.021793
RPL37A2.96351119.365843.250895115.36380.001135
RPL34-1.150580.3164530.0889791.1254630.075506
EEF1B2-1.617650.1983650.0522390.7532470.017492
RPS81.8724096.5039451.04550440.46020.044678
RPS28-0.965740.3807010.1253721.1560250.088358
RPL101.5566044.7426881.64107213.706340.004043
RPS241.051292.8613390.80512910.168880.104176
RPL35A1.4845254.412871.23741215.737210.022118
RPL11-2.611060.0734570.0077380.697350.022973
RPL211.8966436.6634891.46090430.393580.014305
RPS27A1.2786133.5916551.04799512.30920.041895
RPS12-0.859010.4235810.1481221.2113050.109073
RPL13A-3.57750.0279450.0026520.2944320.002905
Figure 6

Development of EMT score for osteosarcoma prognosis in the TARGET cohort. (a) Heat map depicts the expression of each gene from EMT score in each osteosarcoma specimen. Red is indicative of upregulation; and blue is indicative of downregulation. (b) The distribution of risk score among osteosarcoma individuals. Vertical dotted line is indicative of the median value of EMT score. (c) The distribution of survival status of high- and low-risk subgroups. (d) Kaplan-Meier survival analyses for two subgroups. (e) ROC curves at one-, three-, and five-year survival in accordance with EMT score. (f) C-index of our EMT model and previously published two EMT models from Zhang et al. and Peng et al.

3.6. External Verification of the Prognostic Value of EMT Score

The prognostic value of EMT score was further verified in the GSE21257 cohort. Figure 7(a) depicted the expression patterns of each gene from EMT score across osteosarcoma specimens. We also visualized the distribution of EMT score in the GSE21257 cohort (Figure 7(b)). In accordance with the median value of EMT score, we stratified osteosarcoma individuals into high- as well as low-risk subpopulations. Additionally, we noted that there were relatively increased dead cases in high-risk subgroup (Figure 7(c)). As expected, high EMT score was remarkably linked to more unfavorable survival outcomes (Figure 7(d)). The AUC at five-year survival was 0.818, proving the excellent predictive performance of EMT score (Figure 7(e)).
Figure 7

External verification of the prognostic value of EMT score in the GSE21257 cohort. (a) Heat map depicts the expression of each gene from EMT score in each osteosarcoma specimen. Red is indicative of upregulation while blue is indicative of downregulation. (b) The distribution of risk score among osteosarcoma individuals. Vertical dotted line is indicative of the mean value of EMT score. (c) The distribution of survival status of high- and low-risk subpopulations. (d) Kaplan-Meier survival analyses for two subpopulations. (e) ROC curves at five-year survival in accordance with EMT score.

3.7. Genes in EMT Score as Prognostic Indicators of Osteosarcoma

Further analyses were conducted for assessing the survival implication of each gene in EMT score in the TARGET cohort. As a result, high expression of EEF1B2, EIF4A1, RPL10, RPL11, RPL12, RPL13A, RPL21, RPL34, RPL35A, RPL36, RPL37A, RPS8, RPS9, RPS12, RPS23, RPS24, RPS27A, and RPS28 was prominently linked to more undesirable survival outcomes in comparison to their low expression (Figures 8(a)–8(r)).
Figure 8

Association of genes in EMT score with osteosarcoma prognosis in the TARGET cohort. (a)–(r) Kaplan-Meier survival analyses for high and low expression subgroups of (a) EEF1B2, (b) EIF4A1, (c) RPL10, (d) RPL11, (e) RPL12, (f) RPL13A, (g) RPL21, (h) RPL34, (i) RPL35A, (j) RPL36, (k) RPL37A, (l) RPS8, (m) RPS9, (n) RPS12, (o) RPS23, (p) RPS24, (q) RPS27A, and (r) RPS28 in the TARGET cohort.

3.8. EMT Score Is Independent of Clinicopathological Indicators for Osteosarcoma Prognosis

In the TARGET cohort, our univariate cox regression analyses displayed that this EMT score was prominently linked to osteosarcoma survival outcomes (Figure 9(a)). Further multivariate analyses demonstrated that the EMT score was independently predictive of patients' prognosis (Figure 9(b)). For uncovering the underlying biological phenotypes between subpopulations, this research carried out GSEA. As a result, steroid biosynthesis was remarkably activated in high-risk subpopulation (Figure 9(c)), and hypertrophic cardiomyopathy displayed prominent activation in low-risk subpopulation (Figure 9(d)).
Figure 9

Evaluation of the independency of EMT score in osteosarcoma prognosis and its relevant signaling pathways. (a, b) Uni- and multivariate cox regression models for determining the independency of EMT score in estimating osteosarcoma survival outcomes in the TARGET cohort. (c, d) Signaling pathways that were remarkably linked to EMT score via GSEA.

4. Discussion

Osteosarcoma presents diverse clinical courses and biological heterogeneity [34]. Hence, to reliably predict prognosis and therapeutic responses, it is critical for comprehensively investigating the molecular mechanisms. Our study determined forty prognostic EMT genes in osteosarcoma. Through consensus clustering approach, two EMT molecular subtypes were conducted. Especially, EMT subtype A presented poorer prognosis in comparison to subtype B, indicating that there was a remarkable discrepancy in survival outcomes between subtypes. Recently, immunotherapy is a promising treatment option against osteosarcoma, and it is crucial to improve the comprehending about the immune responses [3]. We evaluated the immunological features from multiple perspectives. GSVA demonstrated the activation of immune pathways (like IL6 JAK STAT3 signaling) in subtype B. Our ssGSEA revealed that subtype B displayed remarkably enhanced immune cell infiltrations containing central memory CD4 and CD8 T cells, effector memory CD4 T cells, memory B cells, regulatory T cells, type 1 and 2 helper cells, CD55bright natural killer cells, macrophages, MDSCs, natural killer cells, and natural killer T cells. In addition, subtype B had the features of increased expression of immune checkpoint molecules ADORA2A, CD44, PDCD1LG2, TNFRSF14, and TNFRSF8. Among steps within cancer immunity cycle, release of cancer cell antigen, B cell/monocyte/Th17 cell recruiting, recognition of tumor cells through T cells, and killer of tumor cells presented remarkably enhanced activity in subtype B. Hence, EMT subtype B was characterized by immune activation. Drug resistance severely hinders the improvement of survival rate of osteosarcoma patients [35]. Cisplatin, an alkylating drug, can form irreversible covalent bonds with DNA, which causes DNA strands to cross-link and break and missense mutation [36]. It is widely applied for osteosarcoma chemotherapy as well as cisplatin resistance is frequent across osteosarcoma individuals [37]. Our data demonstrated that EMT subtype B presented enhanced sensitivity to cisplatin. Gemcitabine represents the second cytidine analog developed following cytosine arabinoside [38]. Intriguingly, subtype A was more sensitive to gemcitabine. Through WGCNA approach, we constructed 29 EMT subtype-relevant coexpression modules in the TARGET cohort. Especially, lightcyan module presented the strongest interactions with EMT molecular subtypes. Further, MCODE analyses determined 86 EMT-derived hub genes. Functional enrichment analyses unraveled the remarkable interactions of these EMT-derived hub genes with endoplasmic reticulum (ER), ribosome, oxidative phosphorylation, chemical carcinogenesis-reactive oxygen species, thermogenesis, and HIF-1 signaling pathway. For instance, the ER represents the central intracellular organelle of diverse cellular functions and endoplasmic reticulum stress response participates in osteosarcoma pathogenesis [39]. SENP1/HIF-1alpha regulates hypoxia-mediated EMT in osteosarcoma cells [40]. This study conducted an EMT-derived prognostic model for osteosarcoma in the TARGET cohort, comprising EEF1B2, EIF4A1, RPL10, RPL11, RPL12, RPL13A, RPL21, RPL34, RPL35A, RPL36, RPL37A, RPS8, RPS9, RPS12, RPS23, RPS24, RPS27A, and RPS28. ROC curves proved that the EMT score was capable of accurately predicting osteosarcoma individuals' clinical outcomes. Additionally, external verification proved the feasibility of this model in the GSE21257 cohort. Each gene in the EMT score was linked to undesirable prognosis of osteosarcoma individuals. Previously, mTOR inhibitor blunts the p53 response to nucleolar stress through modulating RPL11 and MDM2 expressions [41]. RPL34 upregulation is indicative of undesirable clinical outcomes in osteosarcoma as well as silencing RPL34 weakens osteosarcoma proliferation [42]. Suppressing RPS9 blunts osteosarcoma cell growth via inactivating MAPK signaling [43]. Nevertheless, the functions of most genes in osteosarcoma progression are lack of experimental evidences. Multivariate analyses suggested the independency of the EMT score in prediction of osteosarcoma outcomes. Further, GSEA results presented that steroid biosynthesis was remarkably activated in high-risk osteosarcoma individuals, indicating that steroid biosynthesis might affect osteosarcoma progression. A few limitations of our study should be taken into consideration. First, further analyses are required for elucidating the molecular mechanisms underlying the impact of genes in the EMT-derived model on osteosarcoma through in-depth experiments. Second, more osteosarcoma patients can produce more convincing and accurate findings. Hence, abundant specimens will be included for improving our conclusions as well as more reliably illustrating the underlying mechanisms by which genes in the EMT-derived model affect osteosarcoma progression.

5. Conclusion

Collectively, this study adopted consensus clustering analyses for determining two EMT molecular subtypes in the TARGET cohort as well as uncovered the discrepancy in survival outcomes, immunological features, and drug sensitivity between subtypes. Through system biology-based WGCNA approach, we determined EMT-derived genes and conducted an EMT score for predicting osteosarcoma prognosis. To our knowledge, we firstly characterized the EMT-relevant subtypes for osteosarcoma. Additionally, the EMT score we proposed was externally verified in the external cohort. Overall, our findings might contribute to personalized treatment and be of much clinical implication for osteosarcoma.
  43 in total

1.  TIP: A Web Server for Resolving Tumor Immunophenotype Profiling.

Authors:  Liwen Xu; Chunyu Deng; Bo Pang; Xinxin Zhang; Wei Liu; Gaoming Liao; Huating Yuan; Peng Cheng; Feng Li; Zhilin Long; Min Yan; Tingting Zhao; Yun Xiao; Xia Li
Journal:  Cancer Res       Date:  2018-08-28       Impact factor: 12.701

2.  limma powers differential expression analyses for RNA-sequencing and microarray studies.

Authors:  Matthew E Ritchie; Belinda Phipson; Di Wu; Yifang Hu; Charity W Law; Wei Shi; Gordon K Smyth
Journal:  Nucleic Acids Res       Date:  2015-01-20       Impact factor: 16.971

3.  Glaucocalyxin A reverses EMT and TGF-β1-induced EMT by inhibiting TGF-β1/Smad2/3 signaling pathway in osteosarcoma.

Authors:  Xiubo Jiang; Zhenhao Zhang; Changqin Song; Hanzhi Deng; Runyu Yang; Lvqi Zhou; Yang Sun; Qi Zhang
Journal:  Chem Biol Interact       Date:  2019-05-04       Impact factor: 5.192

4.  ConsensusClusterPlus: a class discovery tool with confidence assessments and item tracking.

Authors:  Matthew D Wilkerson; D Neil Hayes
Journal:  Bioinformatics       Date:  2010-04-28       Impact factor: 6.937

5.  ZFPM2-AS1 transcriptionally mediated by STAT1 regulates thyroid cancer cell growth, migration and invasion via miR-515-5p/TUSC3.

Authors:  Ruizhen Ren; Yuanna Du; Xing Niu; Rukun Zang
Journal:  J Cancer       Date:  2021-04-17       Impact factor: 4.207

6.  TIMER2.0 for analysis of tumor-infiltrating immune cells.

Authors:  Taiwen Li; Jingxin Fu; Zexian Zeng; David Cohen; Jing Li; Qianming Chen; Bo Li; X Shirley Liu
Journal:  Nucleic Acids Res       Date:  2020-07-02       Impact factor: 16.971

7.  WGCNA: an R package for weighted correlation network analysis.

Authors:  Peter Langfelder; Steve Horvath
Journal:  BMC Bioinformatics       Date:  2008-12-29       Impact factor: 3.169

8.  TM4SF1 promotes EMT and cancer stemness via the Wnt/β-catenin/SOX2 pathway in colorectal cancer.

Authors:  Qiang Tang; Jinhuang Chen; Ziyang Di; Wenzheng Yuan; Zili Zhou; Zhengyi Liu; Shengbo Han; Yanwei Liu; Guoguang Ying; Xiaogang Shu; Maojun Di
Journal:  J Exp Clin Cancer Res       Date:  2020-11-05

Review 9.  Pathogenesis and Current Treatment of Osteosarcoma: Perspectives for Future Therapies.

Authors:  Richa Rathore; Brian A Van Tine
Journal:  J Clin Med       Date:  2021-03-12       Impact factor: 4.964

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.