Literature DB >> 35299749

Identification of a Novel Tumor Microenvironment Prognostic Signature for Bladder Urothelial Carcinoma.

Chaojie Xu1, Dongchen Pei1, Yi Liu1, Yang Yu1, Jinhua Guo1, Nan Liu1, Zhengjun Kang1.   

Abstract

Background: The tumor microenvironment (TME) regulates the proliferation and metastasis of solid tumors and the effectiveness of immunotherapy against them. We investigated the prognostic role of TME-related genes based on transcriptomic data of bladder urothelial carcinoma (BLCA) and formulated a prediction model of TME-related signatures.
Methods: Molecular subtypes were identified using the non-negative matrix factorization (NMF) algorithm based on TME-related genes from the TCGA database. TME-related genes with prognostic significance were screened with univariate Cox regression analysis and lasso regression. Nomogram was developed based on risk genes. Receiver operating characteristic (ROC) curve and decision curve analysis (DCA) were used for inner and outer validation of the model. Risk scores (RS) of patients were calculated and divided into high-risk group (HRG) and low-risk group (LRG) to compare the differences in clinical characteristics and PD-L1 treatment responsiveness between HRG and LRG.
Results: We identified two molecular subtypes (C1 and C2) according to the NMF algorithm. There were significant differences in overall survival (OS) (p<0.05), progression-free survival (PFS) (p<0.05), and immune cell infiltration between the two subtypes. A total of eight TME-associated genes (CABP4, ZNF432, BLOC1S3, CXCL11, ANO9, OAS1, FBN2, CEMIP) with independent prognostic significance were screened to build prognostic risk models. Age (p<0.001), grade (p<0.001), and RS (p<0.001) were independent predictors of survival in BLCA patients. The developed RS nomogram was able to predict the prognosis of BLCA patients at 1, 3, and 5 years more potentially than the models of other investigators according to ROC and DCA. RS showed significantly higher values (p = 0.047) in patients with stable disease (SD)/progressive disease (PD) compared to patients with complete response (CR)/partial response (PR). Conclusions: We successfully clustered and constructed predictive models for TME-associated genes and helped guide immunotherapy strategies.
Copyright © 2022 Xu, Pei, Liu, Yu, Guo, Liu and Kang.

Entities:  

Keywords:  bladder urothelial carcinoma; immunotherapy; non-negative matrix factorization; prognosis; tumor microenvironment

Year:  2022        PMID: 35299749      PMCID: PMC8921452          DOI: 10.3389/fonc.2022.818860

Source DB:  PubMed          Journal:  Front Oncol        ISSN: 2234-943X            Impact factor:   6.244


Introduction

Bladder urothelial carcinoma (BLCA) is the pathological type of bladder cancer with the highest percentage, about 90% (1). There were globally approximately 550,000 new patients and 200,000 deaths in 2018 (2). Despite significant advances in the treatment of BLCA with radiotherapy, surgery, and targeted therapy, the prognosis of BCLA patients remains poor, with 30% of them having muscle-invasive bladder cancer (MIBC) at initial consultation (3). However, patients with MIBC are characterized by rapid disease progression and low survival, with a 5-year tumor-specific mortality rate of >50% (4). Therefore, a validated prognostic risk model can help guide individualized treatment of BLCA patients to improve their prognosis. At present, the 8th edition of the TNM staging method published by the Union International Center of Cancer (UICC) is one of the most valuable indicators to determine the prognosis of patients with BLCA (5). However, due to the heterogeneity of BLCA, the prognosis of patients with the same TNM stage may vary considerably (6–8). In addition, multiple comprehensive BLCA molecular typing based on genetic analysis can forecast the overall survival (OS) of individuals, such as ferroptosis-associated gene signature (9), autophagy-associated gene signature (10), and RNA binding protein-associated gene signature (11). Therefore, we considered the use of gene signatures as biomarkers that could predict individual prognosis and drug responsiveness, thus improving clinical outcomes in BLCA patients. The tumor microenvironment (TME) includes solid tumor cells, vascular network, extracellular matrix, secreted factors (cytokines, chemokines), and distantly recruited cells such as activated B cells and macrophages (12). Overall, this homeostatic system supports the progression and recurrence of malignancies and has important implications in chemoresistance and immunotherapy (13, 14). Mesenchymal cells such as fibroblasts within TME are associated with the T-cell efflux phenotype in bladder cancer (15). In addition, the non-immune cellular components of TME also influence the therapeutic response, for example, the secretion of TGF-β by fibroblasts can lead to efflux of immune cells or resistance to chemotherapeutic agents, and therefore the therapeutic effect of the tumor varies with the degree of stromal cell infiltration (16). Therefore, tumor tissue gene expression profiles can reflect the relationship between TME and patient prognosis. In summary, TME-associated gene signature can enhance the reliability of forecasting patient prognosis. Therefore, we aimed to design a prediction model combining TME-related gene signatures and patient clinical characteristics and develop a nomogram to forecast the prognosis of BLCA patients at 1, 3, and 5 years.

Materials and Methods

Data Download and Preprocessing

We downloaded transcriptome data and clinical annotations from the Genomic Data Commons. The TCGA-BLCA (The Cancer Genome Atlas-Bladder Cancer) cohort consisted of 433 RNA sequencing samples, including 19 normal profiles and 414 tumor profiles. We removed samples without clinical follow-up information and microdissection from the TCGA-BLCA cohort, resulting in the inclusion of a total of 359 samples. We also downloaded the simple nucleotide variation data in the TCGA database for further analysis of copy number variation (CNV). Moreover, for external validation, data for the cohort of GSE31684 were obtained from the GEO database. The microarray data of GSE31684 from Affymetrix Human Genome U133 Plus 2.0 Array, we downloaded the normalized matrix file directly. All dates contain survival information. TME-related genes were obtained from published studies (17–23), and a total of 4061 genes were included ( ). We used the UCSC Xena (https://xenabrowser.net/) to download the TCGA-GDC pan-cancer data.

Screening for TME-Related Differentially Expressed Genes

The differentially expressed TME-associated genes in BLCA tissues and adjacent tissues were screened, and the screened differential genes and their expression were organized into a gene expression matrix with a corrected p<0.05 and the absolute value of differential expression multiplicity >1 (FDR<0.05 and | log2Fold Change|>1) was set as the threshold value.

Identification of Molecular Subtypes Using Non-Negative Matrix Factorization (NMF) Algorithm

Fifty iterations of the sample were performed using NMF for extracting biological correlation coefficients and predicting the inner feature structure in gene expression matrices (24). We observed performance for the number of clusters k between two and ten.

Comparison of Immune Scores Between Clusters

Microenvironment Cell Populations (MCP) counters allow quantification of the absolute abundance of eight immune cells and two stromal cells from transcriptomic data (24). We evaluated infiltrating cell scores between clusters, which included neutrophils, NK cells, and myeloid dendritic cells, among others. Then we evaluated the infiltrating cells between clusters.

Univariate Cox Regression Analysis and Lasso Regression Analysis

At the ratio of 7:3, 359 samples were divided into the training set and validation set with no one as the control. TME-related differentially expressed genes were subjected to univariate Cox analysis to get the prognostic genes. The lasso method prevents model overfitting by building a penalized feature to build a more refined model. We then applied lasso Cox regression to minimize the amounts of genes in the prognostic modeling. The results of the lasso regression analysis on the variables affecting the outcome of individuals with BLCA were incorporated into multivariate Cox regression analysis.

Construction of a Nomogram Combined With Risk Score (RS) and Clinical Features

The TCGA cohort was used to build a nomogram to forecast the prognosis of individuals with BLCA, with variables including RS and clinical characteristics. RS was calculated according to the expression of differential genes and regression analysis coefficient values. The formula is shown below: We classify the cases into high-risk group (HRG) and low-risk group (LRG) according to the median RS. The receiver operating characteristic (ROC) curve was utilized to assess the predictive value of RS for prognosis. Kaplan-Meier (K-M) curves were drawn and the Log-rank method was used to evaluate OS.

Prediction Model Evaluation

ROC, calibration curve (bootstrap 1000), and decision curve analysis (DCA) were applied to assess the confidence validity of the model. Our RS nomogram was also compared with those of other established models. RMS time was used to assess the prognostic accuracy of the models beyond 60 months of patient survival.

Gene Set Enrichment Analysis (GSEA)

The c2.cp.kegg.v7.4.symbols and c5.go.v7.4.symbols collection were used to explore the function annotation in HRG and LRG by GSEA software. Gene sets with FDR < 0.05 were was considered statistically significant.

Immunotherapy Prediction

We selected patients with urologic tumors in the IMvigor210 cohort who had received programmed death-ligand 1 (PD-L1) blockade treatment to predict response to immunotherapy. This cohort had a total of 348 cases, containing 232 death samples and 116 censored, all of which contained survival data. BLCA patients who received anti-PD-L1 therapy could be classified into the following categories according to the patient’s response: complete response (CR), partial response (PR), stable disease (SD), and progressive disease (PD). Among them, CR and PR are recognized as patients who respond to immunotherapy. SD and PD are recognized as patients who do not respond to immunotherapy. We computed RS for each case and classified them as HRG and LRG based on the median value of RS.

Statistical Analysis

Statistical analysis and graphical work were finished in the R environment (version 4.1.1). Volcano maps were drawn using the “ggplot2” package. Violin plots were drawn with “ggpubr” package. Cox regression analyses were performed by the “survival” package. We used the Chi-square test or Fisher’s exact test to measure the difference between training and validating sets and the relationship between clinical data and RS. K-M survival curves with log-rank tests were plotted by the “survminer” package. The ROC curves were depicted by the “timeROC” package. Calibration curves were derived from the “rms” package. The restricted mean survival (RMS) package was for computing the C-index for each of the models. p < 0.05 is considered to have statistical difference.

Results

Patient Characteristics and Analysis of Differentially Expressed Genes Associated With TME in BLAC Patients

The TCGA-BLCA cohort consisted of 433 RNA sequencing samples, including 19 normal samples and 414 tumor samples. The characteristics of the cases enrolled in this study were shown in after pre-processing. In total, there were 1014 TME-associated genes (FDR<0.05 and |log2FC|>1) differentiated in expression between BLCA patients and regular bladder tissue. The top 50 up-regulated and down-regulated genes with differential expression were plotted as volcanoes ( ).
Table 1

Clinicopathological characteristics of BLCA patients from the TCGA and GEO databases.

CharacteristicsTCGA-BLCA cohort N = 359GSE31684 N = 93
Age
<=65141 (39.28%)28 (39.41%)
>65218 (60.72%)65 (60.59%)
Gender
Female99 (27.58%)25 (26.88%)
Male260 (72.42%)68 (73.12%)
Grade
High342 (95.26%)87 (93.50%)
Low14 (3.9%)6 (6.50%)
Unknow3 (0.84%)0 (0.00%)
Stage
I-II113 (31.48%)74 (79.57%)
III-IV244 (67.97%)11 (11.83%)
Unknow2 (0.56%)8 (8.60%)
T
T0-T2107 (29.81%)NA
T3-T4223 (62.12%)NA
Unknow29 (8.08%)NA
M
M0164 (45.68%)NA
M111 (3.06%)NA
UnknowNA
N 184 (51.25%)
N0-N1251 (69.92%)NA
N2-N372 (20.06%)NA
Unknow36 (10.03%)NA
Neoadjuvant therapy
Yes10 (2.79%)NA
No349 (97.21%)NA
Radiotherapy
Yes15 (4.18%)NA
No243 (67.69%)NA
Unknow101 (28.13%)NA
Survival status
Alive201 (55.99%)28 (30.11%)
Dead158 (44.01%)65 (69.89%)
The median follow-up time (year) 5.192.57
Figure 1

(A) Volcano plot of differentially expressed genes in BLCA from the TCGA database. (B) Consensus map clustered via the non-negative matrix factorization (NMF) algorithm. (C) The cophenetic correlation coefficient is used to reflect the stability of the cluster obtained from NMF. (D) RSS is used to reflect the clustering performance of the model. (E) Overall survival (OS) showed significant differences between C1 and C2. (F) Progression-free survival (PFS) showed significant differences between C1 and C2. (G) Alluvial plot showing the percentage of C1 and C2 between molecular subtypes.

Clinicopathological characteristics of BLCA patients from the TCGA and GEO databases. (A) Volcano plot of differentially expressed genes in BLCA from the TCGA database. (B) Consensus map clustered via the non-negative matrix factorization (NMF) algorithm. (C) The cophenetic correlation coefficient is used to reflect the stability of the cluster obtained from NMF. (D) RSS is used to reflect the clustering performance of the model. (E) Overall survival (OS) showed significant differences between C1 and C2. (F) Progression-free survival (PFS) showed significant differences between C1 and C2. (G) Alluvial plot showing the percentage of C1 and C2 between molecular subtypes.

Molecular Subtypes of TME-Related Genes According to the NMF Algorithm

We clustered genes based on the differential expression TME-related genes by the NMF algorithm. When k=2, C1 and C2 were formed based on covariance and RSS ( ). Survival results showed that cases in the C1 cluster had better OS and PFS than those in the C2 cluster ( ). There were differences in immune scores between C1 and C2 for nine cell types, including CD8+ T cells (p<0.05), cytotoxic lymphocytes (p<0.05), B cells (p<0.05), neutrophils (p = 0.023), monocytes (p<0.05), endothelial cells (p<0.05), fibroblasts (p<0.05), NK cells (p<0.05), and myeloid dendritic cells (p = 0.014) ( ). The international transcriptomic immune typology of solid tumors has established six immune subtypes, including wound healing (Immune C1), IFN-gamma dominant (Immune C2), inflammatory (Immune C3), lymphocyte depleted (Immune C4), immunologically quiet (Immune C5), and TGF-beta dominant (Immune C6). Our molecular subtyping results were compared with the former and the results are shown in .
Figure 2

(A–I) Immune scores of cells of the tumor microenvironment (TME) showed significant differences.

(A–I) Immune scores of cells of the tumor microenvironment (TME) showed significant differences.

Developing an RS Prediction Model for TME-Related Genes

At the ratio of 7:3, 359 samples were randomly divided into a training set (n = 252) and validation set (n = 107) ( ). Baseline features of the patients demonstrated no obvious distinction between them in terms of gender, age, pathological grade, treatment history, and TNM stage (p>0.05). Univariate Cox analysis was performed on the training cohort with 173 differentially TME-associated genes (p < 0.05) ( ). Lasso regression was applied to achieve reduction in the number of genes while remaining highly accurate. The traces of independent variables showed a gradual increase in the number of independent coefficients that converge to zero as λ decreases ( ). We selected 11 genes as candidate genes according to the best value of λ ( ). Then TME-related genes with significant differences were screened according to a multifactorial Cox proportional risk model. Finally, eight genes were obtained, constructing the following equation:
Table 2

Comparison of TCGA training and testing cohort.

CharacteristicsTCGA Testing Cohort N = 107TCGA Training Cohort N = 252 p-Value
Age 0.5587
<=6545 (42.06%)96 (38.1%)
>6562 (57.94%)156 (61.9%)
Gender 0.1973
Female35 (32.71%)64 (25.4%)
Male72 (67.29%)188 (74.6%)
Grade 1
High103 (96.26%)239 (94.84%)
Low4 (3.74%)10 (3.97%)
Unknow0 (0%)3 (1.19%)
Stage 0.5564
I-II31 (28.97%)82 (32.54%)
III-IV76 (71.03%)168 (66.67%)
Unknow0 (0%)2 (0.79%)
T 0.806
T0-T230 (28.04%)77 (30.56%)
T3-T467 (62.62%)156 (61.9%)
Unknow10 (9.35%)19 (7.54%)
M 1
M046 (42.99%)118 (46.83%)
M13 (2.8%)8 (3.17%)
Unknow58 (54.21%)126 (50%)
N 0.254
N0-N179 (73.83%)172 (68.25%)
N2-N317 (15.89%)55 (21.83%)
Unknow11 (10.28%)25 (9.92%)
Neoadjuvant therapy 0.7362
Yes2 (1.87%)8 (3.17%)
No105 (98.13%)244 (96.83%)
Radiotherapy 0.9045
Yes4 (3.74%)11 (4.37%)
No77 (71.96%)166 (65.87%)
Unknow26 (24.3%)75 (29.76%)
Figure 3

(A) Changing the trajectory of each independent variable (the abscissa represents the corrected lambda and the ordinate represents the coefficient of the independent variable). (B) log value of the independent variable lambda (the abscissa represents the CI of each lambda, and the ordinate represents errors in cross-validation). (C–F) Construction and validation of the TME-related eight-gene risk score (RS) for BLCA with 1-, 3- and 5-year receiver operating characteristic (ROC) within different cohorts: (C) TCGA training cohort; (D) entire TCGA cohort; (E) TCGA testing cohort; (F) GEO cohort.

Comparison of TCGA training and testing cohort. (A) Changing the trajectory of each independent variable (the abscissa represents the corrected lambda and the ordinate represents the coefficient of the independent variable). (B) log value of the independent variable lambda (the abscissa represents the CI of each lambda, and the ordinate represents errors in cross-validation). (C–F) Construction and validation of the TME-related eight-gene risk score (RS) for BLCA with 1-, 3- and 5-year receiver operating characteristic (ROC) within different cohorts: (C) TCGA training cohort; (D) entire TCGA cohort; (E) TCGA testing cohort; (F) GEO cohort. ROC was utilized to evaluate the accuracy of the RS model in our training set. As shown in , the area under curve (AUC) for the model were 0.818, 0.776, and 0.771 for 1-, 3-, and 5- years, respectively. We used the median value of RS as the boundary to classify the samples into HRG and LRG. K-M survival analysis showed that LRG had a better prognosis than HRG ( ).
Figure 4

(A–C) (A) Nomogram predicting the 1-, 3- and 5-year OS for patients. The points identified on the point scale of each variable are totaled. Finally, beneath the total points, the probability of 1-, 3- or 5-year survival is projected on the scales below. (B) Calibration curves for the nomogram predicted 1-, 3- and 5-year OS for patients in relation to actual survival. (C–F) Construction and validation of the TME-related eight-gene risk score (RS) for BLCA with Kaplan–Meier (KM) curves within different cohorts: (C) TCGA training cohort; (D) TCGA testing cohort; (E) entire TCGA cohort; (F) GEO cohort.

(A–C) (A) Nomogram predicting the 1-, 3- and 5-year OS for patients. The points identified on the point scale of each variable are totaled. Finally, beneath the total points, the probability of 1-, 3- or 5-year survival is projected on the scales below. (B) Calibration curves for the nomogram predicted 1-, 3- and 5-year OS for patients in relation to actual survival. (C–F) Construction and validation of the TME-related eight-gene risk score (RS) for BLCA with Kaplan–Meier (KM) curves within different cohorts: (C) TCGA training cohort; (D) TCGA testing cohort; (E) entire TCGA cohort; (F) GEO cohort. We performed survival analyses in the TCGA test set, all TCGA set, and the GEO cohort. HRG had a significantly worse prognosis than LRG ( ). In the three validation sets, the 1-year AUC was 0.761, 0.664, and 0.589, respectively ( ). In the GEO cohort, the low AUC values at 1, 3, and 5 years were partly due to the short median follow-up time of patients.

Construction of a Nomogram Containing RS for TME-Related Genes

We used univariate and multivariate Cox regression to examine the relationship between potential variables and OS, which included RS, TNM stage, pathological grade, age, and sex ( ). The results showed that RS (HR: 1.045, 95% CI: 1.019–1.071, p<0.001), TNM stage(HR: 2.127, 95% CI: 1.416–3.196, p<0.001), and age (HR: 1.034, 95% CI: 1.017–1.052, p<0.001) were independent risk factors. Next, we constructed a nomogram including RS and clinical features ( ). Individualized 1-year, 3-year, and 5-year survival rates can be visualized based on the nomogram. For example, a 63-year-old male patient diagnosed with BLCA has clinical features of T3M0N0, high pathological grading, and low RS. The clinician can then calculate the corresponding scores for the variables, with a final total score of 442. Thus, the patient’s 1-year, 3-year, and 5-year survival rates can be inferred to be 0.944,0.797, and 0.685, respectively. The results showed a significant impact of RS on survival prediction. We drew calibration graphs of 1-year, 3-year, and 5-year OS in the training sets to demonstrate the consensus of our model with the real results ( ). The AUC of the model was higher than the other impact factors in the 3-year and 5-year ROC curve ( ). However, the RS was higher than the AUC of the nomogram at 1-year ROC curve, which was the highest of all factors ( ). Finally, DCA to assess the clinical utility of nomograms. Both nomogram and RS showed good consistency in forecasting OS at 1, 3, and 5 years compared with a singular prognostic factor ( ).
Table 3

Univariable and multivariable Cox regression to analyze the relationship between the RS and clinical prognosis.

VariablesUnivariable AnalysisMultivariable Analysis
HR95%CI P-ValueHR95%CI P-Value
Age1.0381.021–1.056<0.0011.0341.017–1.052<0.001
Gender0.8910.632–1.2570.511~~~
Grade1.7790.438–7.2270.421~~~
Stage2.3301.555–3.492<0.0012.1271.416–3.196<0.001
Risk Score1.0571.032–1.083<0.0011.0451.019–1.071<0.001
Figure 5

(A–C) ROC curves of the nomograms compared with other clinical variables. (D–F) Decision curve analysis (DCA) curves can evaluate the clinical benefit of the nomograms and their potential scope of application. None indicates that all samples are negative and none are treated, therefore the net benefit is zero. All indicates that all samples are positive and all are treated. The x-axis represents the threshold probability.

Univariable and multivariable Cox regression to analyze the relationship between the RS and clinical prognosis. (A–C) ROC curves of the nomograms compared with other clinical variables. (D–F) Decision curve analysis (DCA) curves can evaluate the clinical benefit of the nomograms and their potential scope of application. None indicates that all samples are negative and none are treated, therefore the net benefit is zero. All indicates that all samples are positive and all are treated. The x-axis represents the threshold probability.

Comparison of the Eight-Gene Signature Risk Model With Other Models

We compared the prediction models of our eight-gene signature with other models to demonstrate the predictive performance of our model, including models of the identified 3-gene (25), 5-gene (26), and 7-gene signature (27). We also computed RS for every sample by multivariate Cox regression analysis. We evaluated the ROC of the four models according to the corresponding genes. The cases were then classified into HRG and LRG based on the median RS value. In all four models, there was a significant difference in survival time between patients in the HRG and LRG groups, with patients in LRH having a better prognosis than those in HRG ( ). However, the ROC of the previous models showed lower AUCs and, therefore, the other three models have worse predictions compared to our nomogram. ( ).
Figure 6

(A–D) ROC of three other published gene signatures. (E–H) KM curves of three other published gene signatures. (I) Concordance index (C-index) of the four prognostic risk models including our model, which has the highest C-index. (J) Restricted mean survival (RMS) time curve of all five prognostic risk models, revealing an overlap of 60 months.

(A–D) ROC of three other published gene signatures. (E–H) KM curves of three other published gene signatures. (I) Concordance index (C-index) of the four prognostic risk models including our model, which has the highest C-index. (J) Restricted mean survival (RMS) time curve of all five prognostic risk models, revealing an overlap of 60 months. We calculated the C-index of the four models and found that our model has the highest C-index with 0.695 ( ). RMS time was used to estimate the forecasting effectiveness of the model at different time points. Our model was found to outperform the other two gene signatures at > 60 months except for Zhou signature. This demonstrates that our nomogram has an advantage over the other models in predicting both patient survival up to 5 years and patient survival beyond 60 months ( ).

Functional Analysis of Genes Between HRG and LRG

We ran GSEA on samples within the HRG and LRG. We identified the following pathways with associated normalized enrichment score (NES) and the adjusted p-value (q-value) enriched in the HRG ( ): KEGG_FOCAL_ADHESION (NES =2.1496, q =1.57E-08), KEGG_ECM_RECEPTOR_INTERACTION (NES =2.2856, q =4.50E-07), KEGG_REGULATION_OF_ACTIN_CYTOSKELETON (NES =2.0206, q =1.25E-06), KEGG_SYSTEMIC_LUPUS_ERYTHEMATOSUS (NES =2.0084, q =4.96E-05), KEGG_PATHWAYS_IN_CANCER (NES =1.7137, q =0.0003), GOBP_CELLULAR_ION_HOMEOSTASIS (NES =1.9040, q =1.71E-08), GOBP_CIRCULATORY_SYSTEM_PROCESS (NES =2.0543, q =1.71E-08), GOBP_CORNIFICATION (NES =2.5951, q =1.71E-08), GOBP_EPIDERMAL_CELL_DIFFERENTIATION (NES =2.6868, q =1.71E-08), GOBP_EPIDERMIS_DEVELOPMENT (NES =2.5372, q =1.71E-08).
Figure 7

(A) KEGG analysis of our signature in the HRG. (B) Correlation between RS, immune cell score and TMB. (C) Correlation between RS of BLCA samples and expression of representative genes for generic pathway targets in oncology. (D) GO analysis of our signature in the HRG. (E) RS according to the effectiveness of immunotherapy expressed as complete response (CR), partial response (PR), stable disease (SD), or progressive disease (PD). (F) Correlation between RS of BLCA samples and immune scores.

(A) KEGG analysis of our signature in the HRG. (B) Correlation between RS, immune cell score and TMB. (C) Correlation between RS of BLCA samples and expression of representative genes for generic pathway targets in oncology. (D) GO analysis of our signature in the HRG. (E) RS according to the effectiveness of immunotherapy expressed as complete response (CR), partial response (PR), stable disease (SD), or progressive disease (PD). (F) Correlation between RS of BLCA samples and immune scores. Classical pathways in tumors include immune checkpoints, DNA duplication, mismatch repair, and epithelial-mesenchymal transition (EMT). The correlation of RS with target genes was analyzed by extracting the expression of the relevant pathway target genes from the samples. The results showed that RS was positively correlated with EMT-related genes (F-AP, TAGLN, and LOXL2) ( ). We computed the immune cell fraction for every sample of the TCGA-BLCA set using an MCP counter and then compared their correlation with RS. As shown in , RS was positively correlated with CD8+ T cells (p<0.05), endothelial cells (p<0.05), and fibroblasts (p<0.05). In addition, we evaluated the correlation between RS, TMB, and infiltrating cells by MCP counter with the results shown in . Our analysis showed that there were also differences in RS between clinical subgroups of BLCA patients. More specifically, patients older than 65 years, with no history of radiotherapy, female, and with high TNM stage had higher RS ( ). The K-M survival curves of patients in both stage I-II and III-IV subgroups also proved our results ( ). In addition, the K-M survival curves of patients in both MIBC and NMIBC subgroups also proved our results ( ).
Figure 8

(A–E) RS comparison of clinical information of BLCA samples. (F, G) Evaluation of the eight-gene risk model in different stage groups with OS KM curves comparing HRG and LRG. (H, I) Evaluation of the eight-gene risk model in different degree of tumor infiltration groups with OS KM curves comparing HRG and LRG.

(A–E) RS comparison of clinical information of BLCA samples. (F, G) Evaluation of the eight-gene risk model in different stage groups with OS KM curves comparing HRG and LRG. (H, I) Evaluation of the eight-gene risk model in different degree of tumor infiltration groups with OS KM curves comparing HRG and LRG.

Prediction of Response to Immunotherapy Based on RS Model

We selected patients with urologic tumors in the IMvigor210 cohort who had received PD-L1 blockade therapy to observe the effectiveness against immunotherapy. We calculated the RS for each sample and classified them into HRG and LRG based on the median RS value. the RS showed significantly higher values (p = 0.047) in patients with SD/PD compared to patients with CR/PR ( ). This suggests that our RS model may be useful in predicting patient response to PD-L1 blocking therapy.

Discussion

The major discovery of our work was a prediction model according to the TME - associated gene signature of BLCA. TME heterogeneity has an essential role in patient prognosis and in predicting the effectiveness of targeted therapies (28–30). 4061 TME-related genes and 359 BLCA samples of TCGA were evaluated in this work. We applied the NMF method to distinguish two molecular clusters, which is a promising novel clustering method. There were differences in immune scores for TME-infiltrating cell types between C1 and C2. In addition, correlation analysis with prognostic indicators OS and PFS also showed differences between C1 and C2. These results demonstrate the heterogeneity of TME. We found that our prognostic model based on eight TME-related genes (CABP4, ZNF432, BLOC1S3, CXCL11, ANO9, OAS1, FBN2, CEMIP) was able to accurately forecast the probability of survival in individuals with BLCA. Prior researches have examined several genes included in our signature under a variety of tumors. For instance, CXCL11 has an important role in the production of chemokines. First, these mediators can trigger the accumulation of CD8+ T cells that can contribute to the elimination of the tumor. Secondly, the production of these chemokines by tumor tissue may trigger the migration and activation of immune cells including myeloid-derived suppressor cells and regulatory T cells, which act in favor of the tumor and its progress (31). FBN2 is highly expressed in lung cancer tissues, and as an oncogene, it affects the pathogenesis of lung cancer (32). In addition, Chen Y (33) found that CEMIP can promote a variety of tumor processes by affecting tumor proliferation, dedifferentiation, and the tumor microenvironment. In terms of molecular mechanisms, existing research has shown that CEMIP mainly affects the WNT and EGFR signaling pathways. Multivariate Cox regression analysis revealed RS (HR =1.045, 95% CI: 1.019-1.071) as an independent risk factor, which was validated in both the internal and external sets. We then defined HRG and LRG in the TCGA cohort based on the median RS value. For example, we found that RS was positively correlated with EMT-related genes. In general, a greater degree of immune infiltration could explain a more immune defense in LRG, and thus a more favorable prognosis. Conversely, positively correlated EMT genes may lead to a higher propensity for metastasis and a poorer outcome for BLCA patients in HRG. In addition, RS differs between clinical subgroups of BLCA patients, with a worse prognosis in the elderly, women, and patients with high TNM staging. Thus, our results suggest that RS can provide resilient risk stratification for BLCA patients. Nomograms are a well-recognized statistical method for visualizing and predicting the probability of survival of patients (34–36). We innovatively constructed an advanced nomogram for the prognosis of BLCA patients according to TME-RS. The ROC and DCA proved that they can exactly assess the outcome of patients. When we compared our nomogram with three other predictive genes signatures by Wang et al. (27), Zhou et al. (25), and Li et al. (26). Our nomogram showed the highest C-index. These findings suggest that the clinical utility of our constructed nomogram is outperformed by other models. Next, our team will transfer this RS model to other urological tumors such as renal clear cell carcinoma to demonstrate the potential pan-cancer usability of the model. Of course, there were still some deficiencies in our research. First of all, all our data come from the TCGA database and GEO database, and the sample size was not large enough, which may lead to the bias of the results. Secondly, our conclusions were also lack experimental verification. Therefore, it was necessary to conduct multicenter, large sample, prospective double-blind trials for further verification in the future.

Conclusions

In conclusion, our eight TME-associated gene signatures combined with clinical features may more accurately predict patient survival at 1, 3, and 5 years. Both external and internal validation were able to have the ability to verify the forecasting capability of the model. RS can be promising to predict the response to immunotherapy of BLCA individuals with PD-L1 blocking therapy.

Data Availability Statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found in the article/ .

Author Contributions

CX, ZK, and DP designed this work. YL analyzed the data. CX and DP wrote this manuscript. ZK edited and revised the manuscript. JG, NL and YY did a lot of work in revising the paper, including re-screening the data, analyzing the data, plotting the figures and the layout of the figures. YY embellished the language of the paper. All authors approved this manuscript.

Funding

This study was funded by the Health Care Commission of Henan Province (201403125, LHGJ20190422) and the Department of Education of Henan Province (21A320037).

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
  36 in total

Review 1.  Tumor microenvironment: Sanctuary of the devil.

Authors:  Lanlan Hui; Ye Chen
Journal:  Cancer Lett       Date:  2015-08-11       Impact factor: 8.679

2.  Radical Cystectomy and the Multidisciplinary Management of Muscle-Invasive Bladder Cancer.

Authors:  Edmund Charles Paul Chedgy; Peter C Black
Journal:  JAMA Oncol       Date:  2016-07-01       Impact factor: 31.777

Review 3.  Accessories to the crime: functions of cells recruited to the tumor microenvironment.

Authors:  Douglas Hanahan; Lisa M Coussens
Journal:  Cancer Cell       Date:  2012-03-20       Impact factor: 31.743

4.  Fibrillin 2 gene knockdown inhibits invasion and migration of lung cancer cells.

Authors:  Qiaojun Hong; Rong Li; Yiyan Zhang; Kangsheng Gu
Journal:  Cell Mol Biol (Noisy-le-grand)       Date:  2020-10-31       Impact factor: 1.770

Review 5.  The role of CEMIP in tumors: An update based on cellular and molecular insights.

Authors:  Yu Chen; Hu Zhou; Wen-Jing Jiang; Jia-Fei Wang; Yuan Tian; Yan Jiang; Bai-Rong Xia
Journal:  Biomed Pharmacother       Date:  2021-12-15       Impact factor: 6.529

6.  Robust enumeration of cell subsets from tissue expression profiles.

Authors:  Aaron M Newman; Chih Long Liu; Michael R Green; Andrew J Gentles; Weiguo Feng; Yue Xu; Chuong D Hoang; Maximilian Diehn; Ash A Alizadeh
Journal:  Nat Methods       Date:  2015-03-30       Impact factor: 28.547

Review 7.  Microenvironmental Regulation of Tumor Progression and Therapeutic Response in Brain Metastasis.

Authors:  Michael Schulz; Anna Salamero-Boix; Katja Niesel; Tijna Alekseeva; Lisa Sevenich
Journal:  Front Immunol       Date:  2019-07-24       Impact factor: 7.561

8.  The construction and validation of an RNA binding protein-related prognostic model for bladder cancer.

Authors:  Fengxia Chen; Qingqing Wang; Yunfeng Zhou
Journal:  BMC Cancer       Date:  2021-03-08       Impact factor: 4.430

9.  Immune infiltration profiling in gastric cancer and their clinical implications.

Authors:  Feifei Ren; Qitai Zhao; Minghai Zhao; Shaogong Zhu; Bin Liu; Ihtisham Bukhari; Kai Zhang; Wanqing Wu; Yuming Fu; Yong Yu; Youcai Tang; Pengyuan Zheng; Yang Mi
Journal:  Cancer Sci       Date:  2021-07-12       Impact factor: 6.716

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.