| Literature DB >> 35626146 |
Rafael Stroggilos1, Maria Frantzi2, Jerome Zoidakis1, Marika Mokou2, Napoleon Moulavasilis3, Emmanouil Mavrogeorgis1, Anna Melidi1, Manousos Makridakis1, Konstantinos Stravodimos3, Maria G Roubelakis4,5, Harald Mischak2, Antonia Vlahou1.
Abstract
Despite advancements in molecular classification, tumor stage and grade still remain the most relevant prognosticators used by clinicians to decide on patient management. Here, we leverage publicly available data to characterize bladder cancer (BLCA)'s stage biology based on increased sample sizes, identify potential therapeutic targets, and extract putative biomarkers. A total of 1135 primary BLCA transcriptomes from 12 microarray studies were compiled in a meta-cohort and analyzed for monotonal alterations in pathway activities, gene expression, and co-expression patterns with increasing stage (Ta-T1-T2-T3-T4), starting from the non-malignant tumor-adjacent urothelium. The TCGA-2017 and IMvigor-210 RNA-Seq data were used to validate our findings. Wnt, MTORC1 signaling, and MYC activity were monotonically increased with increasing stage, while an opposite trend was detected for the catabolism of fatty acids, circadian clock genes, and the metabolism of heme. Co-expression network analysis highlighted stage- and cell-type-specific genes of potentially synergistic therapeutic value. An eight-gene signature, consisting of the genes AKAP7, ANLN, CBX7, CDC14B, ENO1, GTPBP4, MED19, and ZFP2, had independent prognostic value in both the discovery and validation sets. This novel eight-gene signature may increase the granularity of current risk-to-progression estimators.Entities:
Keywords: bladder cancer stage; meta-analysis; molecular alterations; prognostic signature; transcriptomics; urothelial carcinoma
Year: 2022 PMID: 35626146 PMCID: PMC9140126 DOI: 10.3390/cancers14102542
Source DB: PubMed Journal: Cancers (Basel) ISSN: 2072-6694 Impact factor: 6.575
Figure 1Study design and workflow for the analysis of the selected primary BLCA transcriptomes.
Stage distribution across the 12 microarray datasets used for the discovery set.
| Technology | Platform | Dataset ID | NAU | Ta | T1 | T2 | T3 | T4 | Study Size |
|---|---|---|---|---|---|---|---|---|---|
| Affymetrix | GPL17586 | GSE121711 | 10 | 3 | 2 | 3 | 18 | ||
| Affymetrix | GPL17586 | GSE93527 | 79 | 79 | |||||
| Affymetrix | GPL570 | E-MTAB-1940 | 4 | 41 | 41 | 86 | |||
| Affymetrix | GPL570 | GSE31684 | 5 | 10 | 16 | 41 | 18 | 90 | |
| Affymetrix | GPL6244 | GSE104922 | 19 | 7 | 10 | 5 | 0 | 41 | |
| Affymetrix | GPL6244 | GSE128959 | 13 | 39 | 53 | ||||
| Affymetrix | GPL6244 | GSE83586 | 13 | 44 | 241 | 1 | 1 | 301 | |
| Illumina | GPL14951 | GSE48276 | 2 | 6 | 23 | 6 | 37 | ||
| Illumina | GPL14951 | GSE52219 | 15 | 7 | 1 | 23 | |||
| Illumina | GPL14951 | GSE69795 | 3 | 7 | 27 | 37 | |||
| Illumina | GPL6102 | GSE13507 | 67 | 23 | 80 | 31 | 19 | 12 | 232 |
| Illumina | GPL6947 | GSE48075 | 33 | 34 | 42 | 23 | 8 | 142 | |
|
| 81 | 229 | 262 | 371 | 146 | 46 | 1135 |
Clinical data of the cohorts used.
| Kerrypnx | Discovery Set | TCGA-BLCA-2017 | IMvigor210 |
|---|---|---|---|
|
| 1135 | 188 | 132 |
|
| 67 (24–95) | 69 (34–90) | |
|
| |||
| Female | 135 (11.9%) | 45 (23.9%) | 29 (22.0%) |
| Male | 459 (40.5%) | 143 (76.1%) | 103 (78.0%) |
| No info | 541 (47.6%) | 0 (0.0%) | 0 (0.0%) |
|
| |||
| NAU | 81 (7.1%) | 0 (0.0%) | |
| Ta | 229 (20.2%) | 0 (0.0%) | |
| T1 | 262 (23.1%) | 0 (0.0%) | |
| T2 | 371 (32.7%) | 70 (37.3%) | |
| T3 | 146 (12.9%) | 90 (47.9%) | |
| T4 | 46 (4.0%) | 28 (14.8%) | |
|
| |||
| G1 | 13 (1.1%) | 0 (0.0%) | |
| G2 | 164 (14.5%) | 0 (0.0%) | |
| G3 | 372 (32.8%) | 188 (100%) | |
| Gx | 13 (1.1%) | 0 (0.0%) | |
| No info | 573 (50.5%) | 0 (0.0%) | |
|
| |||
| Complete | 11 (8.3%) | ||
| Partial | 20 (15.2%) | ||
| Stable disease | 25 (18.9%) | ||
| Progressed disease | 76 (57.6%) | ||
Figure 2Excerpt of the pathways showing a monotonal increase or decline in their activation scores with higher stage. Pathway activation scores were z-scaled across samples for visualization. Pathways are colored based on their database of origin. The top side of the heatmap presents dataset and clinical information. Samples (columns) are ordered based on the stage variable; from left to right: non-malignant adjacent urothelium (NAU), Ta, T1, T2, T3, and T4. Red lines indicate boundaries between adjacent stages.
Figure 3Biological process analysis of the largest (in size) co-expressed communities identified in each BLCA stage network: (A) Coherent communities identified and characterized across the non-malignant adjacent urothelium (NAU) and disease stages. The presence of a community is indicated by the + symbol. Numbers in parentheses show the fraction of genes with GO Biological Process annotation relevant to the community, with respect to the total number of genes found to be co-expressed in the community. (B) Bar plots of the most significantly enriched biological processes per community, depicting the number of co-expressed genes for each. (C) Hub genes identified across the studied conditions based on the betweenness centrality scores (y-axis).
Figure 4Differential expression analysis between non-malignant adjacent urothelium (NAU) and BLCA stages: (A) Volcano plots of the five stages’ comparisons to NAU, with the color red indicating the 157 genes showing a monotonal trend of expression across stages. (B) Heatmap of the absolute fold changes of the 157 monotonal genes, which were continuously either up- (yellow) or downregulated (blue), in the comparisons between disease stages and NAU. (C) Top 15 pathways of the 157 monotonal genes, sorted by their GeneCards enrichment score.
Figure 5Validation of key findings in the TCGA-2017 and the IMvigor210 cohorts: (A) Forest plots showing hazard ratios (HRs) for the 8 monotonal genes with univariate prognostic value in both the discovery and TCGA validation datasets. (B) Multivariate analysis of stage and the 8-gene signature scores in the discovery and TCGA validation sets. (C) 5-year survival analysis between patients with high and low 8-gene signature scores, in the discovery and the TCGA data. (D) Data from the IMvigor210 trial illustrating AIF1 expression across responses to atezolizumab in immunotherapy groups.