Literature DB >> 33791209

Subgroup-Independent Mapping of Renal Cell Carcinoma-Machine Learning Reveals Prognostic Mitochondrial Gene Signature Beyond Histopathologic Boundaries.

André Marquardt1,2,3, Antonio Giovanni Solimando4,5, Alexander Kerscher1, Max Bittrich6, Charis Kalogirou7, Hubert Kübler7, Andreas Rosenwald2, Ralf Bargou1, Philip Kollmannsberger8, Bastian Schilling9, Svenja Meierjohann2,3, Markus Krebs1,7.   

Abstract

Background: Renal cell carcinoma (RCC) is divided into three major histopathologic groups-clear cell (ccRCC), papillary (pRCC) and chromophobe RCC (chRCC). We performed a comprehensive re-analysis of publicly available RCC datasets from the TCGA (The Cancer Genome Atlas) database, thereby combining samples from all three subgroups, for an exploratory transcriptome profiling of RCC subgroups. Materials and
Methods: We used FPKM (fragments per kilobase per million) files derived from the ccRCC, pRCC and chRCC cohorts of the TCGA database, representing transcriptomic data of 891 patients. Using principal component analysis, we visualized datasets as t-SNE plot for cluster detection. Clusters were characterized by machine learning, resulting gene signatures were validated by correlation analyses in the TCGA dataset and three external datasets (ICGC RECA-EU, CPTAC-3-Kidney, and GSE157256).
Results: Many RCC samples co-clustered according to histopathology. However, a substantial number of samples clustered independently from histopathologic origin (mixed subgroup)-demonstrating divergence between histopathology and transcriptomic data. Further analyses of mixed subgroup via machine learning revealed a predominant mitochondrial gene signature-a trait previously known for chRCC-across all histopathologic subgroups. Additionally, ccRCC samples from mixed subgroup presented an inverse correlation of mitochondrial and angiogenesis-related genes in the TCGA and in three external validation cohorts. Moreover, mixed subgroup affiliation was associated with a highly significant shorter overall survival for patients with ccRCC-and a highly significant longer overall survival for chRCC patients. Conclusions: Pan-RCC clustering according to RNA-sequencing data revealed a distinct histology-independent subgroup characterized by strengthened mitochondrial and weakened angiogenesis-related gene signatures. Moreover, affiliation to mixed subgroup went along with a significantly shorter overall survival for ccRCC and a longer overall survival for chRCC patients. Further research could offer a therapy stratification by specifically addressing the mitochondrial metabolism of such tumors and its microenvironment.
Copyright © 2021 Marquardt, Solimando, Kerscher, Bittrich, Kalogirou, Kübler, Rosenwald, Bargou, Kollmannsberger, Schilling, Meierjohann and Krebs.

Entities:  

Keywords:  kidney cancer; mTOR; machine learning; mitochondrial DNA; mtDNA; pan-RCC

Year:  2021        PMID: 33791209      PMCID: PMC8005734          DOI: 10.3389/fonc.2021.621278

Source DB:  PubMed          Journal:  Front Oncol        ISSN: 2234-943X            Impact factor:   6.244


Introduction

Basic and clinical research in renal cell carcinoma (RCC) mainly focuses on established histopathologic subgroups, specifically clear cell (ccRCC), papillary (pRCC) and chromophobe RCC (chRCC). Accordingly, histopathology is of crucial relevance for determining treatment strategies including drug sequencing in RCC patients, especially in a metastasized situation. As reflected in the WHO classification for renal neoplasms (1), dividing RCC in three distinct (sub-)entities does not completely mirror tumor biology and its complexity. Instead, sub-categories such as clear cell papillary RCC (2) were introduced, indicating substantial greyscales between classical histopathologic subgroups. By performing transcriptomic analyses, researchers have identified characteristic signatures of ccRCC, pRCC, and chRCC—thereby supporting established histopathologic classification (3–5). Although comprehensive pan-RCC analyses have been performed previously, the boundaries of histopathologic origin usually were not scrutinized (6, 7). Using principal component analysis (PCA) with subsequent machine learning (ML) algorithms, we mapped 891 RCC specimen irrespective of histopathologic boundaries. Following this comprehensive pan-RCC approach allowed us to identify novel RCC subgroups with a prognostic impact for cancer patients and provide first functional insight.

Materials and Methods

Data Acquisition

This work mainly based on data provided by The Cancer Genome Atlas (TCGA) consortium. Utilized entities were ccRCC (KIRC cohort, n = 538 tumor samples), chRCC (KICH cohort, n = 65 samples) and pRCC (KIRP cohort, n = 288 samples) downloaded from the GDC portal (https://portal.gdc.cancer.gov). For evaluation, we used data provided by the ICGC (international network of cancer genome projects) (8), specifically the RECA-EU data set, comprising of n = 91 ccRCC samples (https://dcc.icgc.org/projects/RECA-EU) with available RNA-sequencing data. Additionally, we used ccRCC samples from the CPTAC-3-Kidney cohort (n = 101) as further external validation (https://portal.gdc.cancer.gov/projects/CPTAC-3) (9). Regarding RCC caused by hereditary leiomyomatosis (hlRCC)—also known as fumarate hydratase (FH)-deficient RCC—we further examined the smaller GSE157256 cohort (10) as another source for evaluation (n = 26) (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE157256). For all mentioned datasets, we used unprocessed FPKM or RSEM (GSE157256) values as provided.

Bioinformatical Analyses

The presented work was implemented in a Jupyter Notebook environment (version 7.5.0)—which is available upon request—using the Python version 3.6.9, SciPy version 1.3.0 (11) and scikit-learn version 0.22.1 (12).

T-SNE Plotting

Our project was based on the 2D representation of high-dimensional data with subsequent cluster analysis using ML. For plotting unprocessed FPKM data in 2D, we performed a PCA with 50 components—using PCA of the sklearn.decomposition module—and used the results as input for t-SNE plotting (sklearn.manifold module) (13). For calculating and plotting in 2D, we used a random initiation with a learning rate of 300 and a perplexity of 27 with 10.000 iterations. For reproducibility, we used the random state 0 (n_components=2, init='random', perplexity=27, n_iter=10,000, learning_rate=300, random_state=0). Additionally, cluster annotation and t-SNE coordinates for each TCGA sample from all RCC subgroups are shown in Supplementary Table 1.

Random Forest Learning

After manual annotation, we used these clusters for subsequent learning steps. For this, we applied a model utilizing Random Forest (RF) Classifier (RandomForestClassifier of the sklearn.ensemble module). For training our model, we used a 70/30 split, letting the model learn on 70% of the data and evaluating it on the remaining 30%, with 1,000 trees in the forest (n_estimators=1,000), leaving out the pRCC samples not clustering in one of the three annotated clusters or the mixed subgroup. For further investigation, we trained 20 models and used the one with the highest test accuracy for subsequent feature analysis. For this purpose, we assigned the according “feature values,” implying the importance of each feature, to each feature, representing the Ensembl gene IDs. We identified the top 200 genes with the strongest influence on our model, which distinguished our manually annotated clusters with the highest accuracy. These top 200 genes of our best performing model overlapped in 92 genes with the mean of the other 19 trained models, outperforming them in test accuracy—with 92.06% compared to the mean of 83.42% (min. 79.37%, max. 86.11%). A 10-fold-crossvalidation of the data yielded a mean accuracy of 84.52% (+/- 9.16%).

Plots and Statistical Analysis

Correlation and scatter plots were generated using matplotlib. Indicated Pearson Rs were calculated using the according module from scipy.stats. For subsequent survival analysis of patient survival, Kaplan Meier (KM) plots were generated using the lifelines module (version 0.23.1) with the KaplanMeierFitter (14). If not stated otherwise, statistical tests were performed using Kruskal-Wallis-Test—using scipy.stats module including indicated significances, for which we used the statannot module for python (version 0.2.3; https://github.com/webermarcolivier/statannot). For the analysis of further interactions and relations between the identified 200 genes with the highest influence on the learned model, we used a Network generated by StringDB (15). The coloring of the nodes was done directly by StringDB for selected gene-sets stated as significant. For validation purposes of relevant pathways and genes previously identified by mRNA expression patterns, we used the level four protein expression levels provided by The Cancer Proteome Atlas (16) (TCPA—https://tcpaportal.org/tcpa/) for the three investigated cohorts.

Results

Clustering 891 RCC Samples Independently From Histopathologic Origin

For PCA, we used the RNAseq data of all registered RCC specimen (n = 891) within the TCGA database, irrespective of their histopathologic origin. We combined all tumor specimen in a t-SNE-plot in order to illustrate familiarities and discordances between samples based upon unprocessed FPKM values. Figure 1A represents the t-SNE plot, with ccRCC, pRCC and chRCC samples marked in red, green and blue, respectively. Of note, most tumor samples clustered to RCC subgroups, thereby confirming the familiarity and the validity of histopathologic classification. Figure 1B represents the schematic distribution of clusters identified within the t-SNE-plot. Interestingly, pRCC samples did not cluster in a single subgroup, but instead in three distinct subgroups (cluster I-III), whereas ccRCC specimen built another cluster (IV). However, apart from most samples clustering according to histopathology, we identified a distinct cluster containing a mixture of ccRCC, pRCC and chRCC samples (Figure 1B; cluster V). We named this accumulation mixed subgroup. As depicted in Figure 1C, we manually split and defined the novel clusters for further ML-based analyses. Aside from three distinct pRCC clusters, which surely merit future investigation, we were mainly interested in this mixed subgroup—containing 19% of ccRCC, 36.8% of pRCC and 81.5% of chRCC samples (Figure 1D). Of note, our clustering approach revealed no clear separation between type 1 and type 2 pRCC (Supplementary Figure 1).
Figure 1

(A) t-SNE-plot for RNA-sequencing data from ccRCC (red), pRCC (green) and chRCC (blue) specimen within the TCGA database. (B) Visually identified clusters—I to III: distinct pRCC subgroups; IV: ccRCC samples; V: mixed subgroup containing ccRCC, pRCC and chRCC tumors. (C) Manually defined clusters based on visual separation. (D) Pie charts illustrating absolute numbers and proportions of RCC samples inside/outside the mixed subgroup for each RCC subgroup.

(A) t-SNE-plot for RNA-sequencing data from ccRCC (red), pRCC (green) and chRCC (blue) specimen within the TCGA database. (B) Visually identified clusters—I to III: distinct pRCC subgroups; IV: ccRCC samples; V: mixed subgroup containing ccRCC, pRCC and chRCC tumors. (C) Manually defined clusters based on visual separation. (D) Pie charts illustrating absolute numbers and proportions of RCC samples inside/outside the mixed subgroup for each RCC subgroup.

Clinical Characterization of Patient Samples From Mixed Subgroup

We next examined the clinical characteristics of RCC patient samples depending on their affiliation to the mixed subgroup (Table 1). Comparing ccRCC samples inside and outside the mixed subgroup, we found no significant differences in age, gender, tumor stage, tumor extension (T classification), lymphonodal invasion (N classification) or metastasis (M classification). In contrast, tumor grading was significantly different (p = 0.014). For pRCC, mixed subgroup patients were significantly older (65.1 ± 10.9 vs. 59.6 ± 12.1 years; p = 0.0002) than patients with pRCC not belonging to this cluster. Moreover, the proportion of male patients was significantly higher in the mixed subgroup (p = 0.001). In contrast to the age distribution in patients with pRCC, chRCC samples from the mixed subgroup were significantly younger (49.6 ± 13.2 vs. 61.9 ± 12.9 years; p = 0.012). In addition, the lymphonodal status differed significantly between the two subgroups (p = 0.005).
Table 1

Clinical characteristics of RCC patients inside and outside the mixed subgroup.

ccRCC non-mixedccRCC mixedppRCC non-mixedpRCC mixedpchRCC non-mixedchRCC mixedp
n = 428n = 102n = 182n = 105n = 12n = 53
Agemean60.2 ± 2.262.0 ± 11.80.19659.6 ± 12.165.1 ± 10.90.0002161.9 ± 12.949.6 ± 13.20.012
Genderm273 (63.79%)71 (69.61%)0.269123 (67.58%)89 (84.76%)0.00110 (83.33%)29 (54.72%)0.07
f155 (36.21%)31 (30.39%)59 (32.42%)16 (15.24%)2 (16.67%)24 (45.28%)
Tumor stageI223 (52.47%)42 (64.29%)0.166108 (64,70%)64 (38,32%)0.4192 (16.67%)18 (33.96%)0.1
II43 (10.12%)14 (8.33%)16 (9,60%)4 (2,40%)5 (41.67%)20 (37.74%)
III90 (21.18%)33 (19.64%)34 (20,30%)16 (9,58%)1 (8.33%)13 (24.53%)
IV69 (16.23%)13 (7.74%)9 (5,40%)6 (3,60%)4 (33.33%)2 (3.77%)
TT1228 (53.27%)43 (42.57%)0.092119 (64.67%)74 (70.48%)0.4632 (16.67%)18 (33.962%)0.14
T253 (12.38%)16 (15.84%)22 (11.96%)10 (9.52%)5 (41.67%)20 (37.74%)
T3137 (32.00%)41 (40.59%)39 (21.20%)20 (19.05%)3 (25%)15 (28.30%)
T410 (2.33%)1 (0.99%)4 (2.17%)1 (0.95%)2 (16.66%)0 (0%)
NN0192 (93.66%)47 (94%)0.92929 (59.18%)20 (71.43%)0.214 (57.14%)35 (94.60%)0.005
N113 (6.34%)3 (6%)16 (32.66%)8 (28.57%)2 (28.57%)1 (2.7%)
N20 (0%)0 (0%)4 (8.16%)0 (0%)1 (14.29%)1 (2.7%)
MM019 (90.48%)3 (75%)0.39260 (63.16%)35 (89.74%)0.6544 (80%)3 (75%)0.866
M12 (9.52%)1 (25%)35 (36.84%)4 (10.26%)1 (20%)1 (25%)
GradingG113 (3.06%)13 (11.93%)0.014
G2195 (45.88%)32 (29.36%)
G3158 (37.18%)48 (44.04%)
G459 (13.88%)16 (14.67%)

Except for age (mean ± standard deviation), all characteristics were presented as absolute values. p-values highlighted as bold were significant for p < 0.05.

Clinical characteristics of RCC patients inside and outside the mixed subgroup. Except for age (mean ± standard deviation), all characteristics were presented as absolute values. p-values highlighted as bold were significant for p < 0.05.

ML-Based Functional Characterization of Patient Samples Affiliated to Mixed Subgroup

To learn more about functional traits and characteristic differences of the clusters, we applied further ML based on the visual separation (Figure 1C). Therefore, we determined the top 200 genes best classifying the novel clusters. As shown in Figure 2, we depicted these genes in a StringDB gene network to uncover relevant signaling pathways. We found a substantial accumulation of mitochondrial genes—with String DB identifying “oxidative phosphorylation” (GO:0006119) and “respiratory electron transport chain” (GO:0022904) as highly overrepresented pathways in our analysis. Additionally, “blood vessel development” (GO:0001568) and “blood vessel morphogenesis” (GO:0048514) were also highly overrepresented.
Figure 2

StringDB network of the top 200 genes identified as relevant classifiers for RCC sample clusters from Figure 1C. Genes affiliated with oxidative phosphorylation and respiratory electron transport chain are marked in red and blue, genes related to blood vessel morphogenesis and blood vessel development are marked in green and yellow.

StringDB network of the top 200 genes identified as relevant classifiers for RCC sample clusters from Figure 1C. Genes affiliated with oxidative phosphorylation and respiratory electron transport chain are marked in red and blue, genes related to blood vessel morphogenesis and blood vessel development are marked in green and yellow. Moreover, mtDNA genes represented all of the top 10 classifier genes in our RF calculation—as shown in Table 2. In conclusion, we found mitochondrial and angiogenesis-related gene signatures to be most predictive within our clustering approach.
Table 2

Gene families significantly overrepresented in the top 200 cluster classifying genes from Random Forest (RF) analysis.

Mitochondrial GenesAngiogenesis-related Genes
HGNC SymbolEnsembl gene IDRF-Feature PositionHGNC SymbolEnsembl gene IDRF-Feature Position
MT-CYBENSG000001987271ETS1ENSG0000013495413
MT-ND4ENSG000001988862ANGPT2ENSG0000009187933
MT-CO1ENSG000001988043APLNENSG0000017138837
MT-CO3ENSG000001989384FLT1ENSG0000010275538
MT-CO2ENSG000001987125CRKLENSG0000009994246
MT-ND4LENSG000002129076ITGA5ENSG0000016163854
MT-ATP6ENSG000001988997NRP1ENSG0000009925056
MT-RNR1ENSG000002114598PRDM1ENSG0000005765793
MTATP6P1ENSG000002485279PTENENSG00000171862109
MT-ND1ENSG0000019888810VEGFAENSG00000112715112
MT-ND2ENSG0000019876320ACKR3ENSG00000144476114
MT-ND3ENSG0000019884024CDH13ENSG00000140945146
MT-RNR2ENSG0000021008225BMPR2ENSG00000204217148
CALCRLENSG00000064989177
ESM1ENSG00000164283191

For each gene, HGNC symbol, Ensembl gene IDs, and the position in our calculation is shown.

Gene families significantly overrepresented in the top 200 cluster classifying genes from Random Forest (RF) analysis. For each gene, HGNC symbol, Ensembl gene IDs, and the position in our calculation is shown.

Mitochondrial and Angiogenesis-Related Genes Inside and Outside Mixed Subgroup

Alterations and overexpression of mtDNA have been described as characteristic traits of chRCC (5, 17, 18)—and more than 80% of the chRCC samples in our analysis were located in the mixed subgroup (see Figure 1D). Due to this relative overrepresentation of chRCC in this cluster, we first checked whether our RF analysis was biased by a high proportion of chRCC samples. For this reason, we compared unprocessed FPKM values of mitochondrial genes for ccRCC, pRCC, and chRCC samples depending on the affiliation to the mixed subgroup. We found a highly significant overexpression of mitochondrial genes for chRCC samples inside compared to samples outside the mixed subgroup. However, alterations in mtDNA expression were not limited to chRCC. Instead, mixed subgroup samples from pRCC as well as ccRCC exhibited a highly significant mtDNA overexpression. Figures 3A,B illustrate unprocessed FPKM values for candidate genes MT-CO2 (Figure 3A) and MT-CO3 (Figure 3B).
Figure 3

Unprocessed FPKM values of exemplary candidate genes–(A,B) MT-CO2 and MT-CO3, (C,D) FLT1 and KDR. ns, not significant, ****p < 0.0001.

Unprocessed FPKM values of exemplary candidate genes–(A,B) MT-CO2 and MT-CO3, (C,D) FLT1 and KDR. ns, not significant, ****p < 0.0001. For angiogenesis-related genes such as FLT1 (Figure 3C) and KDR (Figure 3D), we discovered significantly lower expression levels within ccRCC samples from mixed subgroup. Additionally, we discovered significant expression differences for genes displayed in Table 2, regardless of the underlying histopathologic entity, when compared to normal tissue samples (Supplementary Figures 2–7). Regarding expression of mitochondrial and angiogenesis-related genes in ccRCC, we found negative Pearson R correlations in the TCGA dataset (Figure 4A) as well as all three RCC validation cohorts (Figures 4B–D). In line with a weaker angiogenesis signature, ccRCC and pRCC samples from mixed subgroup displayed significantly lower levels of c-MET (Supplementary Figure 8).
Figure 4

Color-coded presentation of the Pearson R correlation matrix of mitochondrial genes and angiogenesis-associated genes for ccRCC samples from the (A) TCGA, (B) the ICGC RECA-EU, and (C) the CPTAC-3-Kidney cohort as well as (D) Fumarate hydratase-deficient RCC samples contained within the GSE157256 cohort.

Color-coded presentation of the Pearson R correlation matrix of mitochondrial genes and angiogenesis-associated genes for ccRCC samples from the (A) TCGA, (B) the ICGC RECA-EU, and (C) the CPTAC-3-Kidney cohort as well as (D) Fumarate hydratase-deficient RCC samples contained within the GSE157256 cohort. Summing up the results, mtDNA and angiogenesis signatures proved to be predictive not only for our pan-RCC clustering approach—but also specifically for ccRCC samples. Moreover, expression levels of mitochondrial and angiogenesis-associated genes were negatively correlated in four independent RCC cohorts.

Impact of Mixed Subgroup Affiliation on Patient Survival

After characterizing mixed subgroup samples from a clinical and a functional perspective, we next investigated whether an affiliation to this cluster impacted patient prognosis. Strikingly, survival analysis revealed a significantly worse prognosis (p = 0.005) for ccRCC patients from the TCGA database belonging to the mixed subgroup (Figure 5A). For chRCC patients (Figure 5B), cluster affiliation had the opposite effect—with significantly higher survival rates (p = 0.003) for patients inside the mixed subgroup. In contrast, there was no significant survival difference for patients with pRCC (Figure 5C).
Figure 5

(A,B) KM plots illustrating overall survival of patients with ccRCC (A), chRCC (B) and pRCC (C) from TCGA database depending on mixed subgroup affiliation. (D) Protein expression levels of bona fide candidate genes from mTOR-associated, angiogenesis-related and immune-related signaling for ccRCC, pRCC and chRCC samples inside (blue) and outside (red) the mixed subgroup (TCPA database). ns, not significant. *p < 0.05, **p < 0.01, ***p < 0.001, ****p < 0.0001.

(A,B) KM plots illustrating overall survival of patients with ccRCC (A), chRCC (B) and pRCC (C) from TCGA database depending on mixed subgroup affiliation. (D) Protein expression levels of bona fide candidate genes from mTOR-associated, angiogenesis-related and immune-related signaling for ccRCC, pRCC and chRCC samples inside (blue) and outside (red) the mixed subgroup (TCPA database). ns, not significant. *p < 0.05, **p < 0.01, ***p < 0.001, ****p < 0.0001. Given that clinical characteristics such as tumor stage and TNM classification did not differ significantly for patients with ccRCC (Table 1), we reasoned that the survival impact could partially be due to an inadequate therapy stratification. Using The Cancer Proteome Atlas (TCPA) (16, 19), we therefore analyzed the protein expression of bona fide gene candidates related to mTOR and PI3K/Akt signaling, angiogenesis and immune signaling (Figure 5D). Regarding ccRCC as well as pRCC samples, we found a significant downregulation of VEGFR2 and HIF1A protein expression in mixed subgroup samples. For both subgroups, this downregulation of angiogenesis-related genes was accompanied by a significant upregulation of PD-L1 expression. Moreover, protein expression of TSC1 and PTEN was downregulated in mixed subgroup samples. While pRCC samples from our novel cluster exhibited a significant mTOR downregulation, the slight increase in mTOR protein expression of ccRCC samples from mixed subgroup was not significant. Potentially due to lower sample numbers, TCPA analysis revealed no significant expression differences for chRCC specimen. In summary, we found a highly significant and clinically relevant influence of mixed subgroup affiliation in RCC patients from the TCGA database—with a better prognosis for chRCC and a worse overall survival for ccRCC patients.

Discussion

Classifying cancer tissue into three histopathologic subgroups—clear cell, papillary and chromophobe—critically determines treatment strategies and prognosis of RCC patients. However, growing evidence highlights that this classification is not absolute nor distinct. Instead, the WHO system of renal cell tumors from 2016 contained several additional subgroups, such as succinate dehydrogenase-deficient renal carcinoma and clear cell papillary RCC (1). Previous functional analyses on RCC mainly focused on isolated gene signatures, which were characteristic and prognostic for single histopathologic subgroups (4, 5, 20, 21)—e.g., ClearCode34 (22) for determining the individual risk of recurrence in localized ccRCC. Moreover, researchers aimed to identify biomarkers and gene networks predictive of future therapy response—especially for angiogenesis inhibition, tyrosine kinase inhibition (TKI) and immune checkpoint blockade (23–27). Interestingly, a recent study was able to discriminate ccRCC and pRCC samples originating from proximal tubules of the nephron from chRCC specimen originating from distal tubules based on the metabolic and lipidomic profile of the samples (28).

Pan-RCC Clustering Identifies Subgroup Beyond Established Histopathologic Classification

While most studies focused on gene signatures within given histopathologic boundaries, we aimed to challenge the absoluteness and robustness of RCC subgroup classification. In our pan-RCC approach, we performed a clustering for all RCC specimen from the TCGA database. Of note, a substantial number of RCC samples clustered independently from histopathologic origin. We called this cluster mixed subgroup. Conferring samples inside and outside the mixed subgroup, ccRCC patients exhibited no significant differences regarding age, gender, tumor stage and TNM classification. In contrast, grading of tumor samples appeared significantly different, partially due to a higher proportion of G1 tumors in the mixed subgroup. pRCC patients from this cluster were significantly older than the remainder of the group. Moreover, the proportion of male patients was higher inside the mixed subgroup. All other clinical characteristics did not differ significantly. Patients with chRCC within the cluster were significantly younger and had a higher proportion of N0 patients.

ML Reveals Mitochondrial Genes as Most Influential for Pan-RCC Clustering

For further functional characterization, we applied RF learning in order to identify gene signatures most predictive for the novel clusters. This ML approach revealed mitochondrial genes to be most influential for the basic clustering, followed by genes related to angiogenesis. As mtDNA overexpression is a reported feature of chRCC (5, 17, 18), we had to rule out a systematic bias caused by the high proportion of chRCC samples within the mixed subgroup. Therefore, we analyzed mtDNA expression in all RCC subgroups depending on subgroup affiliation. Of note, ccRCC, pRCC, and chRCC specimen belonging to the mixed subgroup all displayed significantly higher levels of mtDNA expression compared to the counterparts outside this cluster. In ccRCC, this mtDNA upregulation went along with a downregulation of angiogenesis-related genes. Taking these results together led to significantly negative correlations between mitochondrial and angiogenesis signatures—not only in ccRCC samples from TCGA but also in the RECA-EU and CPTAC-3-Kidney cohorts taken as external validation. Moreover, comparable results from the GSE157256 cohort representing fumarate hydratase-deficient RCC could imply a general underlying mechanism beyond RCC subgroups. A pan-RCC subgroup characterized by a prominent mtDNA signature appeared surprising at first sight. Although aberration in mitochondrial signaling is known across RCC subgroups, these deviations are not considered being unidirectional toward an upregulation of mitochondrial transcripts and mitochondrial mass (29). While mtDNA aberrations and overexpression are mainly regarded as a characteristic trait of chRCC tissue (5, 17, 18), downregulation of mitochondrial enzymes with increasing tumor stages and decreased oxidative capacity were previously reported for ccRCC (30–32). However, growing evidence indicates that mtDNA can also have oncogenic functions, thereby appearing as a potential (co-)target in future cancer therapies (33). Specifically, researchers showed that tumor cells lacking mtDNA could not metastasize in vivo—after restoration of mtDNA levels, cancer cells regained this ability (34). In line with these findings, Schöpf et al. demonstrated the importance of oxidative phosphorylation in high-grade prostate cancer by describing a high-risk subgroup characterized by a distinct mitochondrial signature (35). Given the established role of angiogenesis and angiogenesis-related genes such as VEGFR2 in high-risk prostate cancer (36, 37), further examining the interaction of mitochondrial and angiogenesis pathways in prostate cancer could prove beneficial.

Assessing Prognosis and Therapeutic Windows for Mixed Subgroup Patients

Importantly, ccRCC patients inside the mixed subgroup suffered from significantly worse overall survival. This result was even more surprising given the non-significant differences in TNM stage between both subgroups. For pRCC patients, we did not find significant survival differences regarding mixed subgroup affiliation. However, we identified three distinct pRCC clusters. This result surely merits further investigation regarding functions and prognosis of each pRCC cluster. In contrast, patients with chRCC belonging to the mixed subgroup exhibited a significantly longer overall survival. In summary, survival data from ccRCC and chRCC patients underline the role of the mixed subgroup as a novel prognostic RCC cluster identified by our comprehensive clustering approach. Regarding the striking survival impact in ccRCC combined with non-differing clinical characteristics, it was tempting to assume that diverging outcomes were at least partially treatment-related. First, addressing mtDNA overexpression appears as an attractive therapeutic approach, as compounds such as the anthelmintic drug atovaquone (38) and the antibiotic tigecycline (39) were shown to chemo-sensitize RCC. Moreover, our results could explain the beneficial effect of additional Metformin intake during treatment of metastatic RCC, as this biguanide works as a mitochondrial inhibitor (40–42). Above all, mTOR inhibitors—which already have the approval for treating RCC—appear as promising candidates for mixed subgroup RCC patients. mTOR signaling is tightly linked to mitochondrial function (43, 44). Recently, a mitochondrial complex I inhibitor (IACS-010759) targeting oxidative phosphorylation in cancer cells showed efficacy in brain cancer and leukemia models (45). Given the downregulation of angiogenesis-related gene signatures in ccRCC patients from the mixed subgroup, inhibition of angiogenesis and TKI do not appear as attractive first-line approaches for these patients. Indeed, the predominant use of TKI within the historical RCC cohort from the TCGA database may partially explain the striking survival differences observed in this analysis. Supporting our findings from RNA-sequencing, protein levels of VEGFR2 and HIF2A were lowered in the mixed subgroup for ccRCC as well as pRCC patients. Regarding the downregulation of c-MET in ccRCC and pRCC samples from the mixed subgroup, treatment with MET (co-)inhibitors such as Cabozantinib (46) does not appear promising, either. Further research could clarify whether immune checkpoint inhibition constitutes a viable treatment strategy in our new cluster. At first sight, highly significant protein overexpression of PD-L1 in ccRCC and pRCC patients from the mixed subgroup makes it an attractive therapeutic target. However, unlike in entities as melanoma and non-small cell lung cancer (47), it is still unclear whether PD-L1 overexpression in RCC results in better response to immunotherapy (48). Completely in line with our findings, several clinical trials already stated that PD-L1 overexpression marked a RCC high-risk cohort (48, 49). Our study has some limitations regarding its methodology and its retrospective nature. We are aware that manual cluster annotation approaches naturally contain immanent biases. Moreover, our findings derive from the re-analysis of historic cohorts and require further—ideally prospective—validation in future studies. Essentially, we identified a high-risk ccRCC subgroup best described by a mitochondrial signature and a downregulation of angiogenesis-related genes, which was not exclusive to one RCC subgroup. Although preliminary, these results could contribute to an individual risk classifier based on transcriptomic data from patients' samples and help establishing personalized medicine in RCC.

Data Availability Statement

Publicly available datasets were analyzed in this study. This data can be found here: https://portal.gdc.cancer.gov/projects https://dcc.icgc.org/projects/RECA-EU https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE157256 https://portal.gdc.cancer.gov/projects/CPTAC-3.

Author Contributions

AM, AGS, BS, SM, MK: conceptualization. AM, PK: methodology. AM, AGS, AK, MB, MK: writing – draft preparation. AM, AGS, MB, CK, HK, AR, RB, PK, BS, SM, MK: writing – review and editing. HK, AR, RB, BS, SM: supervision/funding/infrastructure. All authors contributed to the article and approved the submitted version.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
  46 in total

1.  Cabozantinib (XL184), a novel MET and VEGFR2 inhibitor, simultaneously suppresses metastasis, angiogenesis, and tumor growth.

Authors:  F Michael Yakes; Jason Chen; Jenny Tan; Kyoko Yamaguchi; Yongchang Shi; Peiwen Yu; Fawn Qian; Felix Chu; Frauke Bentzien; Belinda Cancilla; Jessica Orf; Andrew You; A Douglas Laird; Stefan Engst; Lillian Lee; Justin Lesch; Yu-Chien Chou; Alison H Joly
Journal:  Mol Cancer Ther       Date:  2011-09-16       Impact factor: 6.261

2.  Explore, Visualize, and Analyze Functional Cancer Proteomic Data Using the Cancer Proteome Atlas.

Authors:  Jun Li; Rehan Akbani; Wei Zhao; Yiling Lu; John N Weinstein; Gordon B Mills; Han Liang
Journal:  Cancer Res       Date:  2017-11-01       Impact factor: 12.701

3.  PD-L1 Expression and Clinical Outcomes to Cabozantinib, Everolimus, and Sunitinib in Patients with Metastatic Renal Cell Carcinoma: Analysis of the Randomized Clinical Trials METEOR and CABOSUN.

Authors:  Toni K Choueiri; Sabina Signoretti; Abdallah Flaifel; Wanling Xie; David A Braun; Miriam Ficial; Ziad Bakouny; Amin H Nassar; Rebecca B Jennings; Bernard Escudier; Daniel J George; Robert J Motzer; Michael J Morris; Thomas Powles; Evelyn Wang; Ying Huang; Gordon J Freeman
Journal:  Clin Cancer Res       Date:  2019-08-01       Impact factor: 12.531

4.  Effect of Metformin Use on Survival Outcomes in Patients With Metastatic Renal Cell Carcinoma.

Authors:  Lana Hamieh; Rana R McKay; Xun Lin; Raphael B Moreira; Ronit Simantov; Toni K Choueiri
Journal:  Clin Genitourin Cancer       Date:  2016-06-29       Impact factor: 2.872

5.  Comprehensive Molecular Characterization of Papillary Renal-Cell Carcinoma.

Authors:  W Marston Linehan; Paul T Spellman; Christopher J Ricketts; Chad J Creighton; Suzanne S Fei; Caleb Davis; David A Wheeler; Bradley A Murray; Laura Schmidt; Cathy D Vocke; Myron Peto; Abu Amar M Al Mamun; Eve Shinbrot; Anurag Sethi; Samira Brooks; W Kimryn Rathmell; Angela N Brooks; Katherine A Hoadley; A Gordon Robertson; Denise Brooks; Reanne Bowlby; Sara Sadeghi; Hui Shen; Daniel J Weisenberger; Moiz Bootwalla; Stephen B Baylin; Peter W Laird; Andrew D Cherniack; Gordon Saksena; Scott Haake; Jun Li; Han Liang; Yiling Lu; Gordon B Mills; Rehan Akbani; Mark D M Leiserson; Benjamin J Raphael; Pavana Anur; Donald Bottaro; Laurence Albiges; Nandita Barnabas; Toni K Choueiri; Bogdan Czerniak; Andrew K Godwin; A Ari Hakimi; Thai H Ho; James Hsieh; Michael Ittmann; William Y Kim; Bhavani Krishnan; Maria J Merino; Kenna R Mills Shaw; Victor E Reuter; Ed Reznik; Carl S Shelley; Brian Shuch; Sabina Signoretti; Ramaprasad Srinivasan; Pheroze Tamboli; George Thomas; Satish Tickoo; Kenneth Burnett; Daniel Crain; Johanna Gardner; Kevin Lau; David Mallery; Scott Morris; Joseph D Paulauskis; Robert J Penny; Candace Shelton; W Troy Shelton; Mark Sherman; Eric Thompson; Peggy Yena; Melissa T Avedon; Jay Bowen; Julie M Gastier-Foster; Mark Gerken; Kristen M Leraas; Tara M Lichtenberg; Nilsa C Ramirez; Tracie Santos; Lisa Wise; Erik Zmuda; John A Demchok; Ina Felau; Carolyn M Hutter; Margi Sheth; Heidi J Sofia; Roy Tarnuzzer; Zhining Wang; Liming Yang; Jean C Zenklusen; Jiashan Zhang; Brenda Ayala; Julien Baboud; Sudha Chudamani; Jia Liu; Laxmi Lolla; Rashi Naresh; Todd Pihl; Qiang Sun; Yunhu Wan; Ye Wu; Adrian Ally; Miruna Balasundaram; Saianand Balu; Rameen Beroukhim; Tom Bodenheimer; Christian Buhay; Yaron S N Butterfield; Rebecca Carlsen; Scott L Carter; Hsu Chao; Eric Chuah; Amanda Clarke; Kyle R Covington; Mahmoud Dahdouli; Ninad Dewal; Noreen Dhalla; Harsha V Doddapaneni; Jennifer A Drummond; Stacey B Gabriel; Richard A Gibbs; Ranabir Guin; Walker Hale; Alicia Hawes; D Neil Hayes; Robert A Holt; Alan P Hoyle; Stuart R Jefferys; Steven J M Jones; Corbin D Jones; Divya Kalra; Christie Kovar; Lora Lewis; Jie Li; Yussanne Ma; Marco A Marra; Michael Mayo; Shaowu Meng; Matthew Meyerson; Piotr A Mieczkowski; Richard A Moore; Donna Morton; Lisle E Mose; Andrew J Mungall; Donna Muzny; Joel S Parker; Charles M Perou; Jeffrey Roach; Jacqueline E Schein; Steven E Schumacher; Yan Shi; Janae V Simons; Payal Sipahimalani; Tara Skelly; Matthew G Soloway; Carrie Sougnez; Angela Tam; Donghui Tan; Nina Thiessen; Umadevi Veluvolu; Min Wang; Matthew D Wilkerson; Tina Wong; Junyuan Wu; Liu Xi; Jane Zhou; Jason Bedford; Fengju Chen; Yao Fu; Mark Gerstein; David Haussler; Katayoon Kasaian; Phillip Lai; Shiyun Ling; Amie Radenbaugh; David Van Den Berg; John N Weinstein; Jingchun Zhu; Monique Albert; Iakovina Alexopoulou; Jeremiah J Andersen; J Todd Auman; John Bartlett; Sheldon Bastacky; Julie Bergsten; Michael L Blute; Lori Boice; Roni J Bollag; Jeff Boyd; Erik Castle; Ying-Bei Chen; John C Cheville; Erin Curley; Benjamin Davies; April DeVolk; Rajiv Dhir; Laura Dike; John Eckman; Jay Engel; Jodi Harr; Ronald Hrebinko; Mei Huang; Lori Huelsenbeck-Dill; Mary Iacocca; Bruce Jacobs; Michael Lobis; Jodi K Maranchie; Scott McMeekin; Jerome Myers; Joel Nelson; Jeremy Parfitt; Anil Parwani; Nicholas Petrelli; Brenda Rabeno; Somak Roy; Andrew L Salner; Joel Slaton; Melissa Stanton; R Houston Thompson; Leigh Thorne; Kelinda Tucker; Paul M Weinberger; Cynthia Winemiller; Leigh Anne Zach; Rosemary Zuna
Journal:  N Engl J Med       Date:  2015-11-04       Impact factor: 91.245

6.  Integrated Proteogenomic Characterization of Clear Cell Renal Cell Carcinoma.

Authors:  David J Clark; Saravana M Dhanasekaran; Francesca Petralia; Jianbo Pan; Xiaoyu Song; Yingwei Hu; Felipe da Veiga Leprevost; Boris Reva; Tung-Shing M Lih; Hui-Yin Chang; Weiping Ma; Chen Huang; Christopher J Ricketts; Lijun Chen; Azra Krek; Yize Li; Dmitry Rykunov; Qing Kay Li; Lin S Chen; Umut Ozbek; Suhas Vasaikar; Yige Wu; Seungyeul Yoo; Shrabanti Chowdhury; Matthew A Wyczalkowski; Jiayi Ji; Michael Schnaubelt; Andy Kong; Sunantha Sethuraman; Dmitry M Avtonomov; Minghui Ao; Antonio Colaprico; Song Cao; Kyung-Cho Cho; Selim Kalayci; Shiyong Ma; Wenke Liu; Kelly Ruggles; Anna Calinawan; Zeynep H Gümüş; Daniel Geiszler; Emily Kawaler; Guo Ci Teo; Bo Wen; Yuping Zhang; Sarah Keegan; Kai Li; Feng Chen; Nathan Edwards; Phillip M Pierorazio; Xi Steven Chen; Christian P Pavlovich; A Ari Hakimi; Gabriel Brominski; James J Hsieh; Andrzej Antczak; Tatiana Omelchenko; Jan Lubinski; Maciej Wiznerowicz; W Marston Linehan; Christopher R Kinsinger; Mathangi Thiagarajan; Emily S Boja; Mehdi Mesri; Tara Hiltke; Ana I Robles; Henry Rodriguez; Jiang Qian; David Fenyö; Bing Zhang; Li Ding; Eric Schadt; Arul M Chinnaiyan; Zhen Zhang; Gilbert S Omenn; Marcin Cieslik; Daniel W Chan; Alexey I Nesvizhskii; Pei Wang; Hui Zhang
Journal:  Cell       Date:  2019-10-31       Impact factor: 41.582

7.  STRING v11: protein-protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets.

Authors:  Damian Szklarczyk; Annika L Gable; David Lyon; Alexander Junge; Stefan Wyder; Jaime Huerta-Cepas; Milan Simonovic; Nadezhda T Doncheva; John H Morris; Peer Bork; Lars J Jensen; Christian von Mering
Journal:  Nucleic Acids Res       Date:  2019-01-08       Impact factor: 16.971

Review 8.  Mitochondrial DNA: the overlooked oncogenome?

Authors:  Payam A Gammage; Christian Frezza
Journal:  BMC Biol       Date:  2019-07-08       Impact factor: 7.431

9.  miR-221-3p Regulates VEGFR2 Expression in High-Risk Prostate Cancer and Represents an Escape Mechanism from Sunitinib In Vitro.

Authors:  Markus Krebs; Antonio Giovanni Solimando; Charis Kalogirou; André Marquardt; Torsten Frank; Ioannis Sokolakis; Georgios Hatzichristodoulou; Susanne Kneitz; Ralf Bargou; Hubert Kübler; Bastian Schilling; Martin Spahn; Burkhard Kneitz
Journal:  J Clin Med       Date:  2020-03-02       Impact factor: 4.241

10.  Identification of gene signature for treatment response to guide precision oncology in clear-cell renal cell carcinoma.

Authors:  Ninadh M D'Costa; Davide Cina; Raunak Shrestha; Robert H Bell; Yen-Yi Lin; Hossein Asghari; Cesar U Monjaras-Avila; Christian Kollmannsberger; Faraz Hach; Claudia I Chavez-Munoz; Alan I So
Journal:  Sci Rep       Date:  2020-02-06       Impact factor: 4.379

View more
  15 in total

Review 1.  Phytochemicals for the Prevention and Treatment of Renal Cell Carcinoma: Preclinical and Clinical Evidence and Molecular Mechanisms.

Authors:  Essa M Bajalia; Farah B Azzouz; Danielle A Chism; Derrek M Giansiracusa; Carina G Wong; Kristina N Plaskett; Anupam Bishayee
Journal:  Cancers (Basel)       Date:  2022-07-04       Impact factor: 6.575

2.  Visualizing the Interactions Shaping the Imaging of the Microenvironment in Human Cancers.

Authors:  Antonio G Solimando; Vanessa Desantis; Matteo Claudio Da Vià
Journal:  Methods Mol Biol       Date:  2023

3.  Identification of TRPM2 as a Marker Associated With Prognosis and Immune Infiltration in Kidney Renal Clear Cell Carcinoma.

Authors:  Lei Sun; Zijun Zhang; Hang Zhao; Miaoyun Qiu; Ying Wen; Xiaoqiang Yao; Wai Ho Tang
Journal:  Front Mol Biosci       Date:  2022-01-05

4.  A Novel Prognostic Predictor of Immune Microenvironment and Therapeutic Response in Kidney Renal Clear Cell Carcinoma based on Necroptosis-related Gene Signature.

Authors:  Wenwei Chen; Wenfeng Lin; Liang Wu; Abai Xu; Chunxiao Liu; Peng Huang
Journal:  Int J Med Sci       Date:  2022-01-24       Impact factor: 3.738

5.  Determination of Exosome Mitochondrial DNA as a Biomarker of Renal Cancer Aggressiveness.

Authors:  Elena Arance; Viviana Ramírez; Alejandro Rubio-Roldan; Francisco M Ocaña-Peinado; Catalina Romero-Cachinero; Ana Belén Jódar-Reyes; Fernando Vazquez-Alonso; Luis Javier Martinez-Gonzalez; Maria Jesus Alvarez-Cubero
Journal:  Cancers (Basel)       Date:  2021-12-31       Impact factor: 6.639

6.  Malignancy and NF-κB signalling strengthen coordination between expression of mitochondrial and nuclear-encoded oxidative phosphorylation genes.

Authors:  Marcos Francisco Perez; Peter Sarkies
Journal:  Genome Biol       Date:  2021-12-02       Impact factor: 17.906

7.  LINC01232 serves as a novel biomarker and promotes tumour progression by sponging miR-204-5p and upregulating RAB22A in clear cell renal cell carcinoma.

Authors:  Qingling Liu; Chengbin Lei
Journal:  Ann Med       Date:  2021-12       Impact factor: 4.709

8.  A Mitochondrial Dysfunction and Oxidative Stress Pathway-Based Prognostic Signature for Clear Cell Renal Cell Carcinoma.

Authors:  Yue Wu; Xi Zhang; Xian Wei; Huan Feng; Bintao Hu; Zhiyao Deng; Bo Liu; Yang Luan; Yajun Ruan; Xiaming Liu; Zhuo Liu; Jihong Liu; Tao Wang
Journal:  Oxid Med Cell Longev       Date:  2021-11-24       Impact factor: 6.543

9.  Identification of CCL4 as an Immune-Related Prognostic Biomarker Associated With Tumor Proliferation and the Tumor Microenvironment in Clear Cell Renal Cell Carcinoma.

Authors:  Lu Zhang; Mengzhao Zhang; Lu Wang; Jianlong Li; Tao Yang; Qiuya Shao; Xiao Liang; Minghai Ma; Nan Zhang; Minxuan Jing; Rundong Song; Jinhai Fan
Journal:  Front Oncol       Date:  2021-11-24       Impact factor: 6.244

10.  Identification and Validation of PIK3CA as a Marker Associated with Prognosis and Immune Infiltration in Renal Clear Cell Carcinoma.

Authors:  Ya Li; Chong Wang; Yang Gao; Liang Zhou
Journal:  J Oncol       Date:  2021-07-27       Impact factor: 4.375

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.