| Literature DB >> 33791209 |
André Marquardt1,2,3, Antonio Giovanni Solimando4,5, Alexander Kerscher1, Max Bittrich6, Charis Kalogirou7, Hubert Kübler7, Andreas Rosenwald2, Ralf Bargou1, Philip Kollmannsberger8, Bastian Schilling9, Svenja Meierjohann2,3, Markus Krebs1,7.
Abstract
Background: Renal cell carcinoma (RCC) is divided into three major histopathologic groups-clear cell (ccRCC), papillary (pRCC) and chromophobe RCC (chRCC). We performed a comprehensive re-analysis of publicly available RCC datasets from the TCGA (The Cancer Genome Atlas) database, thereby combining samples from all three subgroups, for an exploratory transcriptome profiling of RCC subgroups. Materials andEntities:
Keywords: kidney cancer; mTOR; machine learning; mitochondrial DNA; mtDNA; pan-RCC
Year: 2021 PMID: 33791209 PMCID: PMC8005734 DOI: 10.3389/fonc.2021.621278
Source DB: PubMed Journal: Front Oncol ISSN: 2234-943X Impact factor: 6.244
Figure 1(A) t-SNE-plot for RNA-sequencing data from ccRCC (red), pRCC (green) and chRCC (blue) specimen within the TCGA database. (B) Visually identified clusters—I to III: distinct pRCC subgroups; IV: ccRCC samples; V: mixed subgroup containing ccRCC, pRCC and chRCC tumors. (C) Manually defined clusters based on visual separation. (D) Pie charts illustrating absolute numbers and proportions of RCC samples inside/outside the mixed subgroup for each RCC subgroup.
Clinical characteristics of RCC patients inside and outside the mixed subgroup.
| mean | 60.2 ± 2.2 | 62.0 ± 11.8 | 0.196 | 59.6 ± 12.1 | 65.1 ± 10.9 | 61.9 ± 12.9 | 49.6 ± 13.2 | |||
| m | 273 (63.79%) | 71 (69.61%) | 0.269 | 123 (67.58%) | 89 (84.76%) | 10 (83.33%) | 29 (54.72%) | 0.07 | ||
| f | 155 (36.21%) | 31 (30.39%) | 59 (32.42%) | 16 (15.24%) | 2 (16.67%) | 24 (45.28%) | ||||
| I | 223 (52.47%) | 42 (64.29%) | 0.166 | 108 (64,70%) | 64 (38,32%) | 0.419 | 2 (16.67%) | 18 (33.96%) | 0.1 | |
| II | 43 (10.12%) | 14 (8.33%) | 16 (9,60%) | 4 (2,40%) | 5 (41.67%) | 20 (37.74%) | ||||
| III | 90 (21.18%) | 33 (19.64%) | 34 (20,30%) | 16 (9,58%) | 1 (8.33%) | 13 (24.53%) | ||||
| IV | 69 (16.23%) | 13 (7.74%) | 9 (5,40%) | 6 (3,60%) | 4 (33.33%) | 2 (3.77%) | ||||
| T1 | 228 (53.27%) | 43 (42.57%) | 0.092 | 119 (64.67%) | 74 (70.48%) | 0.463 | 2 (16.67%) | 18 (33.962%) | 0.14 | |
| T2 | 53 (12.38%) | 16 (15.84%) | 22 (11.96%) | 10 (9.52%) | 5 (41.67%) | 20 (37.74%) | ||||
| T3 | 137 (32.00%) | 41 (40.59%) | 39 (21.20%) | 20 (19.05%) | 3 (25%) | 15 (28.30%) | ||||
| T4 | 10 (2.33%) | 1 (0.99%) | 4 (2.17%) | 1 (0.95%) | 2 (16.66%) | 0 (0%) | ||||
| N0 | 192 (93.66%) | 47 (94%) | 0.929 | 29 (59.18%) | 20 (71.43%) | 0.21 | 4 (57.14%) | 35 (94.60%) | ||
| N1 | 13 (6.34%) | 3 (6%) | 16 (32.66%) | 8 (28.57%) | 2 (28.57%) | 1 (2.7%) | ||||
| N2 | 0 (0%) | 0 (0%) | 4 (8.16%) | 0 (0%) | 1 (14.29%) | 1 (2.7%) | ||||
| M0 | 19 (90.48%) | 3 (75%) | 0.392 | 60 (63.16%) | 35 (89.74%) | 0.654 | 4 (80%) | 3 (75%) | 0.866 | |
| M1 | 2 (9.52%) | 1 (25%) | 35 (36.84%) | 4 (10.26%) | 1 (20%) | 1 (25%) | ||||
| G1 | 13 (3.06%) | 13 (11.93%) | ||||||||
| G2 | 195 (45.88%) | 32 (29.36%) | ||||||||
| G3 | 158 (37.18%) | 48 (44.04%) | ||||||||
| G4 | 59 (13.88%) | 16 (14.67%) |
Except for age (mean ± standard deviation), all characteristics were presented as absolute values. p-values highlighted as bold were significant for p < 0.05.
Figure 2StringDB network of the top 200 genes identified as relevant classifiers for RCC sample clusters from Figure 1C. Genes affiliated with oxidative phosphorylation and respiratory electron transport chain are marked in red and blue, genes related to blood vessel morphogenesis and blood vessel development are marked in green and yellow.
Gene families significantly overrepresented in the top 200 cluster classifying genes from Random Forest (RF) analysis.
| MT-CYB | ENSG00000198727 | 1 | ETS1 | ENSG00000134954 | 13 |
| MT-ND4 | ENSG00000198886 | 2 | ANGPT2 | ENSG00000091879 | 33 |
| MT-CO1 | ENSG00000198804 | 3 | APLN | ENSG00000171388 | 37 |
| MT-CO3 | ENSG00000198938 | 4 | FLT1 | ENSG00000102755 | 38 |
| MT-CO2 | ENSG00000198712 | 5 | CRKL | ENSG00000099942 | 46 |
| MT-ND4L | ENSG00000212907 | 6 | ITGA5 | ENSG00000161638 | 54 |
| MT-ATP6 | ENSG00000198899 | 7 | NRP1 | ENSG00000099250 | 56 |
| MT-RNR1 | ENSG00000211459 | 8 | PRDM1 | ENSG00000057657 | 93 |
| MTATP6P1 | ENSG00000248527 | 9 | PTEN | ENSG00000171862 | 109 |
| MT-ND1 | ENSG00000198888 | 10 | VEGFA | ENSG00000112715 | 112 |
| MT-ND2 | ENSG00000198763 | 20 | ACKR3 | ENSG00000144476 | 114 |
| MT-ND3 | ENSG00000198840 | 24 | CDH13 | ENSG00000140945 | 146 |
| MT-RNR2 | ENSG00000210082 | 25 | BMPR2 | ENSG00000204217 | 148 |
| CALCRL | ENSG00000064989 | 177 | |||
| ESM1 | ENSG00000164283 | 191 | |||
For each gene, HGNC symbol, Ensembl gene IDs, and the position in our calculation is shown.
Figure 3Unprocessed FPKM values of exemplary candidate genes–(A,B) MT-CO2 and MT-CO3, (C,D) FLT1 and KDR. ns, not significant, ****p < 0.0001.
Figure 4Color-coded presentation of the Pearson R correlation matrix of mitochondrial genes and angiogenesis-associated genes for ccRCC samples from the (A) TCGA, (B) the ICGC RECA-EU, and (C) the CPTAC-3-Kidney cohort as well as (D) Fumarate hydratase-deficient RCC samples contained within the GSE157256 cohort.
Figure 5(A,B) KM plots illustrating overall survival of patients with ccRCC (A), chRCC (B) and pRCC (C) from TCGA database depending on mixed subgroup affiliation. (D) Protein expression levels of bona fide candidate genes from mTOR-associated, angiogenesis-related and immune-related signaling for ccRCC, pRCC and chRCC samples inside (blue) and outside (red) the mixed subgroup (TCPA database). ns, not significant. *p < 0.05, **p < 0.01, ***p < 0.001, ****p < 0.0001.