| Literature DB >> 36109798 |
Florian A Büttner1,2, Stefan Winter1,2, Jens Bedke3,4, Matthias Schwab5,6,7,8,9, Elke Schaeffeler1,2,4,10, Viktoria Stühler3, Steffen Rausch3, Jörg Hennenlotter3, Susanne Füssel11, Stefan Zastrow11, Matthias Meinhardt12, Marieta Toma12,13, Carmen Jerónimo14,15,16, Rui Henrique14,15,16, Vera Miranda-Gonçalves14,15,16, Nils Kröger17, Silvia Ribback18, Arndt Hartmann19, Abbas Agaimy19, Christine Stöhr19, Iris Polifka19, Falko Fend20, Marcus Scharpf20, Eva Comperat21, Gabriel Wasinger21, Holger Moch22, Arnulf Stenzl3, Marco Gerlinger23.
Abstract
BACKGROUND: Renal cell carcinoma (RCC) is a heterogeneous disease comprising histologically defined subtypes. For therapy selection, precise subtype identification and individualized prognosis are mandatory, but currently limited. Our aim was to refine subtyping and outcome prediction across main subtypes, assuming that a tumor is composed of molecular features present in distinct pathological subtypes.Entities:
Keywords: Cancer-specific survival; Gene expression deconvolution; Immunotherapy; RCC subtypes; Renal cell carcinoma
Mesh:
Substances:
Year: 2022 PMID: 36109798 PMCID: PMC9476269 DOI: 10.1186/s13073-022-01105-y
Source DB: PubMed Journal: Genome Med ISSN: 1756-994X Impact factor: 15.266
Fig. 1Overview of the general data analysis workflow and the use of the different cohorts. RNA quantification technologies, cohort compositions, and tissue preparation techniques used are given. FF fresh-frozen, FFPE formalin-fixed and paraffin-embedded
Characteristics of the discovery and the validation cohorts
| Male | 579 | 67.0 | 165 | 68.2 |
| Female | 285 | 33.0 | 77 | 31.8 |
| 1 | 476 | 55.1 | 124 | 51.2 |
| 2 | 122 | 14.1 | 21 | 8.7 |
| 3 | 251 | 29.1 | 92 | 38 |
| 4 | 13 | 1.5 | 4 | 1.7 |
| NA | 2 | 0.2 | 1 | 0.4 |
| 0 | 322 | 37.3 | 173 | 71.5 |
| 1/2 | 46 | 5.3 | 26 | 10.7 |
| X | 495 | 57.3 | 40 | 16.5 |
| NA | 1 | 0.1 | 3 | 1.2 |
| 0 | 537 | 62.2 | 170 | 70.2 |
| 1 | 82 | 9.5 | 38 | 15.7 |
| X | 209 | 24.2 | 33 | 13.6 |
| NA | 36 | 4.2 | 1 | 0.4 |
| ccRCC | 512 | 59.3 | 134 | 55.4 |
| pRCC | 287 | 33.2 | 86 | 35.5 |
| chRCC | 65 | 7.5 | 16 | 6.6 |
| mixed | 0 | 0.0 | 6 | 2.5 |
| Alive | 648 | 75.0 | 161 | 66.5 |
| Deceased | 216 | 25.0 | 81 | 33.5 |
| Censored | 714 | 82.6 | 188 | 77.7 |
| Events | 135 | 15.6 | 54 | 22.3 |
| NA | 15 | 1.7 | 0 | 0.0 |
| Median | 3.0 | 4.8 | ||
| Range | 0 to 16.2 | 0 to 21.2 | ||
| NA | 2 | 0.2 | 0 | 0.0 |
| Median | 60 | 64 | ||
| Range | 17 to 90 | 25 to 90 | ||
| NA | 3 | 0.3 | 0 | 0.0 |
| Median | 5.1 | 5.8 | ||
| Range | 1 to 25 | 1.3 to 17.7 | ||
| NA | 105 | 12.2 | 2 | 0.8 |
NA Not available, CSS Cancer-specific survival
Fig. 2Proportional subtype assignment (PSA) of RCC and RCC cell lines. A, B Principal component analysis of the TCGA RCC cohort (C3) using expression data of the 174 signature genes. A TCGA cohorts of ccRCC (KIRC, n = 512), pRCC (KIRP, n = 287), and chRCC (KICH, n = 65) are displayed. B PSA were determined for tumors of C3 by computational deconvolution. A total of 246 RCC samples with maximum PSA values below 95% were considered as potential heterogeneous tumors (enlarged symbols). Their molecular subtype composition based on PSA is visualized by pie charts. Nineteen samples with as determined by a permutation P-value approach are displayed by shaded pie charts with gray borders. C PSA for 14 RCC-derived cell lines were calculated using transcriptomic data as provided by the Broad-Novartis Cancer Cell Line Encyclopedia (CCLE) [37, 38] as well as the COSMIC Cell Lines Project (CCLP) [39, 40]. Asterisk indicates
Fig. 3Distribution of PSA in histologically defined RCC subtypes. A–C Distributions of assigned proportions of ccRCC (A), pRCC (B), and chRCC (C) to tumors of C3 are shown for distinct, pathologically defined subgroups including 469 ccRCC, 270 pRCC including 159 pRCC T1 and 78 pRCC T2, and 80 chRCC, respectively. Tumors with a maximum PSA value ≥ 95% are colored. Furthermore, ten tumors with the CpG island methylator phenotype (CIMP), a molecular pRCC subtype with a specific methylation profile, are indicated, as well as six metabolically divergent (MD) chRCC. Samples with are marked in gray. Boxes refer to median and interquartile ranges with whiskers extending to a maximum of 1.5 times the interquartile range. D The information content of different subtype classifications was quantified by determining the amount of variance they explained in gene expression data. Expression of 25,208 genes in 623 tumors consisting of 466 ccRCC, 116 pRCC, and 41 chRCC that were repeatedly resampled (with replacement) from 805 cases of C3 was analyzed. Points and error bars indicate the mean together with the 95% value range of the resampling distribution. “Cohort” refers to the TCGA cohorts KIRC (n = 479), KIRP (n = 266), and KICH (n = 60). “Path.cat” comprises ccRCC (n = 466), pRCC (n = 266), and chRCC (n = 73) subgroups. Additionally, the combination of Path.cat and PSA was evaluated. Samples with were not considered here. E, F Relationship between PSA and computational histopathology. The mean pairwise Manhattan distance between 50 randomly selected tiles per tumor tissue slide was used as a measure of histopathological complexity. A circle represents one tissue slide, and multiple slides may be present per tumor. The values in parentheses indicate the number of the slides and associated tumors. Samples with were not considered here. E Histological complexity is displayed in dependence on the ccRCC proportion. Slides from samples classified as either chRCC or with chRCC proportion above 5% were excluded. Colors indicate the pathological classification, and the dashed lines display the mean distance of the respective set of slides. The Pearson correlation coefficient (PCC) was calculated. F Per pathologically defined RCC subgroup (Path.cat), histopathological complexity was compared between tumors with maximum PSA value ≥ 95% (filled circles) and potential heterogeneous tumors with maximum PSA value < 95% (open circles) using the t-test. Boxes refer to median and interquartile ranges with whiskers extending to a maximum of 1.5 times the interquartile range
Fig. 4Subtype prediction through PSA in formalin-fixed and paraffin-embedded (FFPE) tissue. A PSA based on fresh-frozen (FF) samples from 9 RCC were compared to PSA based on matching FFPE samples. Whole-transcriptome profiles generated by RNA-Seq for two ccRCC were obtained from Li et al. (marked by asterisks) [82]. The remaining 7 tumors (pRCC) from the present study have been analyzed using microarray technology. For all 18 samples, was below 0.05. B–D PSA were determined for 92 FPPE tissues of cohort C4. Gene expression was quantified using microarray technology. The assigned proportions of molecular features of ccRCC (B), pRCC (C), and chRCC (D) were compared with the original pathological classification. According to pathology, 23 tumors had a mixed histology dominated either by clear cell morphology (n = 11) or by papillary features (n = 12). Additionally, 4 ccRCC, 48 pRCC including 18 pRCC T1 and 14 pRCC T2, and 17 chRCC were analyzed. Tumors with PSA values ≥ 95% are colored. Samples with are marked in gray. Boxes refer to median and interquartile ranges with whiskers extending to a maximum of 1.5 times the interquartile range
Fig. 5Risk prediction for RCC based on PSA and the RCC-R score. A–C PSA were used as predictors of survival in C3 (n = 864). A prognostic index (PI), which differentiated the individual risk of patients based on ccRCC- and pRCC-score, was calculated as detailed in the “Methods” section. For samples without available survival data, the PI was predicted. Samples with are marked in gray. Hazard ratios (HR) were obtained by exponentiating the PI. A Combinations of ccRCC- and pRCC-score values are colored according to their HR. Points in the corners represent 201 (bottom right), 165 (top left), and 48 (bottom left) cases, respectively. B Principal component analysis plot is shown with samples colored according to their HR. C Distributions of HR in distinct, pathologically defined subtypes (n = 805) are displayed. Samples with are not shown here. Per histological subtype, a Cox regression of cancer-specific survival (CSS) on the respective subset of PI was conducted. Log-rank P-values are indicated by the level of significance: “***” P < 0.001, “**” P < 0.01, “*” P < 0.05, “.” P < “0.1”. D–F Relationship of cancer-specific survival (CSS) and the ccRCC-score, termed as RCC-R score, is shown. D The curve displays the estimated relationship as specified in Eq. 1 in the “Methods” section between the RCC-R score, modeled via cubic polynomials, and the PI in C3 (n = 828). Using conditional inference trees with endpoint CSS, the PI was categorized into three risk groups (good (n = 290), intermediate (n = 480), and poor (n = 58)). Corresponding P-values from recursive binary splitting are indicated. E Kaplan–Meier curves of CSS for risk groups based on the RCC-R score are shown for the discovery cohort (C3). Additionally, HR with the good group as reference are specified for the intermediate and the poor group. F Kaplan–Meier curves of CSS for risk groups based on the RCC-R score in the validation cohort (C5, n = 241) are shown by colored curves. Corresponding Kaplan–Meier curves for C3 are added for comparison. Indicated HR and log-rank test P-value result from Cox regression analysis in C5. G–I Relationship of RCC-R score with established molecular signatures. Risk groups derived from the RCC-R score in cohort C3 were compared with different molecular-based classifications or signatures available for the combined TCGA RCC or the KIRC cohort. G Bar chart showing distribution (%) of nine major genomic subtypes of RCC (as established by multi-omics analysis [67]) per risk group (good (n = 290), intermediate (n = 480), and poor (n = 58)). H Boxplots showing immune infiltration as predicted by the ESTIMATE method [70] per risk group (good (n = 290), intermediate (n = 480), and poor (n = 58)). I Bar chart showing distribution (%) of four immune subtypes of ccRCC [71] per risk group (good (n = 17), intermediate (n = 421), and poor (n = 15))
Multivariate Cox analyses of cancer-specific survival in 237 patients of the validation cohort (C5) with and all clinicopathological parameters available
| Variable | Level | Hazard ratio (95% CI) | |
|---|---|---|---|
| Female | 1 | ||
| Male | 0.88 (0.46–1.70) | 0.71 | |
| Linear | 0.99 (0.97–1.02) | 0.59 | |
| 1 | 1 | ||
| 2 | 1.83 (0.4–8.43) | 0.44 | |
| 3 | 5.12 (2.1–12.51) | 3.4E − 04 | |
| 4 | 11.49 (2.54–51.88) | 1.5E − 03 | |
| 0 | 1 | ||
| 1/2 | 1.62 (0.73–3.62) | 0.24 | |
| X | 0.53 (0.14–1.96) | 0.34 | |
| 0 | 1 | ||
| 1 | 6.06 (2.74–13.41) | 8.7E − 06 | |
| X | 1.45 (0.32–6.51) | 0.63 | |
| Linear | 0.92 (0.83–1.02) | 9.7E − 02 | |
| ccRCC | 1 | ||
| pRCC | 2 (0.96–4.16) | 6.2E − 02 | |
| chRCC | 8.8 (0.86–89.78) | 6.7E − 02 | |
| Mixed | 1.88 (0.28–12.67) | 0.52 | |
| Linear | 2.14 (1.14–4.04) | 1.8E − 02 | |
Fig. 6Prediction of therapeutic outcome by PSA in the IMmotion151 and JAVELIN Renal 101 trials. A, B Principal component analysis (PCA) using expression of 174 signature genes identified 341 of 823 samples and 407 of 726 samples as heterogeneous tumors in the IMmotion151 and of the JAVELIN Renal 101 trials, respectively. Here, tumors with maximum PSA value below 95% were considered as potential heterogeneous. Their molecular subtype is visualized by pie charts (enlarged symbols). Samples with non-significant PSA () are displayed by shaded pie charts with gray borders (IMmotion151 n = 23; JAVELIN Renal 101 n = 12). C–F Kaplan–Meier curves of progression-free survival (PFS) are shown for PD-L1-positive tumors in both cohorts with a ccRCC proportion of at least (C, D) or less (E, F) than 95% based on PSA. Cox regression analysis was used to determine P-values (log-rank test) and HR of checkpoint inhibition with tyrosine-kinase inhibition versus sunitinib, respectively