| Literature DB >> 32982583 |
Qiang Zhao1, Jia Xue2, Baoan Hong1, Wubin Qian2, Tiezhu Liu3, Bin Fan4, Jie Cai4, Yongpeng Ji1, Jia Liu1, Yong Yang1, Qixiang Li5,6, Sheng Guo2, Ning Zhang1.
Abstract
BACKGROUND: Large-scale initiatives like The Cancer Genome Atlas (TCGA) performed genomics studies on predominantly Caucasian kidney cancer. In this study, we aimed to investigate genomics of Chinese clear cell renal cell carcinoma (ccRCC).Entities:
Keywords: Clear cell renal cell carcinoma; Immuno-phenotyping; Molecular classification; TCGA; Transcriptomic characterization
Year: 2020 PMID: 32982583 PMCID: PMC7510315 DOI: 10.1186/s12935-020-01552-w
Source DB: PubMed Journal: Cancer Cell Int ISSN: 1475-2867 Impact factor: 5.722
Clinical data summary of studied ccRCC datasets
| CKC dataset | Percentage | TCGA dataset | Percentage | |
|---|---|---|---|---|
| Sample (matched normal) | 55 (11) | 533 (72) | ||
| Average of age (range) | 58.3 (25–80) | 60.63 (26–90) | ||
| Gender | ||||
| Male | 41 | 74.5 | 345 | 64.7 |
| Female | 14 | 25.5 | 188 | 35.3 |
| Race | ||||
| White | 0 | 0. | 462 | 86. |
| Black | 0 | 0.0 | 56 | 10.5 |
| Asian | 55 | 100.0 | 8 | 1.5 |
| Not available | 0 | 0.0 | 7 | 1.3 |
| Vital status | ||||
| Alive | 55 | 100.0 | 358 | 67.2 |
| Dead | 0 | 0.0 | 175 | 32.8 |
| Metastasis | ||||
| Mets− | 51 | 92.7 | 422 | 79.2 |
| Mets+ | 4 | 7.3 | 109 | 20.5 |
| Not available | 0 | 0.0 | 2 | 0.4 |
| Pathologic stage | ||||
| Stage I | 41 | 74.5 | 267 | 50.1 |
| Stage II | 3 | 5.5 | 57 | 10.7 |
| Stage III | 11 | 20.0 | 123 | 23.1 |
| Stage IV | 0 | 0.0 | 83 | 15.6 |
| Not available | 0 | 0.0 | 3 | 0.6 |
| Histologic grade | ||||
| Grade1 | 8 | 14.5 | 14 | 2.6 |
| Grade2 | 29 | 52.7 | 229 | 43.0 |
| Grade3 | 13 | 23.6 | 206 | 38.6 |
| Grade4 | 4 | 7.3 | 76 | 14.3 |
| Not available | 1 | 1.8 | 3 | 0.6 |
Fig. 1Driver mutations and Gene fusion markers. a Scatterplot of mutation load in clear cell renal cell carcinoma patients from Chinese (n = 55) and TCGA (n = 533) collection. Median mutation load of each dataset is marked in red. b Summary of Top 26 driver mutations detected in Chinese cohort. Colors indicate mutation types. c PBRM1 mutation is associated with activation of VEGF signaling pathways in TCGA ccRCC collection. d Circos plot of detected fusion genes. Recurrent fusion events observed in more than one sample are highlighted in red
The frequencies of driver mutation genes in CccRCC and TCGA datasets
| Gene | Frequency in Chinese ccRCC (%) | Frequency in TCGA KIRC white (%) | p-value (Fisher’s exact test) |
|---|---|---|---|
| VHL | 76.4 | 52.8 | 0.000862049 |
| BAP1 | 18.2 | 9 | 0.053248208 |
| NCOR2 | 12.7 | 1.4 | 0.000127079 |
| SETD2 | 10.9 | 11 | 1 |
| PBRM1 | 9.1 | 33.7 | 8.38E−05 |
| ATM | 9.1 | 3.5 | 0.065990135 |
| ERBB3 | 9.1 | 1.9 | 0.011804502 |
| NBN | 9.1 | 0.4 | 0.000226619 |
| DNM2 | 9.1 | 0.2 | 7.06E−05 |
| PTEN | 7.3 | 4.3 | 0.299167676 |
| KMT2C | 7.3 | 3.8 | 0.269293981 |
| GOLGA5 | 7.3 | 1.1 | 0.010440507 |
| MTOR | 5.5 | 7.1 | 1 |
| FAT1 | 5.5 | 3.5 | 0.449134307 |
| PTCH1 | 5.5 | 2.1 | 0.131792795 |
| MSH6 | 5.5 | 1.1 | 0.045880562 |
| PTPRB | 5.5 | 0.9 | 0.031041033 |
| EXT2 | 5.5 | 0.4 | 0.01040418 |
| APC | 3.6 | 1.1 | 0.170756605 |
| KDR | 3.6 | 1.1 | 0.170756605 |
| XPC | 3.6 | 1.1 | 0.170756605 |
| TSC1 | 3.6 | 0.9 | 0.13083787 |
| FGFR4 | 3.6 | 0.7 | 0.093628013 |
| SH2B3 | 3.6 | 0.4 | 0.06033848 |
| TGFBR2 | 3.6 | 0.4 | 0.06033848 |
| FANCE | 1.8 | 4.3 | 0.711854112 |
| PIK3CA | 1.8 | 4 | 0.70899921 |
| TP53 | 1.8 | 3.3 | 1 |
| POLE | 1.8 | 3.1 | 1 |
| CHD4 | 1.8 | 2.6 | 1 |
| SMARCA4 | 1.8 | 2.3 | 1 |
| ATR | 1.8 | 2.1 | 1 |
| CDK12 | 1.8 | 2.1 | 1 |
| TET2 | 1.8 | 2.1 | 1 |
| CHEK2 | 1.8 | 1.9 | 1 |
| EGFR | 1.8 | 1.9 | 1 |
| BLM | 1.8 | 1.6 | 0.604409035 |
| FLT4 | 1.8 | 1.6 | 0.604409035 |
| GNAS | 1.8 | 1.6 | 0.604409035 |
| MYH11 | 1.8 | 1.6 | 0.604409035 |
| TRRAP | 1.8 | 1.6 | 0.604409035 |
| NCOR1 | 1.8 | 1.4 | 0.55540565 |
| PIK3CB | 1.8 | 1.4 | 0.55540565 |
| FBXW7 | 1.8 | 1.1 | 0.500455786 |
| GATA2 | 1.8 | 1.1 | 0.500455786 |
| KAT6B | 1.8 | 1.1 | 0.500455786 |
| MLLT4 | 1.8 | 1.1 | 0.500455786 |
| TCF12 | 1.8 | 1.1 | 0.500455786 |
| ERBB2 | 1.8 | 0.9 | 0.4388528 |
| ARID1B | 1.8 | 0.9 | 0.4388528 |
| ASXL1 | 1.8 | 0.9 | 0.4388528 |
| CIC | 1.8 | 0.9 | 0.4388528 |
| JAK3 | 1.8 | 0.9 | 0.4388528 |
| MECOM | 1.8 | 0.9 | 0.4388528 |
| NUP98 | 1.8 | 0.9 | 0.4388528 |
| PER1 | 1.8 | 0.9 | 0.4388528 |
| BARD1 | 1.8 | 0.7 | 0.369807843 |
| DDR2 | 1.8 | 0.7 | 0.369807843 |
| MET | 1.8 | 0.7 | 0.369807843 |
| NDRG1 | 1.8 | 0.4 | 0.292440502 |
| FANCG | 1.8 | 0.4 | 0.292440502 |
| PLCG1 | 1.8 | 0.4 | 0.292440502 |
| PMS2 | 1.8 | 0.4 | 0.292440502 |
| PRF1 | 1.8 | 0.2 | 0.205768403 |
| CEBPA | 1.8 | 0.2 | 0.205768403 |
| MAP2K1 | 1.8 | 0.2 | 0.205768403 |
| MYH9 | 1.8 | 0.2 | 0.205768403 |
| PRDM1 | 1.8 | 0.2 | 0.205768403 |
| TCF7L2 | 1.8 | 0.2 | 0.205768403 |
| FH | 1.8 | 0 | 0.108695652 |
| CDKN1B | 1.8 | 0 | 0.108695652 |
| KLF6 | 1.8 | 0 | 0.108695652 |
| PPM1D | 1.8 | 0 | 0.108695652 |
| TRAF7 | 1.8 | 0 | 0.108695652 |
| KDM5C | 0.0 | 6.9 | 0.037678184 |
| ARID1A | 0.0 | 5.5 | 0.095406442 |
| MALAT1 | 0.0 | 2.1 | 0.606792113 |
| FGFR3 | 0.0 | 1.4 | 1 |
| ABL2 | 0.0 | 1.1 | 1 |
| BCL6 | 0.0 | 1.1 | 1 |
| CDK4 | 0.0 | 0.9 | 1 |
| CDH1 | 0.0 | 0.7 | 1 |
| PALB2 | 0.0 | 0.7 | 1 |
| FANCF | 0.0 | 0.2 | 1 |
Fisher’s Exact test was performed for each gene by comparing the sample counts with mutant and wild type in two datasets
Fig. 2Global overview of the transcriptomics of ccRCC patients. a The t-Distributed Stochastic Neighbor Embedding (t-SNE) plot of global mRNA expression for Chinese (C, n = 55) and TCGA’s (T, n = 533) ccRCC patients. Samples are colored by race. b Heatmap for gene set variation analysis (GSVA) on early (T1T2) and late (T3T4) clinical stages. Cutoffs used for GSVA were:|Fold Change| > 1.3 and Bonferroni & Hochberg adjusted p-value < 0.05
Fig. 3Clear cell renal cell carcinoma classification. a Chinese and TCGA ccRCC patients (n = 588) are classified into three subtypes based on NMF-clustering derived 300 genes. b Survival analysis on three identified classes of TCGA ccRCC patients revealed significant difference. The log rank test p-value across groups is 1.55E−15. c Comparison of CccRCC-defined ccRCC classification (n = 533 and n = 488) with TCGA’s classification (n = 488) by stratification of survival curves. d Overlap of samples classified in CccRCC and TCGA’s classification. The largest intersection for Class 1-3 is highlighted in red
Fig. 4Transcriptional Characterization of Immune Microenvironment of ccRCC. a Unsupervised hierarchical clustering of immune gene expression within CccRCC patients (including tumor and normal samples, n = 66). A signature of 66 immune related cell markers proposed by TCGA was used for clustering. b Survival analysis on TCGA ccRCC patients (n = 533) grouped by three identified immuno-phenotypes (immune-active, tolerant and inactive) revealed significant difference. The log-rank test p-value across groups is 1.8E−3. c Survival analysis on combination of classification. TCGA patients were grouped by combination of molecular classification from Fig. 3 (Class 1–3) and immuno-phenotyping (immune-active, tolerant and inactive). The log rank test p-value across groups is 2.40E−14