| Literature DB >> 27993167 |
Slavica Dimitrieva1, Ralph Schlapbach2, Hubert Rehrauer2.
Abstract
BACKGROUND: Kidney renal clear cell carcinoma (KIRC) is a type of cancer that is resistant to chemotherapy and radiotherapy and has limited treatment possibilities. Large-scale molecular profiling of KIRC tumors offers a great potential to uncover the genetic and epigenetic changes underlying this disease and to improve the clinical management of KIRC patients. However, in practice the clinicians and researchers typically focus on single-platform molecular data or on a small set of genes. Using molecular and clinical data of over 500 patients, we have systematically studied which type of molecular data is the most informative in predicting the clinical outcome of KIRC patients, as a standalone platform and integrated with clinical data.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27993167 PMCID: PMC5168807 DOI: 10.1186/s13062-016-0170-1
Source DB: PubMed Journal: Biol Direct ISSN: 1745-6150 Impact factor: 4.540
Fig. 3Performance of different feature selection approaches (“extreme score stratification”, “mean score stratification”, “extreme survival stratification” and combined approach) on different omics data on the KIRC cohort using 3-fold cross validation. The points at each plot show the average values across the three cross validation rounds. For clarity, the standard errors are omitted here, but are shown in Additional files 1 and 3
Fig. 1Feature selection process using three different approaches illustrated for the miRNA hsa-mir-21 in the KIRC cohort. a “Extreme score stratification approach”, where we compare the differences in the survival between “extremely” high expression values (Z-scores > 1, shown in blue) and “extremely” low expression values (Z-scores < −1, shown in red). b “Mean score stratification approach”, where we compare the differences in the survival between higher than average expression values (Z-scores > 0, shown in blue) and lower than average expression values (Z-scores < 0, shown in blue). c “Extreme survival stratification approach”, where we search for significant expression differences between patients that died within the first year of diagnosis (shown in blue), and patients that lived longer than 5 years (shown in red)
Fig. 2Flowchart of the analyses. a 3-fold cross validation procedure: the complete set of patients was distributed into three equally sized sets, and each time two sets were used as a training data, while the remaining set was used as a test data. b Computational steps performed at each cross-validation round on the training and test datasets
Fig. 4a Performance of predictive models built using individual omics data (miRNA/mRNA/protein expression, CNV segment means and DNA methylation). The gray line denotes the performance of the model based only on clinical variables (gender, age, tumor grade and tumor stage). b Performance of predictive models built using individual omics data (miRNA/mRNA/protein expression, CNV segment means and DNA methylation) integrated with clinical data (gender, age, tumor grade and tumor stage). The plots show only the results for the best predictive approach on each omics data, as shown on Fig. 3. The results were validated using 3-fold cross validation. For clarity, the standard errors are omitted here, but are shown in Additional file 6
Overview of high-throughput molecular data availability by tissue type in TCGA KIRC patients
| Molecular platform | # patients with molecular profile in tumor tissue | # patients with molecular profile in normal adjacent tissue |
|---|---|---|
| miRNA expression | 493 | 71 |
| mRNA expression | 518 | 72 |
| Protein expression | 454 | 0 |
| CNV | 511 | 510 |
| DNA methylation | 477 | 358 |
Molecular biomarkers that were identified by at least 2 of the approaches with frequency of 100% in any of the three cross-validation rounds
| Molecular type | Molecular biomarker | Extreme score stratific. | Mean score stratific. | Extreme survival stratific. | Survival prognosis association |
|---|---|---|---|---|---|
| miRNA | hsa-mir-10b | ✓ | ✓ | ✓ | High expression in better outcome |
| hsa-mir-130a; hsa-mir-21 | ✓ | ✓ | ✓ | High expression in worse outcome | |
| hsa-mir-190; hsa-mir-204; hsa- mir-676 | ✓ | ✓ | High expression in better outcome | ||
| hsa-let-7i | ✓ | ✓ | High expression in worse outcome | ||
| hsa-mir-130b; hsa-mir-18a; hsa- mir-365-1; hsa-mir-223; hsa- mir-92b | ✓ | ✓ | High expression in worse outcome | ||
| hsa-mir-3613 | ✓ | ✓ | High expression in worse outcome | ||
| hsa-mir-374b; hsa-mir-590 | ✓ | ✓ | High expression in worse outcome | ||
| mRNA | ADH5; ARHGAP24; CLDN10; EHHADH; EIF4EBP2; FBXL5; GIPC2; IMPA2; MFSD4; SALL1; SORBS2; TPRG1L; LRBA; RBM47; RETSAT; RGNEF; SH3BGRL2 | ✓ | ✓ | ✓ | High expression in better outcome |
| AMOT; BBS1; CDC14B; EPHX2; FARS2; KCNJ15; PINK1; RAB3IP; STK32B; ZNF704; | ✓ | ✓ | High expression in better outcome | ||
| ACADM; ALDH6A1; AMD1; ANK3; ATP11A; C5orf23; CCDC121; CLCN5; CPT2; CRYL1; CYFIP2; DDAH1; DMRTA1; FCHO2l; MAP7; MIA2; MOBKL2B; MRPS18B; NPR3; PANK1; PRKAA2; PRUNE2; SLC16A12; SLC27A2; SPATA18; TFEC; TMEM192; TMEM27; TMEM38B; TOX3; WDR31; | ✓ | ✓ | High expression in better outcome | ||
| ACOX1; ALDH3A2; HLF; TIMP3; TMEM150C; UFSP2; | ✓ | ✓ | High expression in better outcome | ||
| Protein | AR; CTNNA1; CTNNB1; GAB2 | ✓ | ✓ | ✓ | High expression in better outcome |
| ACACA; CDKN1A; EA15; RAD51 | ✓ | ✓ | ✓ | High expression in worse outcome | |
| ERRFI1; IGF1R; MAPK1 MAPK3; SHC1 | ✓ | ✓ | High expression in better outcome | ||
| EEF2 | ✓ | ✓ | High expression in worse outcome | ||
| TSC2 | ✓ | ✓ | High expression in better outcome | ||
| IGFBP2; VASP | ✓ | ✓ | High expression in worse outcome | ||
| DNA methylation probes | cg03032025 (CPEB4) | ✓ | ✓ | High methylation in worse outcome | |
| cg14827391 (NXN) | ✓ | ✓ | High methylation in worse outcome |
Fig. 5Stage specific methylation changes. Higher methylation levels (shown in red) are observed in stage III and stage IV patients, while lower methylation levels (in green) are observed in stage I and stage II patients. “cgX” denotes the identifier of the plotted methylation probe
Fig. 6Interconnection between DNA methylation levels and RNA abundance illustrated for mir-21 in normal (red points) and tumor samples (black points). KIRC tumor samples are characterized by lower methylation levels and increased mir-21 expression
Fig. 7Interactions between some of the genes/proteins selected as survival predictive by our analysis. The shape of the nodes in this network corresponds to their biological function (see the legends on topleft). The genes/proteins that are underlined with purple are negatively associated with clinical outcome (i.e. higher expression is linked to poor survival); the ones underlined with yellow are positively associated with outcome (higher expression is linked to better survival). Higher methylation in genes underlined in blue is associated with worse outcome. This interaction network has been generated using MetaCore bioinformatics software version 6.26 build 68498 from Thomson Reuters https://portal.genego.com [41]