| Literature DB >> 28454375 |
Chengmei Long1, Jinbo Jian2, Xinchang Li1, Gongxian Wang3, Jingen Wang4.
Abstract
An accumulation of driver mutations is important for cancer formation and progression, and leads to the disruption of genes and signaling pathways. The identification of driver mutations and genes has been the subject of numerous previous studies. The present study was performed to identify cancer-driving mutations and genes in renal cell carcinoma (RCC), prioritizing noncoding variants with a high functional impact, in order to analyze the most informative features. Sorting Intolerant From Tolerant (SIFT), Polymorphism Phenotyping version 2 (Polyphen2) and MutationAssessor were applied to predict deleterious mutations in the coding genome. OncodriveFM and OncodriveCLUST were used to detect potential driver genes and signaling pathways. The functional impact of noncoding variants was evaluated using Combined Annotation Dependent Depletion, FunSeq2 and Genome-Wide Annotation of Variants. Noncoding features were analyzed with respect to their enrichment of high-scoring variants. A total of 1,327 coding mutations in clear cell RCC, 258 in chromophobe RCC and 1,186 in papillary RCC were predicted to be deleterious by all three of MutationAssessor, Polyphen2 and SIFT. In total, 77 genes were positively selected by OncodriveFM and 1 by OncodriveCLUST, 45 of which were recurrently mutated genes. In addition, 10 signaling pathways were recurrently mutated and had a high functional impact bias (FM bias), and 31 novel signaling pathways with high FM bias were identified. Furthermore, noncoding regulatory features and conserved regions contained numerous high-scoring variants, and expression, replication time, GC content and recombination rate were positively correlated with the densities of high-scoring variants. In conclusion, the present study identified a list of cancer-driving genes and signaling pathways, features like regulatory elements, conserved regions, replication time, expression, GC content and recombination rate are major factors that affect the distribution of high-scoring non-coding mutations in kidney cancer.Entities:
Keywords: driver gene; driver mutation; driver pathway; functional non-coding variants; kidney cancer
Year: 2017 PMID: 28454375 PMCID: PMC5403472 DOI: 10.3892/ol.2017.5689
Source DB: PubMed Journal: Oncol Lett ISSN: 1792-1074 Impact factor: 2.967
Figure 1.Venn diagram revealing the number of variants with deleterious effects predicted by SIFT, MutationAssesor and Polyphen2, and (A) the overlap between variants in clear cell renal cell carcinoma, (B) the overlap between variants in chromophobe renal cell carcinoma and (C) the overlap between variants in papillary renal cell carcinoma. (D) Mutation signatures in kidney cancer. (E) Densities of deleterious mutations in the coding regions of cancer genes and non-cancer genes. SIFT, Sorting Intolerant From Tolerant; polyphen2, Polymorphism Phenotyping version 2.
Cancer-driving signaling pathways as detected by OncodriveFM in kidney cancer.
| A, Clear cell renal cell carcinoma | |||||
|---|---|---|---|---|---|
| Pathway name | Pathway Identification number | Gene number | FM_Z score | P-value | Q-value |
| Oxidative phosphorylation | hsa00190 | 121 | 3.82 | 6.80×10−5 | 1.90×10−3 |
| Spliceosome | hsa03040 | 125 | 3.54 | 1.97×10−4 | 3.44×10−3 |
| RNA degradation | hsa03018 | 70 | 4.14 | 1.74×10−5 | 7.98×10−4 |
| Ubiquitin-mediated proteolysis | hsa04120 | 137 | 6.64 | 1.60×10−11 | 2.24×10−9 |
| Phagosome | hsa04145 | 147 | 3.08 | 1.05×10−3 | 1.63×10−2 |
| Legionellosis | hsa05134 | 53 | 2.71 | 3.36×10−3 | 4.71×10−2 |
| Pathways in cancer | hsa05200 | 326 | 4.08 | 2.28×10−5 | 7.98×10−4 |
| HIF-1 signaling | hsa04066 | 110 | 3.59 | 1.67×10−4 | 3.44×10−3 |
| Leukocyte transendothelial migration | hsa04670 | 114 | 3.55 | 1.94×10−4 | 3.44×10−3 |
| Renal cell carcinoma | hsa05211 | 70 | 5.57 | 1.24×10−8 | 8.69×10−7 |
| B, Papillary renal cell carcinoma | |||||
| Pathway name | Pathway Identification number | Gene number | FM_Z score | P-value | Q-value |
| Wnt signaling pathway | hsa04310 | 152 | 2.88 | 1.97×10−3 | 8.94×10−52 |
| Metabolic pathways | hsa01100 | 1160 | 2.81 | 2.46×10−3 | 0.09 |
| Pyrimidine metabolism | hsa00240 | 101 | 1.69 | 4.56×10−2 | 0.98 |
| Citrate cycle | hsa00020 | 30 | 2.44 | 7.30×10−3 | 0.20 |
| Viral myocarditis | hsa05416 | 68 | 3.60 | 1.57×10−4 | 1.71×10−2 |
| C, Chromophobe renal cell carcinoma | |||||
| Pathway name | Pathway Identification number | Gene number | FM_Z score | P-value | Q-value |
| Neurotrophin signaling pathway | hsa04722 | 119 | 5.38 | 3.63×10−8 | 4.23×10−8 |
| Herpes simplex infection | hsa05168 | 182 | 6.96 | 1.71×10−12 | 4.27×10−12 |
| Epstein-Barr virus infection | hsa05169 | 199 | 7.18 | 3.58×10−13 | 1.25×10−12 |
| HTLV–I infection | hsa05166 | 260 | 6.01 | 9.02×10−10 | 1.17×10−9 |
| Hepatitis C | hsa05160 | 131 | 6.71 | 9.80×10−12 | 1.91×10−11 |
| Hepatitis B | hsa05161 | 147 | 7.39 | 7.34×10−14 | 3.67×10−13 |
| Measles | hsa05162 | 134 | 6.83 | 4.18×10−12 | 9.15×10−12 |
| Wnt signaling pathway | hsa04310 | 152 | 6.02 | 8.93×10−10 | 1.17×10−9 |
| MAPK signaling pathway | hsa04010 | 257 | 7.41 | 6.15×10−14 | 3.59×10−13 |
| Chronic myeloid leukemia | hsa05220 | 73 | 7.57 | 1.94×10−14 | 1.70×10−13 |
| Non-small cell lung cancer | hsa05223 | 54 | 7.09 | 6.48×10−13 | 2.06×10−12 |
| Small cell lung cancer | hsa05222 | 85 | 7.23 | 2.49×10−13 | 9.68×10−13 |
| Focal adhesion | hsa04510 | 204 | 2.96 | 1.56×10−3 | 1.71×10−3 |
| Cell cycle | hsa04110 | 124 | 6.96 | 1.71×10−12 | 4.27×10−12 |
| Apoptosis | hsa04210 | 87 | 6.57 | 2.54×10−11 | 4.67×10−11 |
| p53 signaling pathway | hsa04115 | 68 | 6.35 | 1.08×10−10 | 1.79×10−10 |
| Transcriptional misregulation in cancer | hsa05202 | 180 | 5.78 | 3.69×10−9 | 4.46×10−9 |
| Viral carcinogenesis | hsa05203 | 203 | 6.92 | 2.32×10−12 | 5.41×10−12 |
| Pathways in cancer | hsa05200 | 326 | 8.17 | 1.53×10−16 | 5.34×10−15 |
| Amyotrophic lateral sclerosis | hsa05014 | 53 | 6.24 | 2.22×10−10 | 3.53×10−10 |
| Bladder cancer | hsa05219 | 42 | 6.15 | 3.84×10−10 | 5.37×10−10 |
| mTOR signaling pathway | hsa04150 | 64 | 3.92 | 4.47×10−5 | 5.05×10−5 |
| Huntington's disease | hsa05016 | 180 | 7.03 | 1.01×10−12 | 2.94×10−12 |
| Thyroid cancer | hsa05216 | 29 | 6.17 | 3.37×10−10 | 4.92×10−10 |
| Prostate cancer | hsa05215 | 88 | 7.98 | 7.14×10−16 | 8.33×10−15 |
| Melanoma | hsa05218 | 71 | 7.46 | 4.40×10−14 | 3.08×10−13 |
| Basal cell carcinoma | hsa05217 | 55 | 6.17 | 3.37×10−10 | 4.92×10−10 |
| PI3K-Akt signaling pathway | hsa04151 | 338 | 5.90 | 1.81×10−9 | 2.26×10−9 |
| Pancreatic cancer | hsa05212 | 70 | 7.35 | 9.55×10−14 | 4.18×10−13 |
| Endometrial cancer | hsa05213 | 52 | 6.81 | 4.92×10−12 | 1.01×10−11 |
| Glioma | hsa05214 | 65 | 8.07 | 3.60×10−16 | 6.30×10−15 |
| Colorectal cancer | hsa05210 | 62 | 6.56 | 2.69×10−11 | 4.70×10−11 |
HIF-1, hypoxia-inducible factor 1; HTLV-I, human T-lymphotropic virus I; MAPK, mitogen-activated protein kinase; p53, tumor protein 53; mTOR, mechanistic target of rapamycin; PI3K-Akt, phosphoinositide 3-kinase-protein kinase B.
Figure 2.(A) Density plots of the scores of all noncoding variants, as predicted by CADD, FunSeq2 and GWAVA. (B) The highest 10,000 scoring noncoding variants, as predicted by each method, and the overlap between them. (C) Barplot presenting the densities of the 1,454 overlapping high-scoring noncoding variants, as predicted by each of the methods individually as well as in combination, in various noncoding features. CADD, Combined Annotation Dependent Depletion; GWAVA, Genome-Wide Annotation of Variants; GCL, GC content low in 1-Kb windows; lncRNA, long noncoding RNA; LE, low expression levels; LR, late replicated; PCgene, protein-coding gene; RRL, replication rate low in 1-Kb windows; Intron L, introns of lncRNAs; ncExon, non coding exon; Intron P, introns of PCgenes; Exon L, exons of lncRNAs; Exon P, exons of PCgenes; ER, early replicated; UTR, untranslated region; RRH, replication rate high in 1-Kb windows; HE, high expression levels; GCH, GC content high in 1-Kb windows; cTFBS, conserved transcription factor binding sites; CR, conserved region.
Figure 3.(A) Correlation between gene expression levels (RPKM) and the densities of high-scoring variants. (B) Correlation between replication time calculated as (G1b+S1)/(S4+G2) and the densities of high-scoring variants. (C) Correlation between GC content (representing the fraction of GC bases in 1-Kb windows) and the densities of high-scoring variants. (D) Correlation between average recombination rate and the densities of high-scoring variants. CADD, Combined Annotation Dependent Depletion; GWAVA, Genome-Wide Annotation of Variants; RPKM, reads per Kb per million reads.