| Literature DB >> 26865835 |
Chuance Du1, Xiaoyuan Wu2, Jia Li3.
Abstract
BACKGROUND: Mutation rates are consistently varied in cancer genome and play an important role in tumorigenesis, however, little has been known about their function potential and impact on the distribution of functional mutations. In this study, we investigated genomic features which affect mutation pattern and the function importance of mutation pattern in cancer.Entities:
Keywords: Clear-cell renal cell carcinoma; Functional variants; Liver cancer; Lung cancer; Melanoma; Mutation pattern; Random forest
Year: 2016 PMID: 26865835 PMCID: PMC4748466 DOI: 10.1186/s12935-016-0278-5
Source DB: PubMed Journal: Cancer Cell Int ISSN: 1475-2867 Impact factor: 5.722
Fig. 1Densities of cancer somatic mutations of ccRCC associated to a wide range of genomic features chromosome by chromosome. For each chromosome, the size of each feature was calculated and numbers of somatic mutations were counted, somatic mutation densities were computed. The red and blue dashed lines stand for CDS and genome-wide average mutation densities. CDS coding sequence, CR conserved region, ECS evolutionarily conserved structure, GCH and GCL 1 Kb-windows with high and low GC content, Exon.P and Exon.L, Intron.P and Intron.P exon and intron of protein coding gene and lncRNA respectively, PCgene protein coding gene, cTFBS conserved TFBS, UTR Untranslated region, PCgene.ER and PCgene.LR, LncRNA.ER and LncRNA.LR early and late replicated protein coding gene and lncRNA, PCgene.HE and PCgene.LE, LncRNA.HE and LncRNA.LE high and low expressed protein coding gene and lncRNA, RRH and RRL 1 Kb-windows with high and low recombination rate, H3K4me1, H3K9ac, etc. histone methylation and acetylation data, ncExon non coding Exon
Fig. 2Correlation coefficients of cancer somatic mutation (CSM), SNP densities with diverse genetic and epigenetic features at 1-Mb resolution
Fig. 3Importance measured by %IncMSE for each feature in the CSM and SNP RF models a %IncMSE importance of all features in the CSM RF model; b %IncMSE importance of all features in the SNP RF model
Fig. 4The impact of CSM and SNP scores on the distribution of deleterious coding variants and functional non-coding variants. a average CSM scores correlate negatively with densities of deleterious coding variants predicted by SIFT and Mutation Assessor on a 200 Mb scale; b average SNP scores correlate negatively with densities of deleterious coding variants predicted by SIFT and Mutation Assessor on a 200 Mb scale; c The distribution of FunSeq 2 and GWAVA scores for non-coding variants with regards to CSM scores; d The distribution of FunSeq 2 and GWAVA scores for non-coding variants with regards to SNP scores
Fig. 5The impact of CSM and SNP scores on the distribution of disease-causing variants from the ClinVar and HGMD databases. a The average fraction of 100 Mb low CSM, SNP, high CSM and SNP scoring regions in various features; b The distribution of CSM and SNP scores for ClinVar, HGMD disease-causing variants and random SNPs, random SNPs are 1 % SNPs randomly chosen from 1000 human genomes project; c The correlation of average CSM scores with densities of disease-causing variants on a 200 Mb scale; d The correlation of average SNP scores with densities of ClinVar, and HGMD disease-causing variants on a 200 Mb scale