| Literature DB >> 25435088 |
Jason Roszik, Scott E Woodman1.
Abstract
BACKGROUND: Driver mutations are positively selected during the evolution of cancers. The relative frequency of a particular mutation within a gene is typically used as a criterion for identifying a driver mutation. However, driver mutations may occur with relative infrequency at a particular site, but cluster within a region of the gene. When analyzing across different cancers, particular mutation sites or mutations within a particular region of the gene may be of relatively low frequency in some cancers, but still provide selective growth advantage.Entities:
Mesh:
Year: 2014 PMID: 25435088 PMCID: PMC4265503 DOI: 10.1186/1471-2164-15-1044
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Driver mutation amino acid sites and regions. The COSMIC dataset was used to identify the most frequent mutations for well-characterized driver mutation sites/regions. The y-axis demarcates the names of genes and frequency of mutations within each gene. The x-axis demarcates the amino acid position within the protein product for each mutation. The vertical position of the blue dots represents the frequency of an amino acid alteration at a specific site. For illustrative purposes a few well-characterized mutant proteins are displayed, and a threshold of at least 10 samples with a specific amino acid aberration was employed. For the IDH1 protein, the R132 amino acid site is clearly the most aberrant (n = 5,557 entries) to the exclusion of any other site. The BRAF protein displays both a dominant site at V600 (n = 32,371 entries) within one of two relatively high frequency mutation regions. Finally, the KIT protein demonstrates regions with relatively high frequencies of amino acid alterations.
Figure 2HotSpotter identification of potential mutation hotspot sites/regions in the TCGA UCEC data set. The y-axis demarcates the names of genes and frequency of mutations within each gene. The x-axis demarcates the amino acid position within the protein product for each mutation. Orange dots (intentionally large for quick visualization) and their vertical position represent the frequency of mutation at a specific site in the UCEC test set. Blue dots (intentionally smaller) and their vertical position represent the frequency of mutations at specific sites in the COSMIC dataset. For non-substitution mutations, the first amino acid at which the alteration occurs is used as the “position”.