| Literature DB >> 24798945 |
Lucy F Stead1, Helene Thygesen, David R Westhead, Pamela Rabbitts.
Abstract
The catalogue of tumour-specific somatic mutations (SMs) is growing rapidly owing to the advent of next-generation sequencing. Identifying those mutations responsible for the development and progression of the disease, so-called driver mutations, will increase our understanding of carcinogenesis and provide candidates for targeted therapeutics. The phenotypic consequence(s) of driver mutations cause them to be selected for within the tumour environment, such that many approaches aimed at distinguishing drivers are based on finding significantly somatically mutated genes. Currently, these methods are designed to analyse, or be specifically applied to, nonsynonymous mutations: those that alter an encoded protein. However, growing evidence suggests the involvement of noncoding transcripts in carcinogenesis, mutations in which may also be disease-driving. We wished to test the hypothesis that common DNA variation rates within humans can be used as a baseline from which to score the rate of SMs, irrespective of coding capacity. We preliminarily tested this by applying it to a dataset of 159,498 SMs and using the results to rank genes. This resulted in significant enrichment of known cancer genes, indicating that the approach has merit. As additional data from cancer sequencing studies are made publicly available, this approach can be refined and applied to specific cancer subtypes. We named this preliminary version of our approach PRISMAD (polymorphism rates indicate somatic mutations as drivers) and have made it publicly accessible, with scripts, via a link at www.precancer.leeds.ac.uk/software-and-datasets.Entities:
Keywords: cancer driver genes; next-generation sequencing; somatic mutation
Mesh:
Substances:
Year: 2014 PMID: 24798945 PMCID: PMC4277321 DOI: 10.1002/ijc.28951
Source DB: PubMed Journal: Int J Cancer ISSN: 0020-7136 Impact factor: 7.396
Highlighting genes that contain candidate somatic driver mutations in different functional classes
| Class of gene | Total | Mean RD (variants/kb) | Median RD (variants/kb) |
|---|---|---|---|
| Protein-coding | 20,036 | −1.49 | −1.16 |
| lincRNA | 6,296 | −2.80 | −2.24 |
| miRNA | 3,110 | −2.67 | 0 |
| lncRNA | 786 | −2.43 | −1.96 |
| Pseudogene | 13,004 | −2.69 | −1.83 |
RD: rate difference (somatic mutation rate minus common polymorphism rate).
Figure 1Predicted folding of the hsa-mir-99b precursor in wild-type (a) and somatically mutated (b) form. The locations of the mature miRNAs (had-miR-99b-3p and had-miR-99b-5p) that are excised from the precursor are annotated. The colouring indicates the probability of base pairing as indicated by the scale bar. The location of the variant position is given by the block arrow, with the change in the mutant sequence labelled on the figure. [Color figure can be viewed in the online issue, which is available at wileyonlinelibrary.com.]