| Literature DB >> 29695279 |
Francis Blokzijl1, Roel Janssen1, Ruben van Boxtel1,2, Edwin Cuppen3.
Abstract
BACKGROUND: Base substitution catalogues represent historical records of mutational processes that have been active in a cell. Such processes can be distinguished by various characteristics, like mutation type, sequence context, transcriptional and replicative strand bias, genomic distribution and association with (epi)-genomic features.Entities:
Keywords: Base substitutions; Mutational processes; Mutational signatures; R; Replicative strand bias; Somatic mutations; Transcriptional strand bias
Mesh:
Year: 2018 PMID: 29695279 PMCID: PMC5922316 DOI: 10.1186/s13073-018-0539-0
Source DB: PubMed Journal: Genome Med ISSN: 1756-994X Impact factor: 11.117
Fig. 1Characteristics of somatic mutations acquired in human ASCs of different tissues. a Relative contribution of the indicated mutation types to the point mutation spectrum for each tissue type. Bars depict the mean relative contribution of each mutation type over all ASCs per tissue type and error bars indicate the standard deviation. The total number of somatic point mutations per tissue is indicated. b Relative contribution of each indicated trinucleotide change to the three mutational signatures that were identified by NMF analysis of the somatic mutation catalogues of the ASCs. c Relative contribution of each mutational signature for each sample. d Heatmap showing the cosine similarity of the mutational signatures in b with the COSMIC signatures
Fig. 2Reconstruction of mutational profiles using known mutational signatures. a The optimal relative contribution of COSMIC signatures to reconstruct the mutational profiles of the samples. The signatures with at least 10% contribution in at least one of the samples are plotted. b Cosine similarity between the original mutational profile and the reconstructed mutational profile based on the optimal linear combination of all 30 COSMIC signatures. The line indicates the threshold of cosine similarity = 0.95. c Relative contribution of each of the 96 trinucleotide changes to the original mutational profile (upper panel) and the reconstructed mutational profile (middle panel), and the difference between these profiles (lower panel) for the ASC with the lowest cosine similarity (1-a). The residual sum of squares (RSS) and the cosine similarity between the original and the reconstructed mutational profile are indicated
Fig. 3Transcriptional strand bias and genomic distribution. a Mutational signatures with transcriptional strand information. The relative contribution of each trinucleotide change, subdivided into the fraction of trinucleotide changes present on the transcribed (T, light shades) and untranscribed strand (U, dark shades) b Log2 ratio of the number of mutations on the transcribed and untranscribed strand per indicated base substitution for each signature depicted in a. The log2 ratio indicates the effect size of the bias and asterisks indicate significant transcriptional strand asymmetries (P < 0.05, two-sided binomial test). c Log2 ratio of the number of mutations on the transcribed and untranscribed strand per indicated base substitution for each tissue type. Asterisks indicate significant transcriptional strand asymmetries (P < 0.05, two-sided Poisson test) d Enrichment and depletion of somatic point mutations in the promoter regions, gene bodies and intergenic genomic regions for all tissues. The log2 ratio of the number of observed and expected point mutations indicates the effect size of the enrichment or depletion in each region. Asterisks indicate significant enrichments or depletions (P < 0.05, one-sided binomial test). e Rainfall plot showing the genomic location of mutations, intermutation distance and the mutation types for sample 14-b
Fig. 4Heatmap of cosine similarities between the mutational profile of each sample and COSMIC signature. The samples are hierarchically clustered (average linkage) using the Euclidean distance between the vectors of cosine similarities with the signatures. The signatures have been ordered according to hierarchical clustering (average linkage) using the cosine similarity between signatures, such that similar signatures are displayed close together
MutationalPatterns feature overview and comparison with related software tools
| Functionality | Analysis | Mutational | pmsignature [ | MutSpec [ | Somatic | deconstructSigs [ | EMu [ |
|---|---|---|---|---|---|---|---|
| Language/platform | R | R | Galaxy | R | R | C++ | |
| Mutational characteristics | Mutation spectrum | X | X | X | X | X | – |
| Transcriptional strand bias | X | – | X | – | – | – | |
| 96 mutation profile | X | – | X | X | – | X | |
| Mutational signatures | Signature extraction (NMF) | X | X | X | X | – | – |
| Signature extraction (NMF) with strand bias | X | X | – | – | – | – | |
| Signature contribution heatmap | X | – | – | X | – | – | |
| Signature contribution barplot | X | – | X | X | – | X | |
| Hierarchical sample clustering based on signature contribution | X | – | X | X | – | – | |
| Signature similarity heatmap | X | – | X | – | – | – | |
| Plot and compare two 96 profiles | X | – | – | – | X | – | |
| Sample signature similarity heatmap | X | – | – | – | – | – | |
| Find optimal linear combination of known signatures | X | – | – | – | X | – | |
| Genomic distribution | Rainfall plot/mutation clustering along the genome | X | – | – | – | – | X |
| Enrichment/depletion in genomic regions | X | – | – | – | – | X |