| Literature DB >> 35168570 |
Freek Manders1,2, Arianne M Brandsma1,2, Jurrian de Kanter1,2, Mark Verheul1,2, Rurika Oka1,2, Markus J van Roosmalen1,2, Bastiaan van der Roest2,3,4, Arne van Hoeck2,3, Edwin Cuppen2,3, Ruben van Boxtel5,6.
Abstract
BACKGROUND: The collective of somatic mutations in a genome represents a record of mutational processes that have been operative in a cell. These processes can be investigated by extracting relevant mutational patterns from sequencing data.Entities:
Keywords: Base substitutions; Indels; Mutagenic processes; Mutational signatures; R; Regional mutation patterns; Somatic mutations
Mesh:
Year: 2022 PMID: 35168570 PMCID: PMC8845394 DOI: 10.1186/s12864-022-08357-3
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Mutation profiles can be made for multiple mutation types. a Relative contribution of the indicated mutation types to the point mutation spectrum. Bars depict the mean relative contribution of each mutation type over all the samples and error bars indicate the 95% confidence interval. The dots show the relative contributions of the individual samples. The total number of somatic point mutations per tissue is indicated. b Absolute contribution of the indicated mutation types to the indel spectrum for the wild-type (WT) and MSH2 knockout. The total number of indels per sample is indicated. c Absolute contribution of the indicated mutation types to the DBS spectrum for the wild-type (WT) and XPC knockout. The total number of DBSs per sample is indicated. d Heatmap depicting the relative contribution of the indicated mutation types and the surrounding bases to the point mutation spectrum for the WT and MSH2 knockout. The total number of somatic point mutations per tissue is indicated
Fig. 2Regional spectra show differences between genomic regions. a Relative contribution of the indicated mutation types to the point mutation spectrum split between exons and the rest of the genome for each sample. The number of substitutions in each sample is indicated at the top of the figure. b Relative contribution of the indicated mutation types to the point mutation spectrum split between early-, intermediate-, and late-replicating DNA for each sample. The number of substitutions in each sample is indicated at the top of the figure. c Relative contribution of each trinucleotide change to the point mutation spectrum split between early- intermediate and late-replicating DNA for each sample. d A jitter plot depicting the presence of lesion segregation for each sample per chromosome. Each dot depicts a single base substitution. Any C > N or T > N is shown as a “+” strand mutation, while G > N and A > N mutations are shown on the “-“ strand. The x-axis shows the position of the mutations. The horizontal lines are calculated as the mean of the “+“ and “-“ strand, where “+“ equals 1 and “-” equals 0. They indicate per chromosome on which strand most of the mutations are located. The mutations were downsampled to 33% to reduce the file size
Fig. 3Signature refitting is improved. a Absolute contribution of each mutational signature for each sample using “regular” signature refitting and b “strict” signature refitting. c Dot plot showing the contribution of each mutational signature for each sample using bootstrapped signature refitting. The colour of a dot indicates the fraction of bootstrap iterations in which a signature contributed to a sample. The size indicates the mean number of contributing mutations across bootstrap iterations in which the contribution was not zero. d Heatmap depicting the Pearson correlation between signature contributions across the bootstrap iterations. e Bar graph depicting the cosine similarity between the original and reconstructed profiles of each sample based on signature refitting
Fig. 4Large cancer datasets show extended and regional mutation patterns. a Heatmap depicting the relative contribution of the indicated mutation types and the surrounding bases to the point mutation spectrum for metastatic melanomas. The total number of somatic point mutations is indicated. Only C > T substitutions are shown, because other substitution types are much less common. b Relative contribution of each C > T trinucleotide change to the point mutation spectrum split between different genomic regions. c Graph depicting the similarity in the mutation profile between genomic windows and the rest of the genome. Each dot shows the cosine similarity between the mutation profiles of a single window and the rest of the genome. The dots are colored based on the sizes in mega bases of the windows. The IGK (chr2), IGH (chr14) and IGL (chr22) loci are visualized with vertical orange lines [31]. The width of the lines is set at 1 pt., because using the actual widths of these loci results in lines that are too small to be visible
Feature comparison with other packages
| Group | Feature | MutationalPatterns v3.4.0 | MutationalPatterns v1.4.3 [ | Sigprofiler [ | SignatureAnalyzer [ | deconstructSigs [ | sparseSignatures [ | signeR [ | somaticSignatures [ | Maftools [ | decompTumor2Sig [ |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Language | Language/platform | R (bioconductor) | R (bioconductor) | Python (+ R wrapper) | Python | R (cran) | R (bioconductor) | R (bioconductor) | R (bioconductor) | R (bioconductor) | R (bioconductor) |
| Genome | Supported genomes | Genome agnostic | Genome agnostic | Human, Mice, Rat, Yeast | – | Human | Genome agnostic | Genome agnostic | Genome agnostic | Genome agnostic | Genome agnostic |
| Mutation profile | 96 SNV profile | X | X | X | – | X | – | X | X | X | X |
| extended SNV profile | X | – | X | – | – | – | – | X | – | X | |
| Indel profile | X | – | X | – | – | – | – | – | – | – | |
| DBS profile | X | – | X | – | – | – | – | – | – | – | |
| MBS profile | X | – | – | – | – | – | – | – | – | – | |
| Transcriptional strand bias profile | X | X | X | – | – | – | – | – | – | – | |
| Replicative strand bias profile | X | X | X | – | – | – | – | – | – | – | |
| Pool samples | X | – | – | – | – | – | – | – | – | – | |
| Signature extraction | Signature extraction (NMF) | X | X | X | – | – | – | – | X | X | – |
| Signature extraction (Bayes NMF) | X | – | – | X | – | – | X | – | – | – | |
| Signature extraction (Lasso NMF) | – | – | – | – | – | X | – | – | – | – | |
| Update signature names | X | – | – | – | – | – | – | – | – | – | |
| Signature refitting | Signature refitting | X | X | X | X | X | – | – | – | – | X |
| Strict signature refitting | X | – | X | X | X | – | – | – | – | X | |
| Strict signature refitting (best subset) | X | – | – | – | – | – | – | – | – | X | |
| – | Bootstrapped signature refitting | X | – | – | – | – | – | – | – | – | – |
| Correlation bootstrapped refitting | X | – | – | – | – | – | – | – | – | – | |
| Signature damage analysis | Signature potential damage analysis | X | – | – | – | – | – | – | – | – | – |
| Signature other | Plot supported profiles / signatures | X | X | X | X | X | X | X | X | X | X |
| Plot and compare supported profiles | X | X | – | – | – | – | – | – | – | – | |
| Signature contribution heatmap | X | X | – | – | – | – | X | X | – | – | |
| Signature contribution barplot | X | X | – | – | – | – | X | X | – | – | |
| Signature/profile similarity heatmap | X | X | – | – | – | – | – | – | X | – | |
| Similarity with reconstructed profile barplot | X | – | – | – | – | – | – | – | – | – | |
| Genomic distribution | Rainfall plot | X | X | – | – | – | – | – | X | X | – |
| Enrichment/depletion in genomic region | X | X | – | – | – | – | – | – | – | – | |
| Region specific profiles | X | – | – | – | – | – | – | – | – | – | |
| Region specific signatures | X | – | – | – | – | – | – | – | – | – | |
| Unsupervised regional similarity | X | – | – | – | – | – | – | – | – | – | |
| Lesion segregation | Lesion segregation | X | – | – | – | – | – | – | – | – | – |
Fig. 5Indel signatures can explain the MSH2 profile. a Relative contribution of each mutational signature for the wild-type (WT) and MSH2 samples using strict signature refitting. b Dot plot showing the contribution of each mutational signature for the WT and MSH2 samples using bootstrapped signature refitting. The color of a dot indicates the fraction of bootstrap iterations in which a signature contributed to a sample. The size indicates the mean number of contributing mutations across bootstrap iterations in which the contribution was not zero. c Bar graph depicting the cosine similarity between the original and reconstructed profiles of the WT and MSH2 samples based on signature refitting