| Literature DB >> 35822012 |
August Yue Huang1, Eunjung Alice Lee1.
Abstract
Somatic mutations are DNA variants that occur after the fertilization of zygotes and accumulate during the developmental and aging processes in the human lifespan. Somatic mutations have long been known to cause cancer, and more recently have been implicated in a variety of non-cancer diseases. The patterns of somatic mutations, or mutational signatures, also shed light on the underlying mechanisms of the mutational process. Advances in next-generation sequencing over the decades have enabled genome-wide profiling of DNA variants in a high-throughput manner; however, unlike germline mutations, somatic mutations are carried only by a subset of the cell population. Thus, sensitive bioinformatic methods are required to distinguish mutant alleles from sequencing and base calling errors in bulk tissue samples. An alternative way to study somatic mutations, especially those present in an extremely small number of cells or even in a single cell, is to sequence single-cell genomes after whole-genome amplification (WGA); however, it is critical and technically challenging to exclude numerous technical artifacts arising during error-prone and uneven genome amplification in current WGA methods. To address these challenges, multiple bioinformatic tools have been developed. In this review, we summarize the latest progress in methods for identification of somatic mutations and the challenges that remain to be addressed in the future.Entities:
Keywords: bioinformatic tool; bulk sequencing; single-cell sequencing; single-nucleotide variant; somatic mutation
Year: 2022 PMID: 35822012 PMCID: PMC9261417 DOI: 10.3389/fragi.2021.800380
Source DB: PubMed Journal: Front Aging ISSN: 2673-6217
FIGURE 1Occurrence of somatic mutations and their identification in next-generation sequencing data. (A) Somatic mutations that occur postzygotically after fertilization. Mutations arising during embryogenesis or under clonal expansion (green and blue) are shared in a fraction of the cell population, whereas mutations accumulating during the aging process (purple) may only be present in a single cell. (B) Identification of somatic mutations using bulk or single-cell sequencing. Bulk sequencing is suitable for detecting somatic mutations shared across multiple cells, though mutations with low allele fractions are difficult to distinguish from sequencing errors. Private somatic mutations can be detected with single-cell sequencing, but the whole-genome amplification before sequencing may introduce additional artifacts resulting from amplification errors.
A selected list of tools for somatic mutation calling.
| Tool | Reference | Sequencing type | Detectable mutation type | Optimized for non-cancer data | Built-in genotyper | Matched control required | Base-quality-aware in genotyper | Joint analysis of multiple samples |
|---|---|---|---|---|---|---|---|---|
| Strelka |
| bulk DNA | Shared | No | Yes | Yes | No | Yes, with matched control |
| VarScan2 |
| bulk DNA | Shared | No | Yes | Yes | No | Yes, with matched control |
| JointSNVMix |
| bulk DNA | Shared | No | Yes | Yes | Yes | Yes, with matched control |
| MuTect |
| bulk DNA | Shared | No | Yes | Yes | Yes | Yes, with matched control |
| MosaicHunter |
| bulk DNA | Shared | Yes | Yes | No | Yes | Yes, with matched control or parents |
| SomVarIUS |
| bulk DNA | Shared | No | Yes | No | Yes | No |
| EM-mosaic |
| bulk DNA | Shared | Yes | Yes | No | No | No |
| MosaicForecast |
| bulk DNA | Shared | Yes | No | No | NA | No |
| Shearwater |
| bulk DNA, ultra-deep | Shared | No | Yes | Yes | No | No |
| RareVar |
| bulk DNA, ultra-deep | Shared | No | Yes | No | No | No |
| RePlow |
| bulk DNA, ultra-deep | Shared | Yes | Yes | No | Yes | No |
| Monovar |
| single-cell DNA | Shared and private | No | Yes | Yes | No | Yes, with other single cells |
| SCcaller |
| single-cell DNA | Shared and private | Yes | Yes | Yes | Yes | No |
| LiRA |
| single-cell DNA | Shared and private | Yes | No | Yes | NA | No |
| Conbase |
| single-cell DNA | Shared | Yes | Yes | Yes | No | Yes, with other single cells |
| SCAN-SNV |
| single-cell DNA | Shared and private | Yes | No | Yes | NA | No |
| single-cell MosaicHunter |
| single-cell DNA | Shared | Yes | Yes | No | Yes | Yes, with bulk or other single cells |
| RNA-MuTect |
| bulk RNA | Shared | Yes | Yes | Yes | Yes | Yes, with matched DNA |
| SCmut |
| single-cell RNA | Shared | No | No | Yes | NA | Yes, with matched DNA |
Later versions of MuTect, with dramatic improvement from the method described in the original paper, allow somatic mutation calling in non-cancer samples and without matched control.