| Literature DB >> 34882898 |
Stepanka Zverinova1, Victor Guryev1.
Abstract
The success of many clinical, association, or population genetics studies critically relies on properly performed variant calling step. The variety of modern genomics protocols, techniques, and platforms makes our choices of methods and algorithms difficult and there is no "one size fits all" solution for study design and data analysis. In this review, we discuss considerations that need to be taken into account while designing the study and preparing for the experiments. We outline the variety of variant types that can be detected using sequencing approaches and highlight some specific requirements and basic principles of their detection. Finally, we cover interesting developments that enable variant calling for a broad range of applications in the genomics field. We conclude by discussing technological and algorithmic advances that have the potential to change the ways of calling DNA variants in the nearest future.Entities:
Keywords: best practices; genome sequencing; method development; variant calling
Mesh:
Year: 2021 PMID: 34882898 PMCID: PMC9545713 DOI: 10.1002/humu.24311
Source DB: PubMed Journal: Hum Mutat ISSN: 1059-7794 Impact factor: 4.700
Figure 1Overview of experimental factors that are important for planning and performing a genome sequencing study
Selected list of tools commonly used for detection of DNA variants
| Tool | Approach, method | Application | References |
|---|---|---|---|
| Small variants | |||
| GATK Haplotypecaller | Local reassembly of haplotypes | Germline, MNPs | (Poplin, Ruano‐Rubio, et al., |
| BCFtools | Positional, pileups | Germline | (Danecek et al., |
| FreeBayes | Haplotype‐based, Bayesian model | Germline, MNPs | (Garrison & Marth, |
| GATK Mutect2 | Local reassembly | Somatic | (Cibulskis et al., |
| Strelka2 | Tiered haplotype model | Germline, somatic | (Kim et al., |
| Structural variants | |||
| Delly2 | RP, SR, RD | Germline SVs | (Rausch et al., |
| Pindel | SR, RP | Germline SVs | (Ye et al., |
| Manta | SR, RP, AS | Germline, somatic | (Chen et al., |
| GRIDSS2 | AS, SV Breakpoint | Somatic | (Cameron et al., |
| Varscan2 | RD, Circular Binary Segmentation | Exome, somatic, CNVs | (Koboldt et al., |
| EXCAVATOR2 | RD, In‐,Off‐target | Exome, CNVs | (D'Aurizio et al., |
| ExomeDepth | RD, beta‐binomial | Exome, CNVs | (Plagnol et al., |
| Other, exotic variants | |||
| Mobster | RP, clipped reads | MEIs | (Thung et al., |
| Expansion‐Hunter | Reads spanning, flanking, in‐repeat | Repeat expansion | (Dolzhenko et al., |
| sideRETRO | SR, RP at insert | GRIP | (Miller et al., |
| Harpak et al. ( | HMM model | NAGC | (Harpak et al., |
| Li et al. ( | k‐mer count, MDS | NUMT | (W. Li et al., |
Abbreviations: AS, assembly; CNV, copy‐number variants; GATK, Genome Analysis ToolKit; MEI, mobile element insertions; RD, read depth; RP, read pairing; SR, split‐read; SV, structural variants.
Figure 2Diversity of DNA variant types. (a) Variants that can be discovered by comparing reference genomes and mapped NGS reads. (b) Identification of structural variants using signatures from mapped read pairs. MEI, mobile element insertions; MNV, multi‐nucleotide variants; NRS, non‐reference sequences; SNV, single nucleotide variants; SV, structural variants
Figure 3Current developments and challenges in variant identification technologies and algorithms