| Literature DB >> 31494671 |
Varuna Chander1, Richard A Gibbs1, Fritz J Sedlazeck1.
Abstract
BACKGROUND: Structural variation (SV) plays a pivotal role in genetic disease. The discovery of SVs based on short DNA sequence reads from next-generation DNA sequence methods is error-prone, with low sensitivity and high false discovery rates. These shortcomings can be partially overcome with extensive orthogonal validation methods or use of long reads, but the current cost precludes their application for routine clinical diagnostics. In contrast, SV genotyping of known sites of SV occurrence is relatively robust and therefore offers a cost-effective clinical diagnostic tool with potentially few false-positive and false-negative results, even when applied to short-read DNA sequence data.Entities:
Keywords: clinical diagnosis; genotyping; next-generation sequencing; structural variations
Mesh:
Year: 2019 PMID: 31494671 PMCID: PMC6732172 DOI: 10.1093/gigascience/giz110
Source DB: PubMed Journal: Gigascience ISSN: 2047-217X Impact factor: 6.524
Overview of the SV genotypers and their ability to assess different SV types
| Genotyper | Approach | SV type | Inputs | Dependencies | ||||
|---|---|---|---|---|---|---|---|---|
| Deletion | Insertion | Inversion | Duplication | Translocation/BND | ||||
|
| RD, PR, SR |
|
|
|
| BAM, VCF, Ref | Bcftools [ | |
|
| SR, PR |
|
|
|
| BAM, VCF, Ref | ||
|
| RD, PR, SR |
|
| BAM, SNV VCF, VCF, Ref, PED file | ||||
|
| PR, SR |
|
| BAM compressed, PED file, VCF, Ref | Excord, Giggle [ | |||
|
| RD, PR, SR |
|
| BAM, VCF, Ref | GATK [ | |||
: works on a standardized VCF file.
: marks dependencies on specialized tags in the VCF files. PR: paired-end reads; RD: read depth; SR: split reads.
Figure 1:Evaluation of Illumina-like reads to assess the SV genotyper ability to re-identify insertions, deletions, duplications, and inversions over different size ranges (x-axis). The colors indicate the SVs being detected/genotyped by the respective SV genotypers. They were classified as either precisely, indicated, not detected, or falsely identified (see Methods). For the SVs genotyped on the basis of SV calls (left) we used SURVIVOR, which is a union set of Delly, Lumpy, and Manta, to generate the VCF file as an input for the SV genotypers. Noteworthy, Delly and SVtyper can genotype more SVs, given the custom information from their respective callers, Delly and SVTyper, respectively. When the truth SV set is provided as a start point (right panel) we see marginal improvements across the SV genotyping methods while maintaining the overall trend.
Figure 2:Evaluation based on GIAB call set v0.5.0 deletions only.