| Literature DB >> 25879703 |
Aakrosh Ratan1,2, Thomas L Olson3, Thomas P Loughran4, Webb Miller5.
Abstract
BACKGROUND: The discovery and mapping of genomic variants is an essential step in most analysis done using sequencing reads. There are a number of mature software packages and associated pipelines that can identify single nucleotide polymorphisms (SNPs) with a high degree of concordance. However, the same cannot be said for tools that are used to identify the other types of variants. Indels represent the second most frequent class of variants in the human genome, after single nucleotide polymorphisms. The reliable detection of indels is still a challenging problem, especially for variants that are longer than a few bases.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25879703 PMCID: PMC4339746 DOI: 10.1186/s12859-015-0483-6
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Comparison of SAMtools, PINDEL, PRISM and indelMINER on simulated dataset with 3,723 deletions and 3,777 insertions
|
|
|
|
|
|---|---|---|---|
| SAMtools | 6,491 | 172 (2.65%) | 1181 |
| PINDEL | 7,239 | 328 (4.53%) | 589 |
| PRISM | 7,406 | 301 (4.06%) | 395 |
| indelMINER | 7,365 | 263 (3.57%) | 398 |
Figure 1Comparison of indels identified using SAMtools, PINDEL, PRISM and indelMINER drawn using VennDiagram [ 19 ].
Figure 2IGV snapshot of the alignments showing the deletion chr12:471,091-471,494 in the blood sample (top half of the plot) and the absence of the deletion in the saliva sample (bottom half of the plot).
Figure 3Overview of the indelMINER algorithm. Panel titled “Identification of candidate reads” shows three of the cases when a read is identified for realignment or paired-end analysis. (a) shows a case when mates align with the expected orientation but one of the mates is only partially aligned, (b) shows a case when one mate from a fragment aligns to a location mpos on the reference, while the other mate does not align, and (c) shows the case when both mates align with the expected relative orientation, but the outer distance constraint is violated. Panel titled “Identification of diagonal” shows the various alignments of the unaligned mate using k-mer comparisons, and the subsequent selection of one of the diagonals based on alignment score and distance from mpos. Panel “Split read alignment” shows the extension of the chosen diagonal, and the panel “Identification of indel” shows the alignment and extension of the remaining sequence from the unaligned mate, to a region around mpos selected based on a user-specified threshold.