Literature DB >> 25979471

Repeat- and error-aware comparison of deletions.

Roland Wittler1, Tobias Marschall1, Alexander Schönhuth1, Veli Mäkinen1.   

Abstract

MOTIVATION: The number of reported genetic variants is rapidly growing, empowered by ever faster accumulation of next-generation sequencing data. A major issue is comparability. Standards that address the combined problem of inaccurately predicted breakpoints and repeat-induced ambiguities are missing. This decisively lowers the quality of 'consensus' callsets and hampers the removal of duplicate entries in variant databases, which can have deleterious effects in downstream analyses.
RESULTS: We introduce a sound framework for comparison of deletions that captures both tool-induced inaccuracies and repeat-induced ambiguities. We present a maximum matching algorithm that outputs virtual duplicates among two sets of predictions/annotations. We demonstrate that our approach is clearly superior over ad hoc criteria, like overlap, and that it can reduce the redundancy among callsets substantially. We also identify large amounts of duplicate entries in the Database of Genomic Variants, which points out the immediate relevance of our approach.
AVAILABILITY AND IMPLEMENTATION: Implementation is open source and available from https://bitbucket.org/readdi/readdi CONTACT: roland.wittler@uni-bielefeld.de or t.marschall@mpi-inf.mpg.de SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

Mesh:

Year:  2015        PMID: 25979471     DOI: 10.1093/bioinformatics/btv304

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  6 in total

1.  Jointly aligning a group of DNA reads improves accuracy of identifying large deletions.

Authors:  Anish M S Shrestha; Martin C Frith; Kiyoshi Asai; Hugues Richard
Journal:  Nucleic Acids Res       Date:  2018-02-16       Impact factor: 16.971

2.  Varlociraptor: enhancing sensitivity and controlling false discovery rate in somatic indel discovery.

Authors:  Johannes Köster; Louis J Dijkstra; Tobias Marschall; Alexander Schönhuth
Journal:  Genome Biol       Date:  2020-04-28       Impact factor: 13.583

3.  Tools for annotation and comparison of structural variation.

Authors:  Fritz J Sedlazeck; Andi Dhroso; Dale L Bodian; Justin Paschall; Farrah Hermes; Justin M Zook
Journal:  F1000Res       Date:  2017-10-03

Review 4.  Structural variant detection in cancer genomes: computational challenges and perspectives for precision oncology.

Authors:  Ianthe A E M van Belzen; Alexander Schönhuth; Patrick Kemmeren; Jayne Y Hehir-Kwa
Journal:  NPJ Precis Oncol       Date:  2021-03-02

5.  UPS-indel: a Universal Positioning System for Indels.

Authors:  Mohammad Shabbir Hasan; Xiaowei Wu; Layne T Watson; Liqing Zhang
Journal:  Sci Rep       Date:  2017-10-26       Impact factor: 4.379

6.  Towards pan-genome read alignment to improve variation calling.

Authors:  Daniel Valenzuela; Tuukka Norri; Niko Välimäki; Esa Pitkänen; Veli Mäkinen
Journal:  BMC Genomics       Date:  2018-05-09       Impact factor: 3.969

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.