Literature DB >> 29182778

Jointly aligning a group of DNA reads improves accuracy of identifying large deletions.

Anish M S Shrestha1, Martin C Frith1,2,3, Kiyoshi Asai1,2, Hugues Richard4.   

Abstract

Performing sequence alignment to identify structural variants, such as large deletions, from genome sequencing data is a fundamental task, but current methods are far from perfect. The current practice is to independently align each DNA read to a reference genome. We show that the propensity of genomic rearrangements to accumulate in repeat-rich regions imposes severe ambiguities in these alignments, and consequently on the variant calls-with current read lengths, this affects more than one third of known large deletions in the C. Venter genome. We present a method to jointly align reads to a genome, whereby alignment ambiguity of one read can be disambiguated by other reads. We show this leads to a significant improvement in the accuracy of identifying large deletions (≥20 bases), while imposing minimal computational overhead and maintaining an overall running time that is at par with current tools. A software implementation is available as an open-source Python program called JRA at https://bitbucket.org/jointreadalignment/jra-src.

Entities:  

Mesh:

Substances:

Year:  2018        PMID: 29182778      PMCID: PMC5815140          DOI: 10.1093/nar/gkx1175

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  23 in total

1.  Resolving Conflicting Predictions from Multimapping Reads.

Authors:  Stefan Canzar; Khaled Elbassioni; Mitchell Jones; Julián Mestre
Journal:  J Comput Biol       Date:  2016-01-08       Impact factor: 1.479

Review 2.  The quest for orthologs: finding the corresponding gene across genomes.

Authors:  Arnold Kuzniar; Roeland C H J van Ham; Sándor Pongor; Jack A M Leunissen
Journal:  Trends Genet       Date:  2008-09-24       Impact factor: 11.639

Review 3.  Computational methods for discovering structural variation with next-generation sequencing.

Authors:  Paul Medvedev; Monica Stanciu; Michael Brudno
Journal:  Nat Methods       Date:  2009-11       Impact factor: 28.547

4.  CLEVER: clique-enumerating variant finder.

Authors:  Tobias Marschall; Ivan G Costa; Stefan Canzar; Markus Bauer; Gunnar W Klau; Alexander Schliep; Alexander Schönhuth
Journal:  Bioinformatics       Date:  2012-10-11       Impact factor: 6.937

Review 5.  Repetitive DNA and next-generation sequencing: computational challenges and solutions.

Authors:  Todd J Treangen; Steven L Salzberg
Journal:  Nat Rev Genet       Date:  2011-11-29       Impact factor: 53.242

Review 6.  Genotype and SNP calling from next-generation sequencing data.

Authors:  Rasmus Nielsen; Joshua S Paul; Anders Albrechtsen; Yun S Song
Journal:  Nat Rev Genet       Date:  2011-06       Impact factor: 53.242

7.  AGE: defining breakpoints of genomic structural variants at single-nucleotide resolution, through optimal alignments with gap excision.

Authors:  Alexej Abyzov; Mark Gerstein
Journal:  Bioinformatics       Date:  2011-01-13       Impact factor: 6.937

8.  Improved variant discovery through local re-alignment of short-read next-generation sequencing data using SRMA.

Authors:  Nils Homer; Stanley F Nelson
Journal:  Genome Biol       Date:  2010-10-08       Impact factor: 13.583

9.  The diploid genome sequence of an individual human.

Authors:  Samuel Levy; Granger Sutton; Pauline C Ng; Lars Feuk; Aaron L Halpern; Brian P Walenz; Nelson Axelrod; Jiaqi Huang; Ewen F Kirkness; Gennady Denisov; Yuan Lin; Jeffrey R MacDonald; Andy Wing Chun Pang; Mary Shago; Timothy B Stockwell; Alexia Tsiamouri; Vineet Bafna; Vikas Bansal; Saul A Kravitz; Dana A Busam; Karen Y Beeson; Tina C McIntosh; Karin A Remington; Josep F Abril; John Gill; Jon Borman; Yu-Hui Rogers; Marvin E Frazier; Stephen W Scherer; Robert L Strausberg; J Craig Venter
Journal:  PLoS Biol       Date:  2007-09-04       Impact factor: 8.029

10.  Split-alignment of genomes finds orthologies more accurately.

Authors:  Martin C Frith; Risa Kawaguchi
Journal:  Genome Biol       Date:  2015-05-21       Impact factor: 13.583

View more
  1 in total

1.  Tool evaluation for the detection of variably sized indels from next generation whole genome and targeted sequencing data.

Authors:  Ning Wang; Vladislav Lysenkov; Katri Orte; Veli Kairisto; Juhani Aakko; Sofia Khan; Laura L Elo
Journal:  PLoS Comput Biol       Date:  2022-02-17       Impact factor: 4.475

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.