Literature DB >> 19269990

A consistency-based consensus algorithm for de novo and reference-guided sequence assembly of short reads.

Tobias Rausch1, Sergey Koren, Gennady Denisov, David Weese, Anne-Katrin Emde, Andreas Döring, Knut Reinert.   

Abstract

MOTIVATION: Novel high-throughput sequencing technologies pose new algorithmic challenges in handling massive amounts of short-read, high-coverage data. A robust and versatile consensus tool is of particular interest for such data since a sound multi-read alignment is a prerequisite for variation analyses, accurate genome assemblies and insert sequencing.
RESULTS: A multi-read alignment algorithm for de novo or reference-guided genome assembly is presented. The program identifies segments shared by multiple reads and then aligns these segments using a consistency-enhanced alignment graph. On real de novo sequencing data obtained from the newly established NCBI Short Read Archive, the program performs similarly in quality to other comparable programs. On more challenging simulated datasets for insert sequencing and variation analyses, our program outperforms the other tools. AVAILABILITY: The consensus program can be downloaded from http://www.seqan.de/projects/consensus.html. It can be used stand-alone or in conjunction with the Celera Assembler. Both application scenarios as well as the usage of the tool are described in the documentation.

Mesh:

Year:  2009        PMID: 19269990      PMCID: PMC2732307          DOI: 10.1093/bioinformatics/btp131

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  24 in total

1.  Fast algorithms for large-scale genome alignment and comparison.

Authors:  Arthur L Delcher; Adam Phillippy; Jane Carlton; Steven L Salzberg
Journal:  Nucleic Acids Res       Date:  2002-06-01       Impact factor: 16.971

2.  MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform.

Authors:  Kazutaka Katoh; Kazuharu Misawa; Kei-ichi Kuma; Takashi Miyata
Journal:  Nucleic Acids Res       Date:  2002-07-15       Impact factor: 16.971

3.  PCAP: a whole-genome assembly program.

Authors:  Xiaoqiu Huang; Jianmin Wang; Srinivas Aluru; Shiaw-Pyng Yang; LaDeana Hillier
Journal:  Genome Res       Date:  2003-09       Impact factor: 9.043

4.  Comparative genome assembly.

Authors:  Mihai Pop; Adam Phillippy; Arthur L Delcher; Steven L Salzberg
Journal:  Brief Bioinform       Date:  2004-09       Impact factor: 11.622

5.  The Atlas genome assembly system.

Authors:  Paul Havlak; Rui Chen; K James Durbin; Amy Egan; Yanru Ren; Xing-Zhi Song; George M Weinstock; Richard A Gibbs
Journal:  Genome Res       Date:  2004-04       Impact factor: 9.043

6.  MUSCLE: multiple sequence alignment with high accuracy and high throughput.

Authors:  Robert C Edgar
Journal:  Nucleic Acids Res       Date:  2004-03-19       Impact factor: 16.971

7.  A whole-genome assembly of Drosophila.

Authors:  E W Myers; G G Sutton; A L Delcher; I M Dew; D P Fasulo; M J Flanigan; S A Kravitz; C M Mobarry; K H Reinert; K A Remington; E L Anson; R A Bolanos; H H Chou; C M Jordan; A L Halpern; S Lonardi; E M Beasley; R C Brandon; L Chen; P J Dunn; Z Lai; Y Liang; D R Nusskern; M Zhan; Q Zhang; X Zheng; G M Rubin; M D Adams; J C Venter
Journal:  Science       Date:  2000-03-24       Impact factor: 47.728

8.  ARACHNE: a whole-genome shotgun assembler.

Authors:  Serafim Batzoglou; David B Jaffe; Ken Stanley; Jonathan Butler; Sante Gnerre; Evan Mauceli; Bonnie Berger; Jill P Mesirov; Eric S Lander
Journal:  Genome Res       Date:  2002-01       Impact factor: 9.043

9.  Versatile and open software for comparing large genomes.

Authors:  Stefan Kurtz; Adam Phillippy; Arthur L Delcher; Michael Smoot; Martin Shumway; Corina Antonescu; Steven L Salzberg
Journal:  Genome Biol       Date:  2004-01-30       Impact factor: 13.583

10.  The phusion assembler.

Authors:  James C Mullikin; Zemin Ning
Journal:  Genome Res       Date:  2003-01       Impact factor: 9.043

View more
  9 in total

1.  Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data.

Authors:  Chen-Shan Chin; David H Alexander; Patrick Marks; Aaron A Klammer; James Drake; Cheryl Heiner; Alicia Clum; Alex Copeland; John Huddleston; Evan E Eichler; Stephen W Turner; Jonas Korlach
Journal:  Nat Methods       Date:  2013-05-05       Impact factor: 28.547

2.  Detection and correction of false segmental duplications caused by genome mis-assembly.

Authors:  David R Kelley; Steven L Salzberg
Journal:  Genome Biol       Date:  2010-03-10       Impact factor: 13.583

3.  LOCAS--a low coverage assembly tool for resequencing projects.

Authors:  Juliane D Klein; Stephan Ossowski; Korbinian Schneeberger; Detlef Weigel; Daniel H Huson
Journal:  PLoS One       Date:  2011-08-15       Impact factor: 3.240

4.  A scalable and accurate targeted gene assembly tool (SAT-Assembler) for next-generation sequencing data.

Authors:  Yuan Zhang; Yanni Sun; James R Cole
Journal:  PLoS Comput Biol       Date:  2014-08-14       Impact factor: 4.475

5.  aTRAM - automated target restricted assembly method: a fast method for assembling loci across divergent taxa from next-generation sequencing data.

Authors:  Julie M Allen; Daisie I Huang; Quentin C Cronk; Kevin P Johnson
Journal:  BMC Bioinformatics       Date:  2015-03-25       Impact factor: 3.169

6.  Sparc: a sparsity-based consensus algorithm for long erroneous sequencing reads.

Authors:  Chengxi Ye; Zhanshan Sam Ma
Journal:  PeerJ       Date:  2016-06-08       Impact factor: 2.984

7.  CLAME: a new alignment-based binning algorithm allows the genomic description of a novel Xanthomonadaceae from the Colombian Andes.

Authors:  Andres Benavides; Juan Pablo Isaza; Juan Pablo Niño-García; Juan Fernando Alzate; Felipe Cabarcas
Journal:  BMC Genomics       Date:  2018-12-11       Impact factor: 3.969

8.  PRICE: software for the targeted assembly of components of (Meta) genomic sequence data.

Authors:  J Graham Ruby; Priya Bellare; Joseph L Derisi
Journal:  G3 (Bethesda)       Date:  2013-05-20       Impact factor: 3.154

9.  SHEAR: sample heterogeneity estimation and assembly by reference.

Authors:  Sean R Landman; Tae Hyun Hwang; Kevin A T Silverstein; Yingming Li; Scott M Dehm; Michael Steinbach; Vipin Kumar
Journal:  BMC Genomics       Date:  2014-01-29       Impact factor: 3.969

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.