Literature DB >> 18340039

ALLPATHS: de novo assembly of whole-genome shotgun microreads.

Jonathan Butler1, Iain MacCallum, Michael Kleber, Ilya A Shlyakhter, Matthew K Belmonte, Eric S Lander, Chad Nusbaum, David B Jaffe.   

Abstract

New DNA sequencing technologies deliver data at dramatically lower costs but demand new analytical methods to take full advantage of the very short reads that they produce. We provide an initial, theoretical solution to the challenge of de novo assembly from whole-genome shotgun "microreads." For 11 genomes of sizes up to 39 Mb, we generated high-quality assemblies from 80x coverage by paired 30-base simulated reads modeled after real Illumina-Solexa reads. The bacterial genomes of Campylobacter jejuni and Escherichia coli assemble optimally, yielding single perfect contigs, and larger genomes yield assemblies that are highly connected and accurate. Assemblies are presented in a graph form that retains intrinsic ambiguities such as those arising from polymorphism, thereby providing information that has been absent from previous genome assemblies. For both C. jejuni and E. coli, this assembly graph is a single edge encompassing the entire genome. Larger genomes produce more complicated graphs, but the vast majority of the bases in their assemblies are present in long edges that are nearly always perfect. We describe a general method for genome assembly that can be applied to all types of DNA sequence data, not only short read data, but also conventional sequence reads.

Entities:  

Mesh:

Year:  2008        PMID: 18340039      PMCID: PMC2336810          DOI: 10.1101/gr.7337908

Source DB:  PubMed          Journal:  Genome Res        ISSN: 1088-9051            Impact factor:   9.043


  10 in total

1.  An Eulerian path approach to DNA fragment assembly.

Authors:  P A Pevzner; H Tang; M S Waterman
Journal:  Proc Natl Acad Sci U S A       Date:  2001-08-14       Impact factor: 11.205

2.  Accurate multiplex polony sequencing of an evolved bacterial genome.

Authors:  Jay Shendure; Gregory J Porreca; Nikos B Reppas; Xiaoxia Lin; John P McCutcheon; Abraham M Rosenbaum; Michael D Wang; Kun Zhang; Robi D Mitra; George M Church
Journal:  Science       Date:  2005-08-04       Impact factor: 47.728

3.  SHARCGS, a fast and highly accurate short-read assembly algorithm for de novo genomic sequencing.

Authors:  Juliane C Dohm; Claudio Lottaz; Tatiana Borodina; Heinz Himmelbauer
Journal:  Genome Res       Date:  2007-10-01       Impact factor: 9.043

4.  Extending assembly of short DNA sequences to handle error.

Authors:  William R Jeck; Josephine A Reinhardt; David A Baltrus; Matthew T Hickenbotham; Vincent Magrini; Elaine R Mardis; Jeffery L Dangl; Corbin D Jones
Journal:  Bioinformatics       Date:  2007-09-24       Impact factor: 6.937

5.  A map of human genome sequence variation containing 1.42 million single nucleotide polymorphisms.

Authors:  R Sachidanandam; D Weissman; S C Schmidt; J M Kakol; L D Stein; G Marth; S Sherry; J C Mullikin; B J Mortimore; D L Willey; S E Hunt; C G Cole; P C Coggill; C M Rice; Z Ning; J Rogers; D R Bentley; P Y Kwok; E R Mardis; R T Yeh; B Schultz; L Cook; R Davenport; M Dante; L Fulton; L Hillier; R H Waterston; J D McPherson; B Gilman; S Schaffner; W J Van Etten; D Reich; J Higgins; M J Daly; B Blumenstiel; J Baldwin; N Stange-Thomann; M C Zody; L Linton; E S Lander; D Altshuler
Journal:  Nature       Date:  2001-02-15       Impact factor: 49.962

6.  ARACHNE: a whole-genome shotgun assembler.

Authors:  Serafim Batzoglou; David B Jaffe; Ken Stanley; Jonathan Butler; Sante Gnerre; Evan Mauceli; Bonnie Berger; Jill P Mesirov; Eric S Lander
Journal:  Genome Res       Date:  2002-01       Impact factor: 9.043

7.  DNA sequencing with chain-terminating inhibitors.

Authors:  F Sanger; S Nicklen; A R Coulson
Journal:  Proc Natl Acad Sci U S A       Date:  1977-12       Impact factor: 11.205

8.  Genome-wide mapping of in vivo protein-DNA interactions.

Authors:  David S Johnson; Ali Mortazavi; Richard M Myers; Barbara Wold
Journal:  Science       Date:  2007-05-31       Impact factor: 47.728

9.  Genome-wide maps of chromatin state in pluripotent and lineage-committed cells.

Authors:  Tarjei S Mikkelsen; Manching Ku; David B Jaffe; Biju Issac; Erez Lieberman; Georgia Giannoukos; Pablo Alvarez; William Brockman; Tae-Kyung Kim; Richard P Koche; William Lee; Eric Mendenhall; Aisling O'Donovan; Aviva Presser; Carsten Russ; Xiaohui Xie; Alexander Meissner; Marius Wernig; Rudolf Jaenisch; Chad Nusbaum; Eric S Lander; Bradley E Bernstein
Journal:  Nature       Date:  2007-07-01       Impact factor: 49.962

10.  Assembling millions of short DNA sequences using SSAKE.

Authors:  René L Warren; Granger G Sutton; Steven J M Jones; Robert A Holt
Journal:  Bioinformatics       Date:  2006-12-08       Impact factor: 6.937

  10 in total
  356 in total

1.  Transcriptome analysis of the roots at early and late seedling stages using Illumina paired-end sequencing and development of EST-SSR markers in radish.

Authors:  Shufen Wang; Xiufeng Wang; Qiwei He; Xianxian Liu; Wenling Xu; Libin Li; Jianwei Gao; Fengde Wang
Journal:  Plant Cell Rep       Date:  2012-04-04       Impact factor: 4.570

2.  SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing.

Authors:  Anton Bankevich; Sergey Nurk; Dmitry Antipov; Alexey A Gurevich; Mikhail Dvorkin; Alexander S Kulikov; Valery M Lesin; Sergey I Nikolenko; Son Pham; Andrey D Prjibelski; Alexey V Pyshkin; Alexander V Sirotkin; Nikolay Vyahhi; Glenn Tesler; Max A Alekseyev; Pavel A Pevzner
Journal:  J Comput Biol       Date:  2012-04-16       Impact factor: 1.479

3.  Paired de bruijn graphs: a novel approach for incorporating mate pair information into genome assemblers.

Authors:  Paul Medvedev; Son Pham; Mark Chaisson; Glenn Tesler; Pavel Pevzner
Journal:  J Comput Biol       Date:  2011-10-14       Impact factor: 1.479

4.  Using the Velvet de novo assembler for short-read sequencing technologies.

Authors:  Daniel R Zerbino
Journal:  Curr Protoc Bioinformatics       Date:  2010-09

5.  Optimization of de novo transcriptome assembly from next-generation sequencing data.

Authors:  Yann Surget-Groba; Juan I Montoya-Burgos
Journal:  Genome Res       Date:  2010-08-06       Impact factor: 9.043

6.  Assembly of large genomes using second-generation sequencing.

Authors:  Michael C Schatz; Arthur L Delcher; Steven L Salzberg
Journal:  Genome Res       Date:  2010-05-27       Impact factor: 9.043

7.  Ray: simultaneous assembly of reads from a mix of high-throughput sequencing technologies.

Authors:  Sébastien Boisvert; François Laviolette; Jacques Corbeil
Journal:  J Comput Biol       Date:  2010-10-20       Impact factor: 1.479

Review 8.  Next-generation sequencing techniques for eukaryotic microorganisms: sequencing-based solutions to biological problems.

Authors:  Minou Nowrousian
Journal:  Eukaryot Cell       Date:  2010-07-02

Review 9.  Detecting structural variations in the human genome using next generation sequencing.

Authors:  Ruibin Xi; Tae-Min Kim; Peter J Park
Journal:  Brief Funct Genomics       Date:  2011-01-06       Impact factor: 4.241

10.  Metagenomics: Facts and Artifacts, and Computational Challenges*

Authors:  John C Wooley; Yuzhen Ye
Journal:  J Comput Sci Technol       Date:  2009-01       Impact factor: 1.571

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.