Literature DB >> 19056694

De novo fragment assembly with short mate-paired reads: Does the read length matter?

Mark J Chaisson1, Dumitru Brinza, Pavel A Pevzner.   

Abstract

Increasing read length is currently viewed as the crucial condition for fragment assembly with next-generation sequencing technologies. However, introducing mate-paired reads (separated by a gap of length, GapLength) opens a possibility to transform short mate-pairs into long mate-reads of length approximately GapLength, and thus raises the question as to whether the read length (as opposed to GapLength) even matters. We describe a new tool, EULER-USR, for assembling mate-paired short reads and use it to analyze the question of whether the read length matters. We further complement the ongoing experimental efforts to maximize read length by a new computational approach for increasing the effective read length. While the common practice is to trim the error-prone tails of the reads, we present an approach that substitutes trimming with error correction using repeat graphs. An important and counterintuitive implication of this result is that one may extend sequencing reactions that degrade with length "past their prime" to where the error rate grows above what is normally acceptable for fragment assembly.

Mesh:

Substances:

Year:  2008        PMID: 19056694      PMCID: PMC2652199          DOI: 10.1101/gr.079053.108

Source DB:  PubMed          Journal:  Genome Res        ISSN: 1088-9051            Impact factor:   9.043


  25 in total

1.  Fragment assembly with double-barreled data.

Authors:  P A Pevzner; H Tang
Journal:  Bioinformatics       Date:  2001       Impact factor: 6.937

2.  Correcting errors in shotgun sequences.

Authors:  Martti T Tammi; Erik Arner; Ellen Kindlund; Björn Andersson
Journal:  Nucleic Acids Res       Date:  2003-08-01       Impact factor: 16.971

3.  PCAP: a whole-genome assembly program.

Authors:  Xiaoqiu Huang; Jianmin Wang; Srinivas Aluru; Shiaw-Pyng Yang; LaDeana Hillier
Journal:  Genome Res       Date:  2003-09       Impact factor: 9.043

4.  De novo repeat classification and fragment assembly.

Authors:  Pavel A Pevzner; Paul A Pevzner; Haixu Tang; Glenn Tesler
Journal:  Genome Res       Date:  2004-09       Impact factor: 9.043

5.  Fragment assembly with short reads.

Authors:  Mark Chaisson; Pavel Pevzner; Haixu Tang
Journal:  Bioinformatics       Date:  2004-04-01       Impact factor: 6.937

Review 6.  1-Tuple DNA sequencing: computer analysis.

Authors:  P A Pevzner
Journal:  J Biomol Struct Dyn       Date:  1989-08

7.  Velvet: algorithms for de novo short read assembly using de Bruijn graphs.

Authors:  Daniel R Zerbino; Ewan Birney
Journal:  Genome Res       Date:  2008-03-18       Impact factor: 9.043

8.  Human whole-genome shotgun sequencing.

Authors:  J L Weber; E W Myers
Journal:  Genome Res       Date:  1997-05       Impact factor: 9.043

9.  A new algorithm for DNA sequence assembly.

Authors:  R M Idury; M S Waterman
Journal:  J Comput Biol       Date:  1995       Impact factor: 1.479

10.  Whole-genome sequence assembly for mammalian genomes: Arachne 2.

Authors:  David B Jaffe; Jonathan Butler; Sante Gnerre; Evan Mauceli; Kerstin Lindblad-Toh; Jill P Mesirov; Michael C Zody; Eric S Lander
Journal:  Genome Res       Date:  2003-01       Impact factor: 9.043

View more
  102 in total

1.  SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing.

Authors:  Anton Bankevich; Sergey Nurk; Dmitry Antipov; Alexey A Gurevich; Mikhail Dvorkin; Alexander S Kulikov; Valery M Lesin; Sergey I Nikolenko; Son Pham; Andrey D Prjibelski; Alexey V Pyshkin; Alexander V Sirotkin; Nikolay Vyahhi; Glenn Tesler; Max A Alekseyev; Pavel A Pevzner
Journal:  J Comput Biol       Date:  2012-04-16       Impact factor: 1.479

2.  Paired de bruijn graphs: a novel approach for incorporating mate pair information into genome assemblers.

Authors:  Paul Medvedev; Son Pham; Mark Chaisson; Glenn Tesler; Pavel Pevzner
Journal:  J Comput Biol       Date:  2011-10-14       Impact factor: 1.479

3.  Family-based association studies for next-generation sequencing.

Authors:  Yun Zhu; Momiao Xiong
Journal:  Am J Hum Genet       Date:  2012-06-08       Impact factor: 11.025

4.  Using the Velvet de novo assembler for short-read sequencing technologies.

Authors:  Daniel R Zerbino
Journal:  Curr Protoc Bioinformatics       Date:  2010-09

5.  Ray: simultaneous assembly of reads from a mix of high-throughput sequencing technologies.

Authors:  Sébastien Boisvert; François Laviolette; Jacques Corbeil
Journal:  J Comput Biol       Date:  2010-10-20       Impact factor: 1.479

6.  Biofuels from algae: challenges and potential.

Authors:  Michael Hannon; Javier Gimpel; Miller Tran; Beth Rasala; Stephen Mayfield
Journal:  Biofuels       Date:  2010-09       Impact factor: 2.956

7.  DNA phosphorothioation is widespread and quantized in bacterial genomes.

Authors:  Lianrong Wang; Shi Chen; Kevin L Vergin; Stephen J Giovannoni; Simon W Chan; Michael S DeMott; Koli Taghizadeh; Otto X Cordero; Michael Cutler; Sonia Timberlake; Eric J Alm; Martin F Polz; Jarone Pinhassi; Zixin Deng; Peter C Dedon
Journal:  Proc Natl Acad Sci U S A       Date:  2011-02-01       Impact factor: 11.205

8.  Genome assembly reborn: recent computational challenges.

Authors:  Mihai Pop
Journal:  Brief Bioinform       Date:  2009-05-29       Impact factor: 11.622

9.  Exploring single-sample SNP and INDEL calling with whole-genome de novo assembly.

Authors:  Heng Li
Journal:  Bioinformatics       Date:  2012-05-07       Impact factor: 6.937

10.  Assembler for de novo assembly of large genomes.

Authors:  Te-Chin Chu; Chen-Hua Lu; Tsunglin Liu; Greg C Lee; Wen-Hsiung Li; Arthur Chun-Chieh Shih
Journal:  Proc Natl Acad Sci U S A       Date:  2013-08-21       Impact factor: 11.205

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.