Literature DB >> 16204131

The fragment assembly string graph.

Eugene W Myers1.   

Abstract

We present a concept and formalism, the string graph, which represents all that is inferable about a DNA sequence from a collection of shotgun sequencing reads collected from it. We give time and space efficient algorithms for constructing a string graph given the collection of overlaps between the reads and, in particular, present a novel linear expected time algorithm for transitive reduction in this context. The result demonstrates that the decomposition of reads into kmers employed in the de Bruijn graph approach described earlier is not essential, and exposes its close connection to the unitig approach we developed at Celera. This paper is a preliminary piece giving the basic algorithm and results that demonstrate the efficiency and scalability of the method. These ideas are being used to build a next-generation whole genome assembler called BOA (Berkeley Open Assembler) that will easily scale to mammalian genomes.

Entities:  

Mesh:

Year:  2005        PMID: 16204131     DOI: 10.1093/bioinformatics/bti1114

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  119 in total

1.  AN EFFICIENT ALGORITHM FOR CHINESE POSTMAN WALK ON BI-DIRECTED DE BRUIJN GRAPHS.

Authors:  Vamsi Kundeti; Sanguthevar Rajasekaran; Heiu Dinh
Journal:  Discrete Math Algorithms Appl       Date:  2010

2.  New Generations: Sequencing Machines and Their Computational Challenges.

Authors:  David C Schwartz; Michael S Waterman
Journal:  J Comput Sci Technol       Date:  2010-01-01       Impact factor: 1.571

3.  Paired de bruijn graphs: a novel approach for incorporating mate pair information into genome assemblers.

Authors:  Paul Medvedev; Son Pham; Mark Chaisson; Glenn Tesler; Pavel Pevzner
Journal:  J Comput Biol       Date:  2011-10-14       Impact factor: 1.479

4.  Assembly of large genomes using second-generation sequencing.

Authors:  Michael C Schatz; Arthur L Delcher; Steven L Salzberg
Journal:  Genome Res       Date:  2010-05-27       Impact factor: 9.043

5.  ScaffoldScaffolder: solving contig orientation via bidirected to directed graph reduction.

Authors:  Paul M Bodily; M Stanley Fujimoto; Quinn Snell; Dan Ventura; Mark J Clement
Journal:  Bioinformatics       Date:  2015-09-17       Impact factor: 6.937

6.  Short read fragment assembly of bacterial genomes.

Authors:  Mark J Chaisson; Pavel A Pevzner
Journal:  Genome Res       Date:  2007-12-14       Impact factor: 9.043

7.  Velvet: algorithms for de novo short read assembly using de Bruijn graphs.

Authors:  Daniel R Zerbino; Ewan Birney
Journal:  Genome Res       Date:  2008-03-18       Impact factor: 9.043

8.  Aligning sequences by minimum description length.

Authors:  John S Conery
Journal:  EURASIP J Bioinform Syst Biol       Date:  2007

9.  Genome assembly reborn: recent computational challenges.

Authors:  Mihai Pop
Journal:  Brief Bioinform       Date:  2009-05-29       Impact factor: 11.622

10.  De novo fragment assembly with short mate-paired reads: Does the read length matter?

Authors:  Mark J Chaisson; Dumitru Brinza; Pavel A Pevzner
Journal:  Genome Res       Date:  2008-12-03       Impact factor: 9.043

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.