Literature DB >> 19477687

Whole genome assembly from 454 sequencing output via modified DNA graph concept.

Jacek Blazewicz1, Marcin Bryja, Marek Figlerowicz, Piotr Gawron, Marta Kasprzak, Edward Kirton, Darren Platt, Jakub Przybytek, Aleksandra Swiercz, Lukasz Szajkowski.   

Abstract

Recently, 454 Life Sciences Corporation proposed a new biochemical approach to DNA sequencing (the 454 sequencing). It is based on the pyrosequencing protocol. The 454 sequencing aims to give reliable output at a low cost and in a short time. The produced sequences are shorter than reads produced by classical methods. Our paper proposes a new DNA assembly algorithm which deals well with such data and outperforms other assembly algorithms used in practice. The constructed SR-ASM algorithm is a heuristic method based on a graph model, the graph being a modified DNA graph proposed for DNA sequencing by hybridization procedure. Other new features of the assembly algorithm are, among others, temporary compression of input sequences, and a new and fast multiple alignment heuristics taking advantage of the way the output data for the 454 sequencing are presented and coded. The usefulness of the algorithm has been proved in tests on raw data generated during sequencing of the whole 1.84Mbp genome of Prochlorococcus marinus bacteria and also on a part of chromosome 15 of Homo sapiens. The source code of SR-ASM can be downloaded from http://bio.cs.put.poznan.pl/ in the section 'Current research'--> 'DNA Assembly'. Among publicly available assemblers our algorithm appeared to generate the best results, especially in the number of produced contigs and in the lengths of the contigs with high similarity to the genome sequence.

Entities:  

Mesh:

Year:  2009        PMID: 19477687     DOI: 10.1016/j.compbiolchem.2009.04.005

Source DB:  PubMed          Journal:  Comput Biol Chem        ISSN: 1476-9271            Impact factor:   2.877


  7 in total

Review 1.  Marine genomics: at the interface of marine microbial ecology and biodiscovery.

Authors:  Karla B Heidelberg; Jack A Gilbert; Ian Joint
Journal:  Microb Biotechnol       Date:  2010-09       Impact factor: 5.813

2.  Genome-scale de novo assembly using ALGA.

Authors:  Sylwester Swat; Artur Laskowski; Jan Badura; Wojciech Frohmberg; Pawel Wojciechowski; Aleksandra Swiercz; Marta Kasprzak; Jacek Blazewicz
Journal:  Bioinformatics       Date:  2021-01-20       Impact factor: 6.937

3.  Bioinformatics and computational biology in Poland.

Authors:  Janusz M Bujnicki; Jerzy Tiuryn
Journal:  PLoS Comput Biol       Date:  2013-05-02       Impact factor: 4.475

4.  Protein alignment algorithms with an efficient backtracking routine on multiple GPUs.

Authors:  Jacek Blazewicz; Wojciech Frohmberg; Michal Kierzynka; Erwin Pesch; Pawel Wojciechowski
Journal:  BMC Bioinformatics       Date:  2011-05-20       Impact factor: 3.307

5.  Next generation sequencing reads comparison with an alignment-free distance.

Authors:  Emanuel Weitschek; Daniele Santoni; Giulia Fiscon; Maria Cristina De Cola; Paola Bertolazzi; Giovanni Felici
Journal:  BMC Res Notes       Date:  2014-12-03

6.  Study of biological networks using graph theory.

Authors:  Wei Gao; Hualong Wu; Muhammad Kamran Siddiqui; Abdul Qudair Baig
Journal:  Saudi J Biol Sci       Date:  2017-11-14       Impact factor: 4.219

7.  GRASShopPER-An algorithm for de novo assembly based on GPU alignments.

Authors:  Aleksandra Swiercz; Wojciech Frohmberg; Michal Kierzynka; Pawel Wojciechowski; Piotr Zurkowski; Jan Badura; Artur Laskowski; Marta Kasprzak; Jacek Blazewicz
Journal:  PLoS One       Date:  2018-08-16       Impact factor: 3.240

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.