Literature DB >> 29445416

A safe and complete algorithm for metagenomic assembly.

Nidia Obscura Acosta1, Veli Mäkinen1, Alexandru I Tomescu1.   

Abstract

BACKGROUND: Reconstructing the genome of a species from short fragments is one of the oldest bioinformatics problems. Metagenomic assembly is a variant of the problem asking to reconstruct the circular genomes of all bacterial species present in a sequencing sample. This problem can be naturally formulated as finding a collection of circular walks of a directed graph G that together cover all nodes, or edges, of G. APPROACH: We address this problem with the "safe and complete" framework of Tomescu and Medvedev (Research in computational Molecular biology-20th annual conference, RECOMB 9649:152-163, 2016). An algorithm is called safe if it returns only those walks (also called safe) that appear as subwalk in all metagenomic assembly solutions for G. A safe algorithm is called complete if it returns all safe walks of G.
RESULTS: We give graph-theoretic characterizations of the safe walks of G, and a safe and complete algorithm finding all safe walks of G. In the node-covering case, our algorithm runs in time [Formula: see text], and in the edge-covering case it runs in time [Formula: see text]; n and m denote the number of nodes and edges, respectively, of G. This algorithm constitutes the first theoretical tight upper bound on what can be safely assembled from metagenomic reads using this problem formulation.

Entities:  

Keywords:  Circular walk; Contig assembly; Genome assembly; Graph algorithm; Metagenomics

Year:  2018        PMID: 29445416      PMCID: PMC5802251          DOI: 10.1186/s13015-018-0122-7

Source DB:  PubMed          Journal:  Algorithms Mol Biol        ISSN: 1748-7188            Impact factor:   1.405


  32 in total

1.  Bifidobacterium animalis subsp. lactis fermented milk product reduces inflammation by altering a niche for colitogenic microbes.

Authors:  Patrick Veiga; Carey Ann Gallini; Chloé Beal; Monia Michaud; Mary L Delaney; Andrea DuBois; Artem Khlebnikov; Johan E T van Hylckama Vlieg; Shivesh Punit; Jonathan N Glickman; Andrew Onderdonk; Laurie H Glimcher; Wendy S Garrett
Journal:  Proc Natl Acad Sci U S A       Date:  2010-10-04       Impact factor: 11.205

2.  IDBA-UD: a de novo assembler for single-cell and metagenomic sequencing data with highly uneven depth.

Authors:  Yu Peng; Henry C M Leung; S M Yiu; Francis Y L Chin
Journal:  Bioinformatics       Date:  2012-04-11       Impact factor: 6.937

3.  The fragment assembly string graph.

Authors:  Eugene W Myers
Journal:  Bioinformatics       Date:  2005-09-01       Impact factor: 6.937

4.  Velvet: algorithms for de novo short read assembly using de Bruijn graphs.

Authors:  Daniel R Zerbino; Ewan Birney
Journal:  Genome Res       Date:  2008-03-18       Impact factor: 9.043

5.  Maximum likelihood genome assembly.

Authors:  Paul Medvedev; Michael Brudno
Journal:  J Comput Biol       Date:  2009-08       Impact factor: 1.479

Review 6.  The Theory and Practice of Genome Sequence Assembly.

Authors:  Jared T Simpson; Mihai Pop
Journal:  Annu Rev Genomics Hum Genet       Date:  2015-04-22       Impact factor: 8.929

7.  Efficient de novo assembly of large genomes using compressed data structures.

Authors:  Jared T Simpson; Richard Durbin
Journal:  Genome Res       Date:  2011-12-07       Impact factor: 9.043

8.  Ray Meta: scalable de novo metagenome assembly and profiling.

Authors:  Sébastien Boisvert; Frédéric Raymond; Elénie Godzaridis; François Laviolette; Jacques Corbeil
Journal:  Genome Biol       Date:  2012-12-22       Impact factor: 13.583

9.  ALLPATHS: de novo assembly of whole-genome shotgun microreads.

Authors:  Jonathan Butler; Iain MacCallum; Michael Kleber; Ilya A Shlyakhter; Matthew K Belmonte; Eric S Lander; Chad Nusbaum; David B Jaffe
Journal:  Genome Res       Date:  2008-03-13       Impact factor: 9.043

10.  MetaVelvet: an extension of Velvet assembler to de novo metagenome assembly from short sequence reads.

Authors:  Toshiaki Namiki; Tsuyoshi Hachiya; Hideaki Tanaka; Yasubumi Sakakibara
Journal:  Nucleic Acids Res       Date:  2012-07-19       Impact factor: 16.971

View more
  1 in total

1.  Precise genotyping of circular mobile elements from metagenomic data uncovers human-associated plasmids with recent common ancestors.

Authors:  Nitan Shalon; David A Relman; Eitan Yaffe
Journal:  Genome Res       Date:  2022-04-12       Impact factor: 9.438

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.