Literature DB >> 24058058

Exploring variation-aware contig graphs for (comparative) metagenomics using MaryGold.

Jurgen F Nijkamp1, Mihai Pop, Marcel J T Reinders, Dick de Ridder.   

Abstract

MOTIVATION: Although many tools are available to study variation and its impact in single genomes, there is a lack of algorithms for finding such variation in metagenomes. This hampers the interpretation of metagenomics sequencing datasets, which are increasingly acquired in research on the (human) microbiome, in environmental studies and in the study of processes in the production of foods and beverages. Existing algorithms often depend on the use of reference genomes, which pose a problem when a metagenome of a priori unknown strain composition is studied. In this article, we develop a method to perform reference-free detection and visual exploration of genomic variation, both within a single metagenome and between metagenomes.
RESULTS: We present the MaryGold algorithm and its implementation, which efficiently detects bubble structures in contig graphs using graph decomposition. These bubbles represent variable genomic regions in closely related strains in metagenomic samples. The variation found is presented in a condensed Circos-based visualization, which allows for easy exploration and interpretation of the found variation. We validated the algorithm on two simulated datasets containing three respectively seven Escherichia coli genomes and showed that finding allelic variation in these genomes improves assemblies. Additionally, we applied MaryGold to publicly available real metagenomic datasets, enabling us to find within-sample genomic variation in the metagenomes of a kimchi fermentation process, the microbiome of a premature infant and in microbial communities living on acid mine drainage. Moreover, we used MaryGold for between-sample variation detection and exploration by comparing sequencing data sampled at different time points for both of these datasets. AVAILABILITY: MaryGold has been written in C++ and Python and can be downloaded from http://bioinformatics.tudelft.nl/software

Entities:  

Mesh:

Year:  2013        PMID: 24058058      PMCID: PMC3916741          DOI: 10.1093/bioinformatics/btt502

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  30 in total

1.  Fast algorithms for large-scale genome alignment and comparison.

Authors:  Arthur L Delcher; Adam Phillippy; Jane Carlton; Steven L Salzberg
Journal:  Nucleic Acids Res       Date:  2002-06-01       Impact factor: 16.971

2.  Efficiently detecting polymorphisms during the fragment assembly process.

Authors:  Daniel Fasulo; Aaron Halpern; Ian Dew; Clark Mobarry
Journal:  Bioinformatics       Date:  2002       Impact factor: 6.937

3.  Cytoscape: a software environment for integrated models of biomolecular interaction networks.

Authors:  Paul Shannon; Andrew Markiel; Owen Ozier; Nitin S Baliga; Jonathan T Wang; Daniel Ramage; Nada Amin; Benno Schwikowski; Trey Ideker
Journal:  Genome Res       Date:  2003-11       Impact factor: 9.043

4.  The fragment assembly string graph.

Authors:  Eugene W Myers
Journal:  Bioinformatics       Date:  2005-09-01       Impact factor: 6.937

5.  Velvet: algorithms for de novo short read assembly using de Bruijn graphs.

Authors:  Daniel R Zerbino; Ewan Birney
Journal:  Genome Res       Date:  2008-03-18       Impact factor: 9.043

6.  Circos: an information aesthetic for comparative genomics.

Authors:  Martin Krzywinski; Jacqueline Schein; Inanç Birol; Joseph Connors; Randy Gascoyne; Doug Horsman; Steven J Jones; Marco A Marra
Journal:  Genome Res       Date:  2009-06-18       Impact factor: 9.043

Review 7.  Computational methods for discovering structural variation with next-generation sequencing.

Authors:  Paul Medvedev; Monica Stanciu; Michael Brudno
Journal:  Nat Methods       Date:  2009-11       Impact factor: 28.547

8.  Strain-resolved community genomic analysis of gut microbial colonization in a premature infant.

Authors:  Michael J Morowitz; Vincent J Denef; Elizabeth K Costello; Brian C Thomas; Valeriy Poroyko; David A Relman; Jillian F Banfield
Journal:  Proc Natl Acad Sci U S A       Date:  2010-12-29       Impact factor: 11.205

Review 9.  Structure, function, and evolution of bacterial ATP-binding cassette systems.

Authors:  Amy L Davidson; Elie Dassa; Cedric Orelle; Jue Chen
Journal:  Microbiol Mol Biol Rev       Date:  2008-06       Impact factor: 11.056

10.  Efficient parallel and out of core algorithms for constructing large bi-directed de Bruijn graphs.

Authors:  Vamsi K Kundeti; Sanguthevar Rajasekaran; Hieu Dinh; Matthew Vaughn; Vishal Thapar
Journal:  BMC Bioinformatics       Date:  2010-11-15       Impact factor: 3.169

View more
  14 in total

1.  Metagenomic assembly through the lens of validation: recent advances in assessing and improving the quality of genomes assembled from metagenomes.

Authors:  Nathan D Olson; Todd J Treangen; Christopher M Hill; Victoria Cepeda-Espinoza; Jay Ghurye; Sergey Koren; Mihai Pop
Journal:  Brief Bioinform       Date:  2019-07-19       Impact factor: 11.622

2.  Metagenome SNP calling via read-colored de Bruijn graphs.

Authors:  Bahar Alipanahi; Martin D Muggli; Musa Jundi; Noelle R Noyes; Christina Boucher
Journal:  Bioinformatics       Date:  2021-04-01       Impact factor: 6.937

3.  KOMB: K-core based de novo characterization of copy number variation in microbiomes.

Authors:  Advait Balaji; Nicolae Sapoval; Charlie Seto; R A Leo Elworth; Yilei Fu; Michael G Nute; Tor Savidge; Santiago Segarra; Todd J Treangen
Journal:  Comput Struct Biotechnol J       Date:  2022-06-17       Impact factor: 6.155

4.  Utilizing de Bruijn graph of metagenome assembly for metatranscriptome analysis.

Authors:  Yuzhen Ye; Haixu Tang
Journal:  Bioinformatics       Date:  2015-08-29       Impact factor: 6.937

5.  Synthetic long-read sequencing reveals intraspecies diversity in the human microbiome.

Authors:  Volodymyr Kuleshov; Chao Jiang; Wenyu Zhou; Fereshteh Jahanbani; Serafim Batzoglou; Michael Snyder
Journal:  Nat Biotechnol       Date:  2015-12-14       Impact factor: 54.908

6.  MetaSort untangles metagenome assembly by reducing microbial community complexity.

Authors:  Peifeng Ji; Yanming Zhang; Jinfeng Wang; Fangqing Zhao
Journal:  Nat Commun       Date:  2017-01-23       Impact factor: 14.919

7.  metaSPAdes: a new versatile metagenomic assembler.

Authors:  Sergey Nurk; Dmitry Meleshko; Anton Korobeynikov; Pavel A Pevzner
Journal:  Genome Res       Date:  2017-03-15       Impact factor: 9.043

Review 8.  Reference-free SNP detection: dealing with the data deluge.

Authors:  Richard M Leggett; Dan MacLean
Journal:  BMC Genomics       Date:  2014-05-20       Impact factor: 3.969

Review 9.  Metagenomic Assembly: Overview, Challenges and Applications.

Authors:  Jay S Ghurye; Victoria Cepeda-Espinoza; Mihai Pop
Journal:  Yale J Biol Med       Date:  2016-09-30

10.  ConStrains identifies microbial strains in metagenomic datasets.

Authors:  Chengwei Luo; Rob Knight; Heli Siljander; Mikael Knip; Ramnik J Xavier; Dirk Gevers
Journal:  Nat Biotechnol       Date:  2015-09-07       Impact factor: 54.908

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.