Zachary Deng1,2, Eric Delwart3,4. 1. Vitalant Research Institute, San Francisco, CA, 94118, USA. dengzac@gmail.com. 2. Department of Laboratory Medicine, University of California at San Francisco, San Francisco, CA, 94107, USA. dengzac@gmail.com. 3. Vitalant Research Institute, San Francisco, CA, 94118, USA. delwarte@medicine.ucsf.edu. 4. Department of Laboratory Medicine, University of California at San Francisco, San Francisco, CA, 94107, USA. delwarte@medicine.ucsf.edu.
Abstract
BACKGROUND: Metagenomics is the study of microbial genomes for pathogen detection and discovery in human clinical, animal, and environmental samples via Next-Generation Sequencing (NGS). Metagenome de novo sequence assembly is a crucial analytical step in which longer contigs, ideally whole chromosomes/genomes, are formed from shorter NGS reads. However, the contigs generated from the de novo assembly are often very fragmented and rarely longer than a few kilo base pairs (kb). Therefore, a time-consuming extension process is routinely performed on the de novo assembled contigs. RESULTS: To facilitate this process, we propose a new tool for metagenome contig extension after de novo assembly. ContigExtender employs a novel recursive extending strategy that explores multiple extending paths to achieve highly accurate longer contigs. We demonstrate that ContigExtender outperforms existing tools in synthetic, animal, and human metagenomics datasets. CONCLUSIONS: A novel software tool ContigExtender has been developed to assist and enhance the performance of metagenome de novo assembly. ContigExtender effectively extends contigs from a variety of sources and can be incorporated in most viral metagenomics analysis pipelines for a wide variety of applications, including pathogen detection and viral discovery.
BACKGROUND: Metagenomics is the study of microbial genomes for pathogen detection and discovery in human clinical, animal, and environmental samples via Next-Generation Sequencing (NGS). Metagenome de novo sequence assembly is a crucial analytical step in which longer contigs, ideally whole chromosomes/genomes, are formed from shorter NGS reads. However, the contigs generated from the de novo assembly are often very fragmented and rarely longer than a few kilo base pairs (kb). Therefore, a time-consuming extension process is routinely performed on the de novo assembled contigs. RESULTS: To facilitate this process, we propose a new tool for metagenome contig extension after de novo assembly. ContigExtender employs a novel recursive extending strategy that explores multiple extending paths to achieve highly accurate longer contigs. We demonstrate that ContigExtender outperforms existing tools in synthetic, animal, and human metagenomics datasets. CONCLUSIONS: A novel software tool ContigExtender has been developed to assist and enhance the performance of metagenome de novo assembly. ContigExtender effectively extends contigs from a variety of sources and can be incorporated in most viral metagenomics analysis pipelines for a wide variety of applications, including pathogen detection and viral discovery.
Entities:
Keywords:
De novo assembly; Metagenomics; Next-Gen Sequencing; Pathogen detection; Viral discovery
Authors: Xiao Yang; Patrick Charlebois; Sante Gnerre; Matthew G Coole; Niall J Lennon; Joshua Z Levin; James Qu; Elizabeth M Ryan; Michael C Zody; Matthew R Henn Journal: BMC Genomics Date: 2012-09-13 Impact factor: 3.969
Authors: Samia N Naccache; Scot Federman; Narayanan Veeraraghavan; Matei Zaharia; Deanna Lee; Erik Samayoa; Jerome Bouquet; Alexander L Greninger; Ka-Cheung Luk; Barryett Enge; Debra A Wadford; Sharon L Messenger; Gillian L Genrich; Kristen Pellegrino; Gilda Grard; Eric Leroy; Bradley S Schneider; Joseph N Fair; Miguel A Martínez; Pavel Isa; John A Crump; Joseph L DeRisi; Taylor Sittler; John Hackett; Steve Miller; Charles Y Chiu Journal: Genome Res Date: 2014-06-04 Impact factor: 9.043
Authors: João M P Alves; André L de Oliveira; Tatiana O M Sandberg; Jaime L Moreno-Gallego; Marcelo A F de Toledo; Elisabeth M M de Moura; Liliane S Oliveira; Alan M Durham; Dolores U Mehnert; Paolo M de A Zanotto; Alejandro Reyes; Arthur Gruber Journal: Front Microbiol Date: 2016-03-04 Impact factor: 5.640