MOTIVATION: Next-generation sequencing allows us to sequence reads from a microbial environment using single-cell sequencing or metagenomic sequencing technologies. However, both technologies suffer from the problem that sequencing depth of different regions of a genome or genomes from different species are highly uneven. Most existing genome assemblers usually have an assumption that sequencing depths are even. These assemblers fail to construct correct long contigs. RESULTS: We introduce the IDBA-UD algorithm that is based on the de Bruijn graph approach for assembling reads from single-cell sequencing or metagenomic sequencing technologies with uneven sequencing depths. Several non-trivial techniques have been employed to tackle the problems. Instead of using a simple threshold, we use multiple depthrelative thresholds to remove erroneous k-mers in both low-depth and high-depth regions. The technique of local assembly with paired-end information is used to solve the branch problem of low-depth short repeat regions. To speed up the process, an error correction step is conducted to correct reads of high-depth regions that can be aligned to highconfident contigs. Comparison of the performances of IDBA-UD and existing assemblers (Velvet, Velvet-SC, SOAPdenovo and Meta-IDBA) for different datasets, shows that IDBA-UD can reconstruct longer contigs with higher accuracy. AVAILABILITY: The IDBA-UD toolkit is available at our website http://www.cs.hku.hk/~alse/idba_ud
MOTIVATION: Next-generation sequencing allows us to sequence reads from a microbial environment using single-cell sequencing or metagenomic sequencing technologies. However, both technologies suffer from the problem that sequencing depth of different regions of a genome or genomes from different species are highly uneven. Most existing genome assemblers usually have an assumption that sequencing depths are even. These assemblers fail to construct correct long contigs. RESULTS: We introduce the IDBA-UD algorithm that is based on the de Bruijn graph approach for assembling reads from single-cell sequencing or metagenomic sequencing technologies with uneven sequencing depths. Several non-trivial techniques have been employed to tackle the problems. Instead of using a simple threshold, we use multiple depthrelative thresholds to remove erroneous k-mers in both low-depth and high-depth regions. The technique of local assembly with paired-end information is used to solve the branch problem of low-depth short repeat regions. To speed up the process, an error correction step is conducted to correct reads of high-depth regions that can be aligned to highconfident contigs. Comparison of the performances of IDBA-UD and existing assemblers (Velvet, Velvet-SC, SOAPdenovo and Meta-IDBA) for different datasets, shows that IDBA-UD can reconstruct longer contigs with higher accuracy. AVAILABILITY: The IDBA-UD toolkit is available at our website http://www.cs.hku.hk/~alse/idba_ud
Authors: K M Handley; Y M Piceno; P Hu; L M Tom; O U Mason; G L Andersen; J K Jansson; J A Gilbert Journal: ISME J Date: 2017-08-04 Impact factor: 10.302
Authors: Andrew B Allison; Jennifer R Ballard; Robert B Tesh; Justin D Brown; Mark G Ruder; M Kevin Keel; Brandon A Munk; Randall M Mickley; Samantha E J Gibbs; Amelia P A Travassos da Rosa; Julie C Ellis; Hon S Ip; Valerie I Shearn-Bochsler; Matthew B Rogers; Elodie Ghedin; Edward C Holmes; Colin R Parrish; Chris Dwyer Journal: J Virol Date: 2014-11-12 Impact factor: 5.103
Authors: William C Nelson; Yukari Maezato; Yu-Wei Wu; Margaret F Romine; Stephen R Lindemann Journal: Appl Environ Microbiol Date: 2015-10-23 Impact factor: 4.792
Authors: Arturo Vera-Ponce de León; Benjamin C Jahnes; Jun Duan; Lennel A Camuy-Vélez; Zakee L Sabree Journal: Appl Environ Microbiol Date: 2020-04-01 Impact factor: 4.792
Authors: Karen Andrade; Jörn Logemann; Karla B Heidelberg; Joanne B Emerson; Luis R Comolli; Laura A Hug; Alexander J Probst; Angus Keillar; Brian C Thomas; Christopher S Miller; Eric E Allen; John W Moreau; Jochen J Brocks; Jillian F Banfield Journal: ISME J Date: 2015-04-28 Impact factor: 10.302
Authors: Pilar Manrique; Benjamin Bolduc; Seth T Walk; John van der Oost; Willem M de Vos; Mark J Young Journal: Proc Natl Acad Sci U S A Date: 2016-08-29 Impact factor: 11.205
Authors: Brett J Baker; Jimmy H Saw; Anders E Lind; Cassandre Sara Lazar; Kai-Uwe Hinrichs; Andreas P Teske; Thijs J G Ettema Journal: Nat Microbiol Date: 2016-02-15 Impact factor: 17.745
Authors: Nina Dombrowski; John A Donaho; Tony Gutierrez; Kiley W Seitz; Andreas P Teske; Brett J Baker Journal: Nat Microbiol Date: 2016-05-09 Impact factor: 17.745