David Hernandez1, Ryan Tewhey, Jean-Baptiste Veyrieras, Laurent Farinelli, Magne Østerås, Patrice François, Jacques Schrenzel. 1. Genomic Research Laboratory, Infectious Diseases Service, Geneva University Hospitals, 1211 Geneva 4, Switzerland, Scripps Translational Science Institute, The Scripps Research Institute, La Jolla, CA 92037, USA, BioMérieux, Data and Knowledge Laboratory, 69280 Marcy l'Etoile, France and Fasteris SA, PO Box 28, 1228 Plan-les-Ouates, Switzerland.
Abstract
MOTIVATION: Paired-end sequencing allows circumventing the shortness of the reads produced by second generation sequencers and is essential for de novo assembly of genomes. However, obtaining a finished genome from short reads is still an open challenge. We present an algorithm that exploits the pairing information issued from inserts of potentially any length. The method determines paths through an overlaps graph by using a constrained search tree. We also present a method that automatically determines suited overlaps cutoffs according to the contextual coverage, reducing thus the need for manual parameterization. Finally, we introduce an interactive mode that allows querying an assembly at targeted regions. RESULTS: We assess our methods by assembling two Staphylococcus aureus strains that were sequenced on the Illumina platform. Using 100 bp paired-end reads and minimal manual curation, we produce a finished genome sequence for the previously undescribed isolate SGH-10-168. AVAILABILITY AND IMPLEMENTATION: The presented algorithms are implemented in the standalone Edena software, freely available under the General Public License (GPLv3) at www.genomic.ch/edena.php.
MOTIVATION: Paired-end sequencing allows circumventing the shortness of the reads produced by second generation sequencers and is essential for de novo assembly of genomes. However, obtaining a finished genome from short reads is still an open challenge. We present an algorithm that exploits the pairing information issued from inserts of potentially any length. The method determines paths through an overlaps graph by using a constrained search tree. We also present a method that automatically determines suited overlaps cutoffs according to the contextual coverage, reducing thus the need for manual parameterization. Finally, we introduce an interactive mode that allows querying an assembly at targeted regions. RESULTS: We assess our methods by assembling two Staphylococcus aureus strains that were sequenced on the Illumina platform. Using 100 bp paired-end reads and minimal manual curation, we produce a finished genome sequence for the previously undescribed isolate SGH-10-168. AVAILABILITY AND IMPLEMENTATION: The presented algorithms are implemented in the standalone Edena software, freely available under the General Public License (GPLv3) at www.genomic.ch/edena.php.
Authors: Sante Gnerre; Iain Maccallum; Dariusz Przybylski; Filipe J Ribeiro; Joshua N Burton; Bruce J Walker; Ted Sharpe; Giles Hall; Terrance P Shea; Sean Sykes; Aaron M Berlin; Daniel Aird; Maura Costello; Riza Daza; Louise Williams; Robert Nicol; Andreas Gnirke; Chad Nusbaum; Eric S Lander; David B Jaffe Journal: Proc Natl Acad Sci U S A Date: 2010-12-27 Impact factor: 11.205
Authors: Martin Krzywinski; Jacqueline Schein; Inanç Birol; Joseph Connors; Randy Gascoyne; Doug Horsman; Steven J Jones; Marco A Marra Journal: Genome Res Date: 2009-06-18 Impact factor: 9.043
Authors: Jonathan Butler; Iain MacCallum; Michael Kleber; Ilya A Shlyakhter; Matthew K Belmonte; Eric S Lander; Chad Nusbaum; David B Jaffe Journal: Genome Res Date: 2008-03-13 Impact factor: 9.043
Authors: Filipe J Ribeiro; Dariusz Przybylski; Shuangye Yin; Ted Sharpe; Sante Gnerre; Amr Abouelleil; Aaron M Berlin; Anna Montmayeur; Terrance P Shea; Bruce J Walker; Sarah K Young; Carsten Russ; Chad Nusbaum; Iain MacCallum; David B Jaffe Journal: Genome Res Date: 2012-07-24 Impact factor: 9.043
Authors: A Renzoni; E Von Dach; C Landelle; S M Diene; C Manzano; R Gonzales; W Abdelhady; C P Randall; E J Bonetti; D Baud; A J O'Neill; A Bayer; A Cherkaoui; J Schrenzel; S Harbarth; P François Journal: Antimicrob Agents Chemother Date: 2017-09-22 Impact factor: 5.191
Authors: Gabriel Ramírez-Vargas; Carlos Quesada-Gómez; Luis Acuña-Amador; Diana López-Ureña; Tatiana Murillo; María Del Mar Gamboa-Coronado; Esteban Chaves-Olarte; Nicholas Thomson; Evelyn Rodríguez-Cavallini; César Rodríguez Journal: Antimicrob Agents Chemother Date: 2017-03-24 Impact factor: 5.191