Literature DB >> 19077061

De novo assembly of the Pseudomonas syringae pv. syringae B728a genome using Illumina/Solexa short sequence reads.

Rhys A Farrer1, Eric Kemen, Jonathan D G Jones, David J Studholme.   

Abstract

Illumina's Genome Analyzer generates ultra-short sequence reads, typically 36 nucleotides in length, and is primarily intended for resequencing. We tested the potential of this technology for de novo sequence assembly on the 6 Mbp genome of Pseudomonas syringae pv. syringae B728a with several freely available assembly software packages. Using an unpaired data set, velvet assembled >96% of the genome into contigs with an N50 length of 8289 nucleotides and an error rate of 0.33%. EDENA generated smaller contigs (N50 was 4192 nucleotides) and comparable error rates. SSAKE and VCAKE yielded shorter contigs with very high error rates. Assembly of paired-end sequence data carrying 400 bp inserts produced longer contigs (N50 up to 15 628 nucleotides), but with increased error rates (0.5%). Contig length and error rate were very sensitive to the choice of parameter values. Noncoding RNA genes were poorly resolved in de novo assemblies, while >90% of the protein-coding genes were assembled with 100% accuracy over their full length. This study demonstrates that, in practice, de novo assembly of 36-nucleotide reads can generate reasonably accurate assemblies from about 40 x deep sequence data sets. These draft assemblies are useful for exploring an organism's proteomic potential, at a very economic low cost.

Entities:  

Mesh:

Year:  2008        PMID: 19077061     DOI: 10.1111/j.1574-6968.2008.01441.x

Source DB:  PubMed          Journal:  FEMS Microbiol Lett        ISSN: 0378-1097            Impact factor:   2.742


  40 in total

1.  Resolution of a meningococcal disease outbreak from whole-genome sequence data with rapid Web-based analysis methods.

Authors:  Keith A Jolley; Dorothea M C Hill; Holly B Bratcher; Odile B Harrison; Ian M Feavers; Julian Parkhill; Martin C J Maiden
Journal:  J Clin Microbiol       Date:  2012-07-11       Impact factor: 5.948

Review 2.  The long and winding road: virulence effector proteins of plant pathogenic bacteria.

Authors:  Dagmar R Hann; John P Rathjen
Journal:  Cell Mol Life Sci       Date:  2010-06-13       Impact factor: 9.261

Review 3.  Next-generation sequencing techniques for eukaryotic microorganisms: sequencing-based solutions to biological problems.

Authors:  Minou Nowrousian
Journal:  Eukaryot Cell       Date:  2010-07-02

4.  Nitrite-driven anaerobic methane oxidation by oxygenic bacteria.

Authors:  Katharina F Ettwig; Margaret K Butler; Denis Le Paslier; Eric Pelletier; Sophie Mangenot; Marcel M M Kuypers; Frank Schreiber; Bas E Dutilh; Johannes Zedelius; Dirk de Beer; Jolein Gloerich; Hans J C T Wessels; Theo van Alen; Francisca Luesken; Ming L Wu; Katinka T van de Pas-Schoonen; Huub J M Op den Camp; Eva M Janssen-Megens; Kees-Jan Francoijs; Henk Stunnenberg; Jean Weissenbach; Mike S M Jetten; Marc Strous
Journal:  Nature       Date:  2010-03-25       Impact factor: 49.962

5.  Genome assembly reborn: recent computational challenges.

Authors:  Mihai Pop
Journal:  Brief Bioinform       Date:  2009-05-29       Impact factor: 11.622

Review 6.  Application of 'next-generation' sequencing technologies to microbial genetics.

Authors:  Daniel MacLean; Jonathan D G Jones; David J Studholme
Journal:  Nat Rev Microbiol       Date:  2009-04       Impact factor: 60.633

Review 7.  Genomics and bioinformatics resources for crop improvement.

Authors:  Keiichi Mochida; Kazuo Shinozaki
Journal:  Plant Cell Physiol       Date:  2010-03-05       Impact factor: 4.927

8.  Large scale single nucleotide polymorphism discovery in unsequenced genomes using second generation high throughput sequencing technology: applied to turkey.

Authors:  Hindrik H D Kerstens; Richard P M A Crooijmans; Albertine Veenendaal; Bert W Dibbits; Thomas F C Chin-A-Woeng; Johan T den Dunnen; Martien A M Groenen
Journal:  BMC Genomics       Date:  2009-10-16       Impact factor: 3.969

9.  De novo 454 sequencing of barcoded BAC pools for comprehensive gene survey and genome analysis in the complex genome of barley.

Authors:  Burkhard Steuernagel; Stefan Taudien; Heidrun Gundlach; Michael Seidel; Ruvini Ariyadasa; Daniela Schulte; Andreas Petzold; Marius Felder; Andreas Graner; Uwe Scholz; Klaus F X Mayer; Matthias Platzer; Nils Stein
Journal:  BMC Genomics       Date:  2009-11-20       Impact factor: 3.969

10.  ALLPATHS 2: small genomes assembled accurately and with high continuity from short paired reads.

Authors:  Iain Maccallum; Dariusz Przybylski; Sante Gnerre; Joshua Burton; Ilya Shlyakhter; Andreas Gnirke; Joel Malek; Kevin McKernan; Swati Ranade; Terrance P Shea; Louise Williams; Sarah Young; Chad Nusbaum; David B Jaffe
Journal:  Genome Biol       Date:  2009-10-01       Impact factor: 13.583

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.