Literature DB >> 27688341

Application of Long Sequence Reads To Improve Genomes for Clostridium thermocellum AD2, Clostridium thermocellum LQRI, and Pelosinus fermentans R7.

Sagar M Utturkar1, Edward A Bayer2, Ilya Borovok3, Raphael Lamed3, Richard A Hurt4, Miriam L Land4, Dawn M Klingeman5, Dwayne Elias4, Jizhong Zhou6, Marcel Huntemann7, Alicia Clum7, Manoj Pillay7, Krishnaveni Palaniappan7, Neha Varghese7, Natalia Mikhailova7, Dimitrios Stamatis7, T B K Reddy7, Chew Yee Ngan7, Chris Daum7, Nicole Shapiro7, Victor Markowitz7, Natalia Ivanova7, Nikos Kyrpides7, Tanja Woyke7, Steven D Brown8.   

Abstract

We and others have shown the utility of long sequence reads to improve genome assembly quality. In this study, we generated PacBio DNA sequence data to improve the assemblies of draft genomes for Clostridium thermocellum AD2, Clostridium thermocellum LQRI, and Pelosinus fermentans R7.
Copyright © 2016 Utturkar et al.

Entities:  

Year:  2016        PMID: 27688341      PMCID: PMC5043559          DOI: 10.1128/genomeA.01043-16

Source DB:  PubMed          Journal:  Genome Announc


GENOME ANNOUNCEMENT

Draft genome sequences have been generated for Clostridium thermocellum ad 2 (1), Clostridium thermocellum LQRI (DSM 2360) (2), and Pelosinus fermentans R7 (3), which encompassed 131, 110, and 65 contigs, respectively. Clostridium (Ruminiclostridium) thermocellum strains are known for their potent cellulolytic capabilities (4). C. thermocellum strain AD2 is derived from a cellulose adhesion-defective (AD) mutant strain and played a critical role in describing the original cellulosome concept (5). C. thermocellum strain LQRI is an LQ8 reisolate used in enzyme studies to propose electron flow paths (6), and has been mistakenly called LQR1 on occasion. A genome for LQ8 (DSM 1313) has been reported (7). P. fermentans type strain R7 belongs to the Negativicutes within the Firmicutes phylum and in the presence of a fermentable substrate it can reduce Fe(III) (8). Complete genomes for Pelosinus fermentans JBW45 (9) and Pelosinus sp. strain UFO1 (10) have recently been reported using only single-molecule DNA sequencing technology, and the utility of long read sequences has been shown to improve other microbial genome assemblies (11–15). In this study, genomic DNA of all three strains underwent Pacific Biosciences RSII standard template preparation and sequencing. Raw reads were assembled using the HGAP (version: 2.3.0) protocol. The HGAP assembly of AD2 contained 10 contigs and the 3.55-Mb finished genome was generated by superassembly using the Geneious (version 8.1.6) software combined with PCR and Sanger, as described previously (14). The HGAP assembly for LQRI contained two contigs, totaling 3.61 Mbp in size, with an input read coverage of 166.7. A small duplicated 12-kb LQRI artifact contig was removed from the assembly, leaving a circular chromosome of 3.57 Mb. The final HGAP assembly for the R7 genome contained two contigs totaling 5.02 Mb in genome size. Genes for all three genomes were identified using Prodigal (16) and annotations were performed as described previously (17). A total of 3,055, 3,071, and 4,631 protein-coding genes were identified in the AD2, LQRI, and R7 genomes, respectively. A prior comparison of single-molecule sequencing-based genome assembly to short-read and hybrid assembly approaches showed that assemblies based on shorter-read technologies were confounded by a large number of longer repeats, in particular multiple copies of ~5-kb rRNA gene operons (11). Consistent with this observation, earlier drafts of AD2, LQRI, and R7 genome sequences contained single copies of the rRNA genes. The improved genomes for the C. thermocellum strains reported in this study each contain four copies of the 5S, 16S, and 23S rRNA genes. The improved P. fermentans R7 genome reported in this study contains 12, nine, and nine copies of the 5S, 16S, and 23S rRNA genes, respectively. Finished or near-finished genome assemblies for these strains represent a significant improvement over earlier draft assemblies. We expect that the protein-coding potential will be superior (14) and that these genomes will facilitate comparative and functional genomic studies. Lastly, these new genome sequences will also facilitate a broader and more detailed comparison between sequencing and assembly technologies.

Accession number(s).

This whole-genome shotgun project has been deposited at DDBJ/ENA/GenBank under accession numbers CP013828, CP016502, and AKVN00000000 for AD2, LQRI, and R7, respectively.
  17 in total

1.  Complete genome sequence of the cellulolytic thermophile Clostridium thermocellum DSM1313.

Authors:  Lawrence Feinberg; Justine Foden; Trisha Barrett; Karen Walston Davenport; David Bruce; Chris Detter; Roxanne Tapia; Cliff Han; Alla Lapidus; Susan Lucas; Jan-Fang Cheng; Samuel Pitluck; Tanja Woyke; Natalia Ivanova; Natalia Mikhailova; Miriam Land; Loren Hauser; D Aaron Argyros; Lynne Goodwin; David Hogsett; Nicky Caiazza
Journal:  J Bacteriol       Date:  2011-04-01       Impact factor: 3.490

2.  Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data.

Authors:  Chen-Shan Chin; David H Alexander; Patrick Marks; Aaron A Klammer; James Drake; Cheryl Heiner; Alicia Clum; Alex Copeland; John Huddleston; Evan E Eichler; Stephen W Turner; Jonas Korlach
Journal:  Nat Methods       Date:  2013-05-05       Impact factor: 28.547

3.  Adherence of Clostridium thermocellum to cellulose.

Authors:  E A Bayer; R Kenig; R Lamed
Journal:  J Bacteriol       Date:  1983-11       Impact factor: 3.490

4.  Draft genome sequences for two metal-reducing Pelosinus fermentans strains isolated from a Cr(VI)-contaminated site and for type strain R7.

Authors:  Steven D Brown; Mircea Podar; Dawn M Klingeman; Courtney M Johnson; Zamin K Yang; Sagar M Utturkar; Miriam L Land; Jennifer J Mosher; Richard A Hurt; Tommy J Phelps; Anthony V Palumbo; Adam P Arkin; Terry C Hazen; Dwayne A Elias
Journal:  J Bacteriol       Date:  2012-09       Impact factor: 3.490

5.  Ethanol production by thermophilic bacteria: relationship between fermentation product yields of and catabolic enzyme activities in Clostridium thermocellum and Thermoanaerobium brockii.

Authors:  R Lamed; J G Zeikus
Journal:  J Bacteriol       Date:  1980-11       Impact factor: 3.490

6.  Sequence data for Clostridium autoethanogenum using three generations of sequencing technologies.

Authors:  Sagar M Utturkar; Dawn M Klingeman; José M Bruno-Barcena; Mari S Chinn; Amy M Grunden; Michael Köpke; Steven D Brown
Journal:  Sci Data       Date:  2015-04-14       Impact factor: 6.444

7.  Complete Genome Sequence of Pelosinus sp. Strain UFO1 Assembled Using Single-Molecule Real-Time DNA Sequencing Technology.

Authors:  Steven D Brown; Sagar M Utturkar; Timothy S Magnuson; Allison E Ray; Farris L Poole; W Andrew Lancaster; Michael P Thorgersen; Michael W W Adams; Dwayne A Elias
Journal:  Genome Announc       Date:  2014-09-04

8.  Comparison of single-molecule sequencing and hybrid approaches for finishing the genome of Clostridium autoethanogenum and analysis of CRISPR systems in industrial relevant Clostridia.

Authors:  Steven D Brown; Shilpa Nagaraju; Sagar Utturkar; Sashini De Tissera; Simón Segovia; Wayne Mitchell; Miriam L Land; Asela Dassanayake; Michael Köpke
Journal:  Biotechnol Biofuels       Date:  2014-03-21       Impact factor: 6.040

9.  Evaluation and validation of de novo and hybrid assembly techniques to derive high-quality genome sequences.

Authors:  Sagar M Utturkar; Dawn M Klingeman; Miriam L Land; Christopher W Schadt; Mitchel J Doktycz; Dale A Pelletier; Steven D Brown
Journal:  Bioinformatics       Date:  2014-06-14       Impact factor: 6.937

10.  Draft Genome Sequence of the Lignin-Degrading Burkholderia sp. Strain LIG30, Isolated from Wet Tropical Forest Soil.

Authors:  Hannah L Woo; Sagar Utturkar; Dawn Klingeman; Blake A Simmons; Kristen M DeAngelis; Steven D Brown; Terry C Hazen
Journal:  Genome Announc       Date:  2014-06-19
View more
  2 in total

1.  Firmicutes-enriched IS1447 represents a group of IS3-family insertion sequences exhibiting unique + 1 transcriptional slippage.

Authors:  Ya-Jun Liu; Kuan Qi; Jie Zhang; Chao Chen; Qiu Cui; Yingang Feng
Journal:  Biotechnol Biofuels       Date:  2018-11-01       Impact factor: 6.040

2.  A Case Study into Microbial Genome Assembly Gap Sequences and Finishing Strategies.

Authors:  Sagar M Utturkar; Dawn M Klingeman; Richard A Hurt; Steven D Brown
Journal:  Front Microbiol       Date:  2017-07-18       Impact factor: 5.640

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.