
Draft genome sequence data of Clostridium thermocellum PAL5 possessing high cellulose-degradation ability.

Eiko Nakazono-Nagaoka, Takashi Fujikawa, Ayumi Shikata, Chakrit Tachaapaikoon, Rattiya Waeonukul, Patthra Pason, Khanok Ratanakhanokchai, Akihiko Kosugi.

Abstract

Clostridium thermocellum is a potent cellulolytic bacterium. C. thermocellum strain PAL5, derived from strain S14, which was isolated from bagasse paper sludge, possesses higher cellulose-degradation ability than the representative strains ATCC27405 and DSM1313. In this work, we determined the draft genome sequence of C. thermocellum PAL5. Genomic DNA was subjected to whole-genome sequencing on the Illumina HiSeq 2500. We obtained 215 contigs of >200 bp (N50, 78,366 bp; mean length, 17,378 bp). The assembled data were submitted to the National Center for Biotechnology Information (NCBI) Prokaryotic Genome Annotation Pipeline, and 3,198 protein-coding sequences, 53 tRNA genes, and 4 rRNA genes were identified. The data are accessible at NCBI (accession number SBHL00000000). Our data resource will facilitate further studies of efficient cellulose degradation using C. thermocellum.


Keywords:  Cellulose; Cellulose-degradation; Clostridium thermocellum; Draft genome sequence

Year:  2019        PMID: 31406903      PMCID: PMC6685675          DOI: 10.1016/j.dib.2019.104274

Source DB:  PubMed          Journal:  Data Brief        ISSN: 2352-3409



Data

The thermophilic anaerobic bacterium Clostridium thermocellum (recently reclassified as Hungateiclostridium thermocellum) is a multifunctional ethanol producer, capable of both saccharification and fermentation [1]. C. thermocellum PAL5 was derived from strain S14 [2], [3], [4], which was isolated from bagasse paper sludge. The cellulolytic activity of strain PAL5 was compared with those of C. thermocellum ATCC27405T, the type strain of this species [5], and C. thermocellum DSM1313 [6] by incubation for 3 days at 60 °C in CTFUD medium [7] containing 1.0% microcrystalline cellulose powder instead of cellobiose. PAL5 degraded more cellulose than the other two strains (Fig. 1), indicating that PAL5 may, like strain S14, possess high cellulose-degradation ability.
Fig. 1

Comparison of cellulose-degradation ability of three strains of Clostridium thermocellum. The percentage of residual cellulose relative to the original weight is shown for Clostridium thermocellum strains PAL5, ATCC27405, and DSM1313 and for uninoculated controls (control) after 3 days of incubation at 60 °C. PAL5, ATCC27405, and DSM1313 were grown on CTFUD medium containing 1.0% microcrystalline cellulose. The data are means of four independent experiments. Error bars represent ± standard deviation (n = 4).

In this work, we determined the draft genome sequence of C. thermocellum PAL5 to identify which factors affect its cellulose-degradation ability. In total, 81,421,880 single reads of 100 bp were obtained after quality-score filtering. De novo genome assembly was performed using the CLC Genomics Workbench (CLC Bio, Qiagen, Valencia, CA); 215 contigs of >200 bp, excluding scaffolded regions, were obtained. Features of the genome are shown in Table 1. The assembled data for PAL5 were submitted to the NCBI Prokaryotic Genome Annotation Pipeline (PGAP), and 3,198 protein-coding sequences (CDSs), 53 tRNA genes, and 4 rRNA genes were identified. The equivalent values for strain ATCC27405 were 3,204 CDSs, 56 tRNA genes, and 12 rRNA genes (GenBank accession number: NC_009012). The sequencing results for PAL5 were therefore similar to the known genome information for the type strain and could be considered reliable.
Table 1

Features of C. thermocellum PAL5 genome.

Feature                              Description
Number of reads used in assembly     81,421,088
Read length                          100 bp
Genome size (total contig size)      3.84 Mbp
Assembly G + C percent               38.80%
N50 contig length                    78,366 bp
Minimum contig length                208 bp
Maximum contig length                424,669 bp
Average contig length                17,378 bp
Number of contigs                    215 contigs
Total contig size                    3,736,353 bp
Genome coverage                      2,178-fold
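
The assembly statistics in Table 1 follow from simple arithmetic on the contig lengths and read counts. The sketch below is a minimal illustration, not the CLC Genomics Workbench computation: the function names and the toy contig lengths are hypothetical, and only the three input values passed at the end are taken from Table 1. With those inputs, the average contig length reproduces the tabulated 17,378 bp, and the simple coverage estimate (reads × read length / total contig size) lands within about one fold of the reported 2,178-fold.

```python
def n50(contig_lengths):
    """Return the N50: the contig length L such that contigs of
    length >= L together contain at least half of the assembly."""
    ordered = sorted(contig_lengths, reverse=True)
    half = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half:
            return length
    raise ValueError("contig_lengths must not be empty")


def mean_length(total_size, n_contigs):
    """Average contig length in bp."""
    return total_size / n_contigs


def fold_coverage(n_reads, read_length, assembly_size):
    """Naive coverage estimate: sequenced bases / assembled bases."""
    return n_reads * read_length / assembly_size


# Hypothetical toy contig lengths (bp); these are NOT the PAL5 contigs.
toy_contigs = [500, 400, 300, 200, 100]
print(n50(toy_contigs))  # 500 + 400 = 900 >= 750, so N50 is 400

# Inputs below are the values listed in Table 1.
print(round(mean_length(3_736_353, 215)))
print(round(fold_coverage(81_421_088, 100, 3_736_353)))
```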
We performed average nucleotide identity (ANI) analysis [8] on eight strains of C. thermocellum, including PAL5, and two outgroup strains, C. clariflavum DSM19732 (CP003065.1) and Herbivorax (Hungateiclostridium) saccincola GGR1 (CP025197.1). The ANI value is calculated as the mean identity of BLASTn matches between the virtually fragmented query genome and the reference genome. Dendrograms of relatedness based on the ANI values (Suppl. Table 1) were constructed using the unweighted pair group method with arithmetic mean (UPGMA; Fig. 2) and the single-linkage method (data not shown), both of which showed that PAL5 is closely related to all the C. thermocellum strains.
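
The ANI definition can be illustrated with a short sketch. It assumes the BLASTn step has already been run elsewhere and that each query fragment's best hit is summarized as a (percent identity, aligned fraction) pair. The fragment size and the two filtering thresholds are illustrative assumptions, not the settings of the web tool used in this work, and the hit values are hypothetical.

```python
def fragment(sequence, size=1020):
    """Cut a query genome into consecutive, non-overlapping fragments.
    The fragment size is an assumed, illustrative default."""
    last_start = len(sequence) - size
    return [sequence[i:i + size] for i in range(0, last_start + 1, size)]


def ani(hits, min_identity=30.0, min_aligned_fraction=0.7):
    """Mean percent identity over the fragment hits that pass both filters.

    hits: iterable of (percent_identity, aligned_fraction) pairs, one per
    query fragment. Both thresholds are assumptions for illustration.
    Returns None when no hit passes the filters.
    """
    kept = [identity for identity, fraction in hits
            if identity >= min_identity and fraction >= min_aligned_fraction]
    if not kept:
        return None
    return sum(kept) / len(kept)


# Hypothetical best hits for four fragments; the last two are filtered out
# (one aligns over too little of its length, one has too low identity).
example_hits = [(99.0, 0.95), (98.0, 0.90), (85.0, 0.50), (25.0, 0.90)]
print(ani(example_hits))  # mean of 99.0 and 98.0
```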
Fig. 2

Dendrogram of average nucleotide identity (ANI) values. The ANI value for each combination of strains was calculated, and a dendrogram was constructed using the unweighted pair group method with arithmetic mean. Clostridium clariflavum DSM19732 (GenBank accession number: NZ_CP003065.1) and Herbivorax saccincola GGR1 (NZ_CP025197.1) were used as outgroups. Strains of Clostridium thermocellum: PAL5, ATCC27405 (NC_009012), DSM1313 (NC_017304), DSM2360 (NZ_CP016502), CB1 (NZ_CBQ0000000000.1), JW20 (NZ_ABVG00000000.2), AD2 (NZ_CP013828.1), and YS (AJGT00000000.1).

Eight putative cellulosomal scaffolding proteins of PAL5 were identified from the genomic data by similarity with those of strain ATCC27405 (Table 2). CipA and OlpB were each represented by three nonconsecutive partial protein accessions; we suggest this was because the single reads covering these regions could not be joined by the algorithm used in the de novo assembly. We consider our genome data to be of sufficient quality for further analysis of the factors that affect the cellulose-degradation ability of strain PAL5 and other strains.
Table 2

Comparison of cellulosomal scaffolding proteins from strains ATCC27405T and PAL5.

Predicted protein                  ATCC27405T   Protein accession number in PAL5
Scaffolding protein                CipA         THJ77199.1, THJ77201.1, THJ77215.1 (partial)
Anchoring protein                  OlpA         THJ76703.1
Anchoring protein                  OlpC         THJ77790.1
Anchoring protein                  SdbA         THJ78951.1
Anchoring protein                  Orf2p        THJ76702.1
Anchoring protein                  OlpB         THJ76701.1, THJ77198.1, THJ77200.1 (partial)
Cellulosomal integrated protein    Cthe_0735    THJ78005.1
Cellulosomal integrated protein    Cthe_0736    THJ78004.1

Experimental design, materials, and methods

Genomic DNA extraction and sequencing

Genomic DNA of C. thermocellum PAL5 was extracted from cells grown under anaerobic conditions at 60 °C using the cetyltrimethylammonium bromide (CTAB) method [9]. Sequencing libraries were prepared from the genomic DNA using the TruSeq Nano DNA LT Library Prep Kit (Illumina, San Diego, CA). Clusters were generated using the HiSeq PE Rapid Cluster Kit v2-HS and the HiSeq Rapid Duo cBot v2 Sample Loading Kit, and the libraries were then sequenced using the HiSeq Rapid SBS Kit v2-HS (Illumina) on the HiSeq 2500 next-generation sequencer (Illumina). De novo genome assembly was performed using the CLC Genomics Workbench. The assembled data were submitted to the NCBI PGAP.

Genomic average nucleotide identity

ANI analysis, an in silico substitute for DNA–DNA hybridization, was performed. ANI values for each pairwise combination of the whole-genome sequences of the C. thermocellum strains were calculated using the web tool ANI calculator (http://enve-omics.ce.gatech.edu/ani/). The matrix of ANI values between the C. thermocellum strains was converted to a dendrogram using the unweighted pair group method with arithmetic mean and the single-linkage clustering method in the R statistical software.
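
The conversion of an ANI matrix into a dendrogram can be sketched as follows. This is a plain-Python illustration of the unweighted pair group method with arithmetic mean, not the R code used in this work. It assumes distance = 100 − ANI, which the text does not specify, and the strain labels and ANI values are hypothetical placeholders rather than the values in Suppl. Table 1.

```python
from itertools import combinations


def upgma(ani):
    """Average-linkage (UPGMA) clustering of pairwise ANI values.

    ani: dict mapping (strain_a, strain_b) -> ANI in percent, one entry per
    unordered pair. Distance is taken as 100 - ANI (an assumption).
    Returns a list of merges (cluster_a, cluster_b, node_height).
    """
    dist = {frozenset(pair): 100.0 - value for pair, value in ani.items()}
    size = {name: 1 for pair in dist for name in pair}
    merges = []
    while len(size) > 1:
        # Pick the closest pair of current clusters (sorted for determinism).
        a, b = min(combinations(sorted(size, key=str), 2),
                   key=lambda pair: dist[frozenset(pair)])
        merged = (a, b)
        height = dist[frozenset((a, b))] / 2
        # Distance to the merged cluster is the size-weighted average.
        for other in size:
            if other in (a, b):
                continue
            dist[frozenset((merged, other))] = (
                size[a] * dist[frozenset((a, other))]
                + size[b] * dist[frozenset((b, other))]
            ) / (size[a] + size[b])
        size[merged] = size.pop(a) + size.pop(b)
        merges.append((a, b, height))
    return merges


# Hypothetical ANI values (percent) for three strains and one outgroup.
toy_ani = {
    ("A", "B"): 99.0, ("A", "C"): 97.0, ("B", "C"): 97.0,
    ("A", "OUT"): 80.0, ("B", "OUT"): 80.0, ("C", "OUT"): 80.0,
}
for step in upgma(toy_ani):
    print(step)
```

In this toy input the two most similar strains merge first and the outgroup joins last, which is the pattern expected when outgroup genomes are included to root the comparison.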

Specifications table

Subject area                  Biology
More specific subject area    Bacteriology, Genomics
Type of data                  Genomic sequence, predicted genes and annotation of respective proteins, deposited in NCBI database and available by links provided within article
How data were acquired        Whole-genome sequencing using Illumina HiSeq 2500
Data format                   Raw and analyzed
Experimental factors          Genomic DNA extracted from pure culture of Clostridium thermocellum PAL5
Experimental features         Genome sequencing, de novo assembly, gene prediction
Data source location          Tsukuba, Ibaraki, Japan
Data accessibility            Deposited data are available at the National Center for Biotechnology Information (NCBI) under the accession number SBHL00000000 (https://www.ncbi.nlm.nih.gov/nuccore/SBHL00000000)
Related research article      C. Tachaapaikoon, A. Kosugi, P. Pason, R. Waeonukul, K. Ratanakhanokchai, K.L. Kyu, T. Arai, Y. Murata, Y. Mori, Isolation and characterization of a new cellulosome-producing Clostridium thermocellum strain, Biodegradation 23 (1) (2012) 57–68.

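
For programmatic access to the deposited record, one option is the NCBI E-utilities `efetch` endpoint. The sketch below only builds the request URL and does not contact NCBI. The helper name `efetch_url` is hypothetical, and whether this request returns the contig sequences or only the whole-genome-shotgun master record for SBHL00000000 is not confirmed here.

```python
from urllib.parse import urlencode

# Base URL of the NCBI E-utilities efetch service.
EFETCH = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/efetch.fcgi"


def efetch_url(accession, rettype="gb"):
    """Build an efetch URL for a nucleotide record (hypothetical helper).

    rettype "gb" requests GenBank flat-file format; "fasta" requests FASTA.
    """
    query = urlencode({
        "db": "nuccore",
        "id": accession,
        "rettype": rettype,
        "retmode": "text",
    })
    return f"{EFETCH}?{query}"


print(efetch_url("SBHL00000000"))
```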
Value of the data

Clostridium thermocellum PAL5, which has strong cellulose-degradation ability, was derived from strain S14, which was isolated from bagasse paper sludge.

The draft genome sequence data of strain PAL5 can be used to search for and characterize genes and enzymes associated with high cellulose-degradation ability.

Comparison of genome sequence data between C. thermocellum strains offers an opportunity to understand differences in cellulose-degradation ability.

References

1.  Chakrit Tachaapaikoon; Akihiko Kosugi; Patthra Pason; Rattiya Waeonukul; Khanok Ratanakhanokchai; Khin Lay Kyu; Takamitsu Arai; Yoshinori Murata; Yutaka Mori. Isolation and characterization of a new cellulosome-producing Clostridium thermocellum strain. Biodegradation. 2011-06-03.

2.  Johan Goris; Konstantinos T Konstantinidis; Joel A Klappenbach; Tom Coenye; Peter Vandamme; James M Tiedje. DNA-DNA hybridization values and their relationship to whole-genome sequence similarities. Int J Syst Evol Microbiol. 2007-01.

3.  Lawrence Feinberg; Justine Foden; Trisha Barrett; Karen Walston Davenport; David Bruce; Chris Detter; Roxanne Tapia; Cliff Han; Alla Lapidus; Susan Lucas; Jan-Fang Cheng; Samuel Pitluck; Tanja Woyke; Natalia Ivanova; Natalia Mikhailova; Miriam Land; Loren Hauser; D Aaron Argyros; Lynne Goodwin; David Hogsett; Nicky Caiazza. Complete genome sequence of the cellulolytic thermophile Clostridium thermocellum DSM1313. J Bacteriol. 2011-04-01.

4.  Erma Widyasti; Ayumi Shikata; Rokiah Hashim; Othman Sulaiman; Kumar Sudesh; Edi Wahjono; Akihiko Kosugi. Biodegradation of fibrillated oil palm trunk fiber by a novel thermophilic, anaerobic, xylanolytic bacterium Caldicoprobacter sp. CL-2 isolated from compost. Enzyme Microb Technol. 2017-12-30.

5.  Daniel G Olson; Lee R Lynd. Transformation of Clostridium thermocellum by electroporation. Methods Enzymol. 2012.

6.  M G Murray; W F Thompson. Rapid isolation of high molecular weight plant DNA. Nucleic Acids Res. 1980-10-10.

7.  Ayumi Shikata; Junjarus Sermsathanaswadi; Phakhinee Thianheng; Sirilak Baramee; Chakrit Tachaapaikoon; Rattiya Waeonukul; Patthra Pason; Khanok Ratanakhanokchai; Akihiko Kosugi. Characterization of an Anaerobic, Thermophilic, Alkaliphilic, High Lignocellulosic Biomass-Degrading Bacterial Community, ISHI-3, Isolated from Biocompost. Enzyme Microb Technol. 2018-07-08.

8.  Panida Prawitwong; Rattiya Waeonukul; Chakrit Tachaapaikoon; Patthra Pason; Khanok Ratanakhanokchai; Lan Deng; Junjarus Sermsathanaswadi; Krisna Septiningrum; Yutaka Mori; Akihiko Kosugi. Direct glucose production from lignocellulose using Clostridium thermocellum cultures supplemented with a thermostable β-glucosidase. Biotechnol Biofuels. 2013-12-21.

9.  Charlotte M Wilson; Miguel Rodriguez; Courtney M Johnson; Stanton L Martin; Tzu Ming Chu; Russ D Wolfinger; Loren J Hauser; Miriam L Land; Dawn M Klingeman; Mustafa H Syed; Arthur J Ragauskas; Timothy J Tschaplinski; Jonathan R Mielenz; Steven D Brown. Global transcriptome analysis of Clostridium thermocellum ATCC 27405 during growth on dilute acid pretreated Populus and switchgrass. Biotechnol Biofuels. 2013-12-02.
