| Literature DB >> 31406903 |
Eiko Nakazono-Nagaoka1, Takashi Fujikawa2, Ayumi Shikata1, Chakrit Tachaapaikoon3, Rattiya Waeonukul3, Patthra Pason3, Khanok Ratanakhanokchai4, Akihiko Kosugi1.
Abstract
Clostridium thermocellum is a potent cellulolytic bacterium. C. thermocellum strain PAL5, was derived from strain S14 that was isolated from bagasse paper sludge, possesses higher cellulose-degradation ability than representative strains ATCC27405 and DSM1313. In this work, we determined the draft genome sequence of C. thermocellum PAL5. Genomic DNA was used for whole-genome sequencing using the Illumina HiSeq 2500. We obtained 215 contigs of >200 bp (N50, 78,366 bp; mean length, 17,378 bp). The assembled data were subjected to the National Center for Biotechnology Information (NCBI) Prokaryotic Genome Annotation Pipeline, and 3198 protein-coding sequences, 53 tRNA genes, and 4 rRNA genes were identified. The data are accessible at NCBI (the accession number SBHL00000000). Our data resource will facilitate further studies of efficient cellulose-degradation using C. thermocellum.Entities:
Keywords: Cellulose; Cellulose-degradation; Clostridium thermocellum; Draft genome sequence
Year: 2019 PMID: 31406903 PMCID: PMC6685675 DOI: 10.1016/j.dib.2019.104274
Source DB: PubMed Journal: Data Brief ISSN: 2352-3409
Fig. 1Comparison of cellulose-degradation ability of three strains of Clostridium thermocellum. The percentage of residual cellulose related to the original weight is shown for experiments with Clostridium thermocellum strains PAL5, ATCC27405, DSM1313 and uninoculated controls (control) after 3 days of incubation at 60 °C. PAL5, ATCC247405 and DSM 1313 were grown on CTFUD medium containing 1.0% microcrystalline cellulose. The data are means of four independent experiments. Error bars represent ± standard deviation (n = 4).
Features of C. thermocellum PAL5 genome.
| Feature | Description |
|---|---|
| Number of reads used in assembly | 81,421,088 |
| Read length | 100 bp |
| Genome size (total contig size) | 3.84 Mbp |
| Assembly G + C percent | 38.80% |
| N50 contig length | 78,366 bp |
| Minimum contig length | 208 bp |
| Maximum contig length | 424,669 bp |
| Average contig length | 17,378 bp |
| Number of contigs | 215 contigs |
| Total contig size | 3,736,353 bp |
| Genome coverage | 2,178-fold |
Fig. 2Dendrogram of average nucleotide identity (ANI) values. The ANI value for each combination of strains was calculated, and a dendrogram was constructed using the unweighted pair group method with arithmetic mean. Clostridium clariflavum DSM19732 (GenBank accession number: NZ_CP003065.1) and Herbivorax saccinocola GGR1 (NZ_CP025197.1) were used as outgroups. Strains of Clostridium thermocellum: PAL5, ATCC27405 (NC_009012), DSM1313 (NC_017,304), DSM2360 (NZ_CP016502), CB1 (NZ_CBQ0000000000.1), JW20 (NZ_ABVG00000000.2), AD2 (NZ_CP013828.1), and YS (AJGT00000000.1).
Comparison of cellulosomal scaffolding proteins from strains ATCC27405T and PAL5.
| Predicted protein | ATCC27405T | Protein_accession number in PAL5 |
|---|---|---|
| Scaffolding protein | CipA | THJ77199.1, THJ77201.1, THJ77215.1 (partial) |
| Anchoring protein | OlpA | THJ76703.1 |
| OlpC | THJ77790.1 | |
| SdbA | THJ78951.1 | |
| Orf2p | THJ76702.1 | |
| OlpB | THJ76701.1, THJ77198.1, THJ77200.1 (partial) | |
| Cellulosomal integrated protein | Cthe_0735 | THJ78005.1 |
| Cthe_0736 | THJ78004.1 |
Specifications table
| Subject area | |
| More specific subject area | |
| Type of data | |
| How data were acquired | |
| Data format | |
| Experimental factors | |
| Experimental features | |
| Data source location | |
| Data accessibility | Deposited data are available at the National Center for Biotechnology Information (NCBI) under the accession number SBHL00000000 ( |
| Related research article |
Clostridium thermocellum PAL5 having strong cellulose-degradation ability was derived from strain S14 that was isolated from bagasse paper sludge. Data on draft genome sequence of stain PAL5 can be used to search and characterize genes and enzymes regarding high cellulose-degradation ability. The comparison of genome sequence data between C. thermocellum strains gives an opportunity to understand a difference of cellulose degradation ability. |