Literature DB >> 28619808

First Insights into the Genome Sequence of Clostridium oryzae DSM 28571, Isolated from the Soil of a Japanese Rice Field.

Anja Poehlein1, Anna Lena Gippert2, Maaike J Bierenbroodspot2, Rolf Daniel3.   

Abstract

Clostridium oryzae was originally isolated from the soil of a Japanese rice field. C. oryzae represents a novel species within the genus Clostridium and is associated with anaerobic rice straw degradation. Here, we present the draft genome sequence of C. oryzae DSM 28571 (5.076 Mbp), containing 4,590 predicted protein-coding genes.
Copyright © 2017 Poehlein et al.

Entities:  

Year:  2017        PMID: 28619808      PMCID: PMC5473277          DOI: 10.1128/genomeA.00539-17

Source DB:  PubMed          Journal:  Genome Announc


GENOME ANNOUNCEMENT

Clostridium oryzae is an obligate anaerobic, Gram-positive, and spore-forming bacterium which was isolated from the soil of a Japanese rice field, using rice straw as the substrate (1). Phylogenetic analysis based on gyrase A protein sequences revealed that C. oryzae is closely related to some members of the genus Clostridium sensu stricto and Clostridium acetobutylicum. C. oryzae represents a novel species within the genus Clostridium, with strain KC3 (DSM 28571) as the type strain (1). Chromosomal DNA was extracted applying the MasterPure complete DNA purification kit, according to the instructions of the manufacturer (Epicentre, Madison, WI, USA). Illumina paired-end seqeuncing libraries were prepared using the Nextera XT DNA library preparation kit, as recommended by the manufacturer (Illumina, San Diego, CA, USA). Sequencing of DNA was performed employing an Illumina MiSeq instrument and Illumina MiSeq reagent kit version 3, as described by the manufacturer. Trimming of reads using Trimmomatic version 0.36 (2) resulted in 2,856,526 paired-end reads. The assembly performed with the SPAdes genome assembler software version 3.10.0 (3) resulted in 176 contigs (>500 bp), with an average coverage of 135-fold. The obtained contigs were validated using QualiMap version 2.1 (4). The draft genome consists of a single chromosome (5.076 Mbp), with an overall G+C content of 33.1%. Automatic gene prediction was performed by using Prokka (5). The draft genome harbored 4,590 predicted protein-coding genes, including 3,514 genes with a predicted function and 1,076 genes coding for hypothetical proteins. In addition, 12 rRNA genes, 1 transfer-messenger RNA (tmRNA), and 69 tRNA genes were predicted. Interestingly, 3 repeat regions were found in the genome of C. oryzae. Two of these regions were in close proximity to a clustered regularly interspaced short palindromic repeat (CRISPR)-associated (Cas) gene cluster present in the genome. The third repeat region was associated with another CRISPR-Cas gene cluster identified in the genome. In addition to Bacteroidetes and the Alphaproteobacteria, clostridial species are low-abundance components of rice soil bacterial communities. Rice straw mainly contains approximately 37% cellulose, 37% hemicellulose, and 15% lignin (6). C. oryzae cannot utilize lignin, cellulose, or hemicellulose, but it uses carbohydrates as growth substrates. One of the main substrates used by C. oryzae is soluble starch. We could identify a gene coding for a starch phosphorylase, which converts starch to α-d-glucose-6-phosphate (7). C. oryzae can also grow on cellobiose and melibiose. During the fermentation of cellobiose, C. oryzae produces lactate, butyrate, acetate, formate, H2, and CO2 (1). Correspondingly, 3 for butyrate production (puuE, buk, and ptb) and 4 genes for acetate synthesis (paaK, thiH, ackA, and pta) were detected in the genome sequence. C. oryzae has been described as positive for different enzymatic activities, including α-galactosidase, β-glucosidase, and α-arabinosidase activities (1). Correspondingly, genes were detected in the genome which were related to these enzyme types.

Accession number(s).

The whole-genome shotgun project has been deposited at DDBJ/EMBL/GenBank under the accession number MZGV00000000. The version described here is the first version, MZGV01000000.
  6 in total

1.  SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing.

Authors:  Anton Bankevich; Sergey Nurk; Dmitry Antipov; Alexey A Gurevich; Mikhail Dvorkin; Alexander S Kulikov; Valery M Lesin; Sergey I Nikolenko; Son Pham; Andrey D Prjibelski; Alexey V Pyshkin; Alexander V Sirotkin; Nikolay Vyahhi; Glenn Tesler; Max A Alekseyev; Pavel A Pevzner
Journal:  J Comput Biol       Date:  2012-04-16       Impact factor: 1.479

2.  Clostridium oryzae sp. nov., from soil of a Japanese rice field.

Authors:  Haruka Horino; Miyuki Ito; Akio Tonouchi
Journal:  Int J Syst Evol Microbiol       Date:  2015-01-06       Impact factor: 2.747

3.  Qualimap: evaluating next-generation sequencing alignment data.

Authors:  Fernando García-Alcalde; Konstantin Okonechnikov; José Carbonell; Luis M Cruz; Stefan Götz; Sonia Tarazona; Joaquín Dopazo; Thomas F Meyer; Ana Conesa
Journal:  Bioinformatics       Date:  2012-08-22       Impact factor: 6.937

4.  Prokka: rapid prokaryotic genome annotation.

Authors:  Torsten Seemann
Journal:  Bioinformatics       Date:  2014-03-18       Impact factor: 6.937

Review 5.  Starch phosphorylase: role in starch metabolism and biotechnological applications.

Authors:  R S Rathore; Neha Garg; Sarika Garg; Anil Kumar
Journal:  Crit Rev Biotechnol       Date:  2009       Impact factor: 8.429

6.  Trimmomatic: a flexible trimmer for Illumina sequence data.

Authors:  Anthony M Bolger; Marc Lohse; Bjoern Usadel
Journal:  Bioinformatics       Date:  2014-04-01       Impact factor: 6.937

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.