Literature DB >> 25700411

Draft Genome Sequence of a Cellulase-Producing Psychrotrophic Paenibacillus Strain, IHB B 3415, Isolated from the Cold Environment of the Western Himalayas, India.

Hena Dhar, Mohit Kumar Swarnkar¹, Arvind Gulati, Anil Kumar Singh², Ramesh Chand Kasana³.

Abstract

Paenibacillus sp. strain IHB B 3415 is a cellulase-producing psychrotrophic bacterium isolated from a soil sample from the cold deserts of Himachal Pradesh, India. Here, we report an 8.44-Mb assembly of its genome sequence with a G+C content of 50.77%. The data presented here will provide insights into the mechanisms of cellulose degradation at low temperature.

Entities: Chemical Species

Year: 2015 PMID： 25700411 PMCID： PMC4335335 DOI： 10.1128/genomeA.01581-14

Source DB: PubMed Journal: Genome Announc

GENOME ANNOUNCEMENT

Low-temperature environments are inhabited by cold-adapted microorganisms, psychrotrophs and psychrophiles. These cold-adapted microorganisms are sources of commercially and industrially important enzymes, including cellulases (1). During our exploration of the microbial diversity associated with the soils of cold environments of Himachal Pradesh, India, we isolated a Paenibacillus strain, IHB B 3415, which produces cellulase at low temperature. This bacterium appears to be most closely related to Paenibacillus borealis KK19T based on a 16S rRNA gene sequence analysis. The genome of Paenibacillus sp. IHB B 3415 was sequenced, given the ability of the organism to grow and produce low-temperature active enzymes. Whole-genome shotgun sequencing was completed using the Illumina Genome Analyzer IIx in 76-bp paired-read format. A partial flow cell obtained 28,321,562 raw paired-end (PE) reads with 4,304,877,424 bases of raw sequence. The paired reads were quality filtered using the NGS QC toolkit version 2.3 (2) (cutoff read length for HQ, 70%; cutoff quality score, 20). A sum of 23,383,467 (77.57%) of the filtered PE reads were obtained without adaptor/primer contamination and were used further for assembly. De novo assembly of the genome data was done using Velvet version 1.2.10 (3). In this data set, the k parameter (from 49 to 73) was optimized for best assembly. We found that at k of 51 mers, 89.5% (41,853,238) of the reads were aligned out of 46,766,934 total reads. The Paenibacillus sp. IHB B 3415 genome was assembled in 752 contigs, with a sum of 8,350,804 bases (N50 length, 26,240 bases; maximum contig length, 110 kb). We used SSPACE version 3.0 (4) to extend and merge the resulting scaffolds based on read-pair information and short overlaps to reduce the number of scaffolds. GapFiller version 1.1 (5) was used to close the gaps between short scaffolds contained within the large scaffolds by replacing unknown nucleotides (Ns) with true nucleotides based on paired-read information and short overlaps. After filling the gaps, the reads were assembled as 290 scaffolds summing 8,437,849 bp (N50 size, 78,530 bp; longest size, 315,231 bp; G+C content, 50.77%). Annotation conducted on the RAST server using the Glimmer3 option predicted 7,897 protein-coding genes, including 78 RNA genes and 454 predicted SEED subsystem features (6). A total of 2,868 (37%) features were covered by SEED subsystems, out of which 2,737 were nonhypothetical proteins. The annotation of Paenibacillus sp. IHB B 3415 using Prodigal (7) predicted 7,335 coding sequences summing to a total of 7,161,309 bases, which is ~85% of the assembled size of the genome (8,437,849 bases). Further, tRNAscan-SE (8) and RNAmmer (9) predicted 77 noncoding RNAs (71 tRNAs, 1 pseudo-tRNA, and 3 RNAs consisting of 3 copies of 5S rRNA and one copy each of 16S rRNA and 23S rRNA genes). It is interesting to note that the genome of Paenibacillus sp. IHB B 3415 is currently the second largest genome in the genus after Paenibacillus mucilaginosus in both overall size and the number of genes encoding proteins (10). In the IHB B 3415 genome, 1,011 genes were assigned for carbohydrate metabolism, 453 for amino acids and derivatives, and 131 for stress response (including 18 and 3 for heat and cold shock, respectively). The genome contains 16 genes predicted for cellulases, corroborating our results for cellulose degradation effected by the strain. A comparative genome analysis of cellulase-producing psychrotrophic Paenibacillus sp. IHB B 3415 with other Paenibacillus species will give insight into the evolutionary relationships and biotechnological significance of this important genus.

Nucleotide sequence accession numbers.

This whole-genome shotgun project has been deposited at DDBJ/EMBL/GenBank under the accession no. JUEI00000000. The version described in this paper is version JUEI01000000.

10 in total

1. Scaffolding pre-assembled contigs using SSPACE.

Authors: Marten Boetzer; Christiaan V Henkel; Hans J Jansen; Derek Butler; Walter Pirovano
Journal: Bioinformatics Date: 2010-12-12 Impact factor: 6.937

2. NGS QC Toolkit: a toolkit for quality control of next generation sequencing data.

Authors: Ravi K Patel; Mukesh Jain
Journal: PLoS One Date: 2012-02-01 Impact factor: 3.240

3. Velvet: algorithms for de novo short read assembly using de Bruijn graphs.

Authors: Daniel R Zerbino; Ewan Birney
Journal: Genome Res Date: 2008-03-18 Impact factor: 9.043

Review 4. Cellulases from psychrophilic microorganisms: a review.

Authors: Ramesh C Kasana; Arvind Gulati
Journal: J Basic Microbiol Date: 2011-03-24 Impact factor: 2.281

5. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence.

Authors: T M Lowe; S R Eddy
Journal: Nucleic Acids Res Date: 1997-03-01 Impact factor: 16.971

6. Prodigal: prokaryotic gene recognition and translation initiation site identification.

Authors: Doug Hyatt; Gwo-Liang Chen; Philip F Locascio; Miriam L Land; Frank W Larimer; Loren J Hauser
Journal: BMC Bioinformatics Date: 2010-03-08 Impact factor: 3.169

7. Toward almost closed genomes with GapFiller.

Authors: Marten Boetzer; Walter Pirovano
Journal: Genome Biol Date: 2012-06-25 Impact factor: 13.583

8. Genome Sequence of Growth-Improving Paenibacillus mucilaginosus Strain KNP414.

Authors: Jing-Jiang Lu; Jian-Feng Wang; Xiu-Fang Hu
Journal: Genome Announc Date: 2013-10-24

9. RNAmmer: consistent and rapid annotation of ribosomal RNA genes.

Authors: Karin Lagesen; Peter Hallin; Einar Andreas Rødland; Hans-Henrik Staerfeldt; Torbjørn Rognes; David W Ussery
Journal: Nucleic Acids Res Date: 2007-04-22 Impact factor: 16.971

10. The RAST Server: rapid annotations using subsystems technology.

Authors: Ramy K Aziz; Daniela Bartels; Aaron A Best; Matthew DeJongh; Terrence Disz; Robert A Edwards; Kevin Formsma; Svetlana Gerdes; Elizabeth M Glass; Michael Kubal; Folker Meyer; Gary J Olsen; Robert Olson; Andrei L Osterman; Ross A Overbeek; Leslie K McNeil; Daniel Paarmann; Tobias Paczian; Bruce Parrello; Gordon D Pusch; Claudia Reich; Rick Stevens; Olga Vassieva; Veronika Vonstein; Andreas Wilke; Olga Zagnitko
Journal: BMC Genomics Date: 2008-02-08 Impact factor: 3.969

10 in total

2 in total

1. Decoding the complete arsenal for cellulose and hemicellulose deconstruction in the highly efficient cellulose decomposer Paenibacillus O199.

Authors: Rubén López-Mondéjar; Daniela Zühlke; Tomáš Větrovský; Dörte Becher; Katharina Riedel; Petr Baldrian
Journal: Biotechnol Biofuels Date: 2016-05-14 Impact factor: 6.040

2. Genome assembly of Chryseobacterium sp. strain IHBB 10212 from glacier top-surface soil in the Indian trans-Himalayas with potential for hydrolytic enzymes.

Authors: Mohinder Pal; Mohit Kumar Swarnkar; Hena Dhar; Sanjay Chhibber; Arvind Gulati
Journal: Genom Data Date: 2017-07-01

2 in total