| Literature DB >> 25197451 |
Peng Bao1, Jian-Qiang Su2, Zheng-Yi Hu3, Max M Häggblom4, Yong-Guan Zhu5.
Abstract
Bacillus sp. strain ZYK, a member of the phylum Firmicutes, is of interest for its ability to reduce nitrate and selenite and for its resistance to arsenic under anaerobic conditions. Here we describe some key features of this organism, together with the complete genome sequence and annotation. The 3,575,797 bp long chromosome with its 3,454 protein-coding and 70 RNA genes, and the information gained from its sequence will be relevant to the elucidation of microbially-mediated transformations of nitrogen, selenium and arsenic in paddy soil.Entities:
Keywords: Bacillaceae; Gram-positive; anaerobic; arsenic resistance; nitrate-reduction; paddy soil; selenite-reduction; spore-forming
Year: 2014 PMID: 25197451 PMCID: PMC4148952 DOI: 10.4056/sigs.3817480
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Figure 1Phylogenetic tree highlighting the position of sp. ZYK relative to selected species. The strains and their corresponding GenBank accession numbers of 16S rRNA genes are as indicated. The tree, based on 1,545 positions, was built with MEGA 5 [11] using the Maximum likelihood method. Bar: 0.01substitutions per nucleotide position.
Figure 2Transmission electron microscopy of strain ZYK. Scale bar corresponds to 1.0 μm.
Classification and general features of strain ZYK according to the MIGS recommendations [1]
| | | | |
|---|---|---|---|
| Classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Strain ZYK | IDA | ||
| Gram stain | Positive | IDA | |
| MIGS-37.1 | Cell shape | Rod-shaped | NAS |
| MIGS-37.2 | Motility | Motile | NAS |
| MIGS-37.3 | Sporulation | Sporulating | NAS |
| MIGS-37.9 | Cell arrangement | Single | NAS |
| MIGS-37.12 | Optimum pH | 7.0 | NAS |
| MIGS-6 | Optimum temperature | 30°C | NAS |
| Salinity | 0-1.1% | IDA | |
| MIGS-22 | Oxygen requirement | Strict | NAS |
| Carbon source | D-Glucose, Maltose, lactose, sucrose | IDA | |
| MIGS-6 | Habitat | Paddy soil | NAS |
| MIGS-15 | Biotic relationship | Free-living | NAS |
| Pathogenicity | None-pathogen | NAS | |
| Biosafety level | 1 | NAS | |
| MIGS-4 | Geographic location | Dehong, Yunnan, China | NAS |
| MIGS-4.1 | Latitude | 24o64´70"N | NAS |
| MIGS-4.2 | Longitude | 98o53´45"E | NAS |
| MIGS-4.5 | Isolation | Paddy soil | NAS |
Evidence codes - IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [23]. If the evidence code is IDA, the property was directly observed by one of the authors.
Genome sequencing project information
| | | |
|---|---|---|
| MIGS-31 | Finishing quality | Complete |
| MIGS-28 | Libraries used | Two libraries 500 bp PCR-free library, 2000 bp index library |
| MIGS-29 | Sequencing platforms | Illumina |
| MIGS-31.2 | Fold coverage | 140× |
| MIGS-30 | Assemblers | SOAPdenovo 1.05 |
| MIGS-32 | Gene calling method | Glimmer 3.0 |
| Locus TAG | D612 | |
| Genbank ID | ANOK00000000 | |
| Genbank Date of Release | January 15, 2013 | |
| GOLD ID | Gi22906 | |
| NCBI taxon ID | 1191699 | |
| MIGS-13 | Source material identifier | DSMZ 26460, CGMCC 1.5179 |
| MIGS-38.2 | Project relevance | Agricultural, Bioremediation, Environmental |
Figure 3Graphical representation of circular map of the chromosome of strain ZYK. From outside to the center: Genes on forward strand (colored by COG categories), Genes on reverse strand (colored by COG categories), RNA genes (tRNAs green, rRNAs red, other RNAs black), GC content, GC skew.
Genome statistics
| | | |
|---|---|---|
| Genome size (bp) | 3,575,797 | 100.00 |
| DNA coding region (bp) | 3,002,982 | 83.98 |
| DNA G+C content (bp) | 1,290,862 | 36.10 |
| Total genes | 3454 | 100.00 |
| RNA genes | 70 | 2.03 |
| Protein-coding genes (bp) | 3,002,982 | 83.98 |
| Genes with function prediction | 3261 | 94.41 |
| Genes assigned to COGs | 2,030 | 58.77 |
| Genes assigned to Pfam domains (bp) | 617,696 | 17.27 |
| Genes with signal peptides | 169 | 4.89 |
| Genes with transmembrane helices | 132 | 3.82 |
| CRISPR repeats | 84 | 0.09 |
Number of genes associated with the 25 general COG functional categories
| | | | |
|---|---|---|---|
| J | 149.0 | 6.5 | Translation |
| A | 0.0 | 0.0 | RNA processing and modification |
| K | 164.0 | 7.1 | Transcription |
| L | 119.0 | 5.2 | Replication, recombination and repair |
| B | 1.0 | 0.04 | Chromatin structure and dynamics |
| D | 27.0 | 1.2 | Cell cycle control, mitosis and meiosis |
| Y | 0.0 | 0.0 | Nuclear structure |
| V | 24.0 | 1.0 | Defense mechanisms |
| T | 162.0 | 7.0 | Signal transduction mechanisms |
| M | 95.0 | 4.1 | Cell wall/membrane biogenesis |
| N | 75.0 | 3.3 | Cell motility |
| Z | 0.0 | 0.0 | Cytoskeleton |
| W | 0.0 | 0.0 | Extracellular structures |
| U | 44.0 | 1.9 | Intracellular trafficking and secretion |
| O | 97.0 | 4.2 | Posttranslational modification, protein turnover, chaperones |
| C | 155 | 6.7 | Energy production and conversion |
| G | 79.0 | 3.4 | Carbohydrate transport and metabolism |
| E | 239.0 | 10.4 | Amino acid transport and metabolism |
| F | 61.0 | 2.7 | Nucleotide transport and metabolism |
| H | 93.0 | 4.0 | Coenzyme transport and metabolism |
| I | 97.0 | 4.2 | Lipid transport and metabolism |
| P | 127.0 | 5.5 | Inorganic ion transport and metabolism |
| Q | 38.0 | 1.7 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 261.0 | 11.3 | General function prediction only |
| S | 193.0 | 8.4 | Function unknown |
| - | 1424 | 41.2 | Not in COGs |
a) The total is based on the total number of protein coding genes in the annotated genome.