| Literature DB >> 26664656 |
Xiao-Yan You1, Hui Wang1, Guang-Yue Ren1, Jing-Jing Li1, Xu Duan1, Hua-Jun Zheng2, Zheng-Qiang Jiang3.
Abstract
Bacillus subtilis LM 4-2, a Gram-positive bacterium was isolated from a molybdenum mine in Luoyang city. Due to its strong resistance to molybdate and potential utilization in bioremediation of molybdate-polluted area, we describe the features of this organism, as well as its complete genome sequence and annotation. The genome was composed of a circular 4,069,266 bp chromosome with average GC content of 43.83 %, which included 4149 predicted ORFs and 116 RNA genes. Additionally, 687 transporter-coding and 116 redox protein-coding genes were identified in the strain LM 4-2 genome.Entities:
Keywords: Bacillus subtilis LM 4–2; Bioremediation; Gram-positive; Molybdate; Molybdenum-resistance
Year: 2015 PMID: 26664656 PMCID: PMC4674931 DOI: 10.1186/s40793-015-0118-6
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Classification and general features of Bacillus subtilis LM 4–2 according to the MIGS recommendations [13]
| MIGS ID | Property | Term | Evidence codea |
|---|---|---|---|
| Classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| Gram stain | Positive | IDA | |
| Cell shape | Rod-shaped | IDA | |
| Motility | Motile | IDA | |
| Sporulation | Spore-forming | NAS | |
| Temperature range | 4–45 °C | IDA | |
| Optimum temperature | 30 °C | IDA | |
| pH range; Optimum | 4–9; 6.0 | IDA | |
| Carbon source | organic carbon source | IDA | |
| MIGS-6 | Habitat | soil | IDA |
| MIGS-6.3 | Salinity | salt tolerant | NAS |
| MIGS-22 | Oxygen requirement | aerobic | IDA |
| MIGS-15 | Biotic relationship | free-living | NAS |
| MIGS-14 | Pathogenicity | non-pathogen | NAS |
| MIGS-4 | Geographic location | Luoyang/Henan/China | IDA |
| MIGS-5 | Sample collection | 2012 | IDA |
| MIGS-4.1 | Latitude | 33°55′3.21″N | |
| MIGS-4.2 | Longitude | 111°31′0.42″E | |
| MIGS-4.4 | Altitude | 1164.78 |
Evidence codes - IDA Inferred from Direct Assay, TAS Traceable Author Statement (i.e., a direct report exists in the literature), NAS Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [26]
Fig. 1Transmission electron microscopy of strain LM 4–2. Scale bar corresponds to 1.0 μm
Fig. 2Neighbor-Joining Phylogenetic tree was built with MEGA 5 based on 16S rRNAsequences [41]. The strains and their corresponding GenBank accession numbers for 16S rDNA sequences are: a Bacillus thioparans BMP-1 (DQ371431); b Bacillus selenatarsenatis (AB262082); c Bacillus methanolicus NCIMB 13113 (AB112727); d Bacillus azotoformans NBRC 15712 (AB363732); e Bacillus indicus Sd/3 (AJ583158); f Bacillus amyloliquefaciens BCRC 11601 (NR_116022); g Bacillus subtilis 168 (NC_000964); h Bacillus subtilis PPL-SC9 (KM226924); i Bacillus cohnii DSM 6307 (X76437); j Bacillus cereus ATCC 14579 (NR_074540); k Bacillus arsenicus con a/3 (AJ606700); l Bacillus arseniciselenatis E1H (AF064705); m Bacillus macyae JMM-4 (AY032601); n Bacillus beveridgei MLTeJB (FJ825145); o Bacillus selenitireducens MLS10 (CP001791)
Genome sequencing project information
| MIGS ID | Property | Term |
|---|---|---|
| MIGS 31 | Finishing quality | Complete |
| MIGS-28 | Libraries used | Two libraries, 20 Kb PacBio library, 2 × 150 bpllumina library |
| MIGS 29 | Sequencing platforms | PacBio RS II, Illumina Hi-Seq |
| MIGS 31.2 | Fold coverage | 213-and 409-fold |
| MIGS 30 | Assemblers | HGAP, bowtie2 |
| MIGS 32 | Gene calling method | Glimmer 3.02 and GeneMark |
| Locus Tag | BsLM | |
| Genbank ID | CP011101 | |
| GenBank Date of Release | April 23, 2015 | |
| GOLD ID | Gp0112736 | |
| BIOPROJECT | PRJNA277611 | |
| MIGS 13 | Source Material Identifier | CGMCC 1.15213 |
| Project relevance | Environmental, Bioremediation |
Genome statistics
| Attribute | Value | % of Total |
|---|---|---|
| Genome size (bp) | 4,069,266 | 100.00 |
| DNA coding (bp) | 3,596,010 | 88.37 |
| DNA G + C (bp) | 1,811,637 | 44.52 |
| Total genes | 4265 | 100.00 |
| Protein coding genes | 4149 | 97.28 |
| RNA genes | 116 | 2.72 |
| rRNA operons | 10 | 0.23 |
| Genes with function prediction | 2742 | 64.29 |
| Genes assigned to COGs | 3111 | 72.94 |
| Genes with Pfam domains | 3656 | 85.72 |
| Genes with signal peptides | 541 | 12.68 |
| Genes with transmembrane helices | 778 | 18.24 |
| CRISPR repeats | 0 | 0 |
Fig. 3Graphic representation of circular map of the chromosome of strain LM 4–2.The map was generated with the DNAPlotter [54]. From outside to the center: the first two outer circles represent the positions of genes in the chromosome (Circle 1: plus strand, Circle 2: minus strand). Circle 3 represents tRNA genes (blue), Circle 4 represents G + C content, and Circle 5 represents GC skew
Number of genes associated with general COG functional categoriesa
| Code | Value | % age | Description |
|---|---|---|---|
| J | 149 | 3.59 | Translation, ribosomal structure and biogenesis |
| A | 0 | 0.00 | RNA processing and modification |
| K | 267 | 6.44 | Transcription |
| L | 114 | 2.75 | Replication, recombination and repair |
| B | 1 | 0.02 | Chromatin structure and dynamics |
| D | 36 | 0.87 | Cell cycle control, Cell division, chromosome partitioning |
| V | 54 | 1.30 | Defense mechanisms |
| T | 127 | 3.06 | Signal transduction mechanisms |
| M | 191 | 4.60 | Cell wall/membrane biogenesis |
| N | 60 | 1.45 | Cell motility |
| U | 25 | 0.60 | Intracellular trafficking and secretion |
| O | 101 | 2.43 | Posttranslational modification, protein turnover, chaperones |
| C | 166 | 4.00 | Energy production and conversion |
| G | 286 | 6.89 | Carbohydrate transport and metabolism |
| E | 298 | 7.18 | Amino acid transport and metabolism |
| F | 82 | 1.98 | Nucleotide transport and metabolism |
| H | 114 | 2.75 | Coenzyme transport and metabolism |
| I | 89 | 2.14 | Lipid transport and metabolism |
| P | 168 | 4.05 | Inorganic ion transport and metabolism |
| Q | 72 | 1.74 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 364 | 8.77 | General function prediction only |
| S | 347 | 8.36 | Function unknown |
| - | 1039 | 25.04 | Not in COGs |
aThe total is based on the total number of protein coding genes in the annotated genome