Literature DB >> 26664656

Complete genome sequence of the molybdenum-resistant bacterium Bacillus subtilis strain LM 4-2.

Xiao-Yan You1, Hui Wang1, Guang-Yue Ren1, Jing-Jing Li1, Xu Duan1, Hua-Jun Zheng2, Zheng-Qiang Jiang3.   

Abstract

Bacillus subtilis LM 4-2, a Gram-positive bacterium was isolated from a molybdenum mine in Luoyang city. Due to its strong resistance to molybdate and potential utilization in bioremediation of molybdate-polluted area, we describe the features of this organism, as well as its complete genome sequence and annotation. The genome was composed of a circular 4,069,266 bp chromosome with average GC content of 43.83 %, which included 4149 predicted ORFs and 116 RNA genes. Additionally, 687 transporter-coding and 116 redox protein-coding genes were identified in the strain LM 4-2 genome.

Entities:  

Keywords:  Bacillus subtilis LM 4–2; Bioremediation; Gram-positive; Molybdate; Molybdenum-resistance

Year:  2015        PMID: 26664656      PMCID: PMC4674931          DOI: 10.1186/s40793-015-0118-6

Source DB:  PubMed          Journal:  Stand Genomic Sci        ISSN: 1944-3277


Introduction

LM 4–2 was a molybdenum-resistant strain isolated from a molybdenum mine. It has been reported that many microbes can resist the toxicity of molybdate ion though reduction of molybdate (Mo6+) to Mo-blue. Molybdenum-reducing microorganisms came from a variety of genera and included the following species, spp. [1, 2], [3], [4], [5, 6], [7], spp. [8], and K12 [9]. The capability of molybdate-reduction presents potential possibility of molybdenum bioremediationin many polluted areas [10]. Strain LM 4–2 showed stronger resistance to molybdate (up to 850 mM Na2MoO4) than many other reported molybdenum-resistant bacteria [11, 12]. However, no information related to the molecular mechanism of molybdenum-resistance has been identified, also in genus . Thus, strain LM 4–2 might be a perfect subject for us to unveil the mechanism and evaluate its possibility utilization in bioremediation. Here we present the complete genome sequence and detailed genomic features of LM 4–2.

Organism information

Classification and features

LM 4–2 (CGMCC 1.15213) is a Gram-positive, spore-forming, rod-shaped (0.3-0.5 μm wide and 3.0–4.0 μm long) with an optimum pH 6.0 and optimum temperature of 30 °C (Table 1, Fig. 1). Colonies are milky white and matte with a wrinkled surface when growth on R2A agar medium. Strictly aerobic and catalase formed. Carbon substrates utilized for growth by strain LM 4–2 included D-glucose, maltose, lactose and sucrose. Strain LM 4–2 is closely related to species based on the BLAST results of 16S rRNA gene [27]. The identity of 16S rRNA gene sequence between strain LM 4–2 and type strain DSM 10 is 100 %. A phylogenetic tree was constructed using the neighbor-Joining method under the default settings for complete sequence of 16S rRNA gene derived from genome of strain LM 4–2, along with the sequences of representative members of genus [28-34]. The phylogenetic tree was assessed by boot-strapped for 1000 times, which is shown in Fig. 2. Average nucleotide identity (ANI), average amino acid identity (AAI) and in silico Genome-to-Genome Hybridization value (GGDH) were calculated between the genomes of strain LM 4–2 and other 30 species that have been completed sequenced [35-40]. Results show that strain LM 4–2 shares high ANI (>95 %, 23 of total 30), AAI (>95 %, 23 of total 30) and GGDH value (>70 %, 24 of total 30) with most of the complete sequenced species, and highest ANI (99.00 %), AAI (99.13 %) and GGDH value (92.20 % ± 1.85) with strain TO-A JPC (Additional file 1: Table S1).
Table 1

Classification and general features of Bacillus subtilis LM 4–2 according to the MIGS recommendations [13]

MIGS IDPropertyTermEvidence codea
ClassificationDomain Bacteria TAS [14]
Phylum Firmicutes TAS [1517]
Class Bacilli TAS [18, 19]
Order Bacillales TAS [20, 21]
Family Bacillaceae TAS [20, 22]
Genus Bacillus TAS [20, 23, 24]
Species Bacillus subtilis TAS [25]
Gram stainPositiveIDA
Cell shapeRod-shapedIDA
MotilityMotileIDA
SporulationSpore-formingNAS
Temperature range4–45 °CIDA
Optimum temperature30 °CIDA
pH range; Optimum4–9; 6.0IDA
Carbon sourceorganic carbon sourceIDA
MIGS-6HabitatsoilIDA
MIGS-6.3Salinitysalt tolerantNAS
MIGS-22Oxygen requirementaerobicIDA
MIGS-15Biotic relationshipfree-livingNAS
MIGS-14Pathogenicitynon-pathogenNAS
MIGS-4Geographic locationLuoyang/Henan/ChinaIDA
MIGS-5Sample collection2012IDA
MIGS-4.1Latitude33°55′3.21″N
MIGS-4.2Longitude111°31′0.42″E
MIGS-4.4Altitude1164.78

Evidence codes - IDA Inferred from Direct Assay, TAS Traceable Author Statement (i.e., a direct report exists in the literature), NAS Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [26]

Fig. 1

Transmission electron microscopy of strain LM 4–2. Scale bar corresponds to 1.0 μm

Fig. 2

Neighbor-Joining Phylogenetic tree was built with MEGA 5 based on 16S rRNAsequences [41]. The strains and their corresponding GenBank accession numbers for 16S rDNA sequences are: a Bacillus thioparans BMP-1 (DQ371431); b Bacillus selenatarsenatis (AB262082); c Bacillus methanolicus NCIMB 13113 (AB112727); d Bacillus azotoformans NBRC 15712 (AB363732); e Bacillus indicus Sd/3 (AJ583158); f Bacillus amyloliquefaciens BCRC 11601 (NR_116022); g Bacillus subtilis 168 (NC_000964); h Bacillus subtilis PPL-SC9 (KM226924); i Bacillus cohnii DSM 6307 (X76437); j Bacillus cereus ATCC 14579 (NR_074540); k Bacillus arsenicus con a/3 (AJ606700); l Bacillus arseniciselenatis E1H (AF064705); m Bacillus macyae JMM-4 (AY032601); n Bacillus beveridgei MLTeJB (FJ825145); o Bacillus selenitireducens MLS10 (CP001791)

Classification and general features of Bacillus subtilis LM 4–2 according to the MIGS recommendations [13] Evidence codes - IDA Inferred from Direct Assay, TAS Traceable Author Statement (i.e., a direct report exists in the literature), NAS Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [26] Transmission electron microscopy of strain LM 4–2. Scale bar corresponds to 1.0 μm Neighbor-Joining Phylogenetic tree was built with MEGA 5 based on 16S rRNAsequences [41]. The strains and their corresponding GenBank accession numbers for 16S rDNA sequences are: a Bacillus thioparans BMP-1 (DQ371431); b Bacillus selenatarsenatis (AB262082); c Bacillus methanolicus NCIMB 13113 (AB112727); d Bacillus azotoformans NBRC 15712 (AB363732); e Bacillus indicus Sd/3 (AJ583158); f Bacillus amyloliquefaciens BCRC 11601 (NR_116022); g Bacillus subtilis 168 (NC_000964); h Bacillus subtilis PPL-SC9 (KM226924); i Bacillus cohnii DSM 6307 (X76437); j Bacillus cereus ATCC 14579 (NR_074540); k Bacillus arsenicus con a/3 (AJ606700); l Bacillus arseniciselenatis E1H (AF064705); m Bacillus macyae JMM-4 (AY032601); n Bacillus beveridgei MLTeJB (FJ825145); o Bacillus selenitireducens MLS10 (CP001791)

Genome sequencing information

Genome project history

LM 4–2 was selected for sequencing due to its strong resistance to molybdate and potential utilization in bioremediation of molybdate-polluted areas. The genome sequence was deposited in GenBank under accession number CP011101 and the genome project was deposited in the Genomes on Line Database [42] under Gp0112736. Genome sequencing and annotation were performed by Chinese National Human Genome Center at Shanghai. A summary of the project was given in Table 2.
Table 2

Genome sequencing project information

MIGS IDPropertyTerm
MIGS 31Finishing qualityComplete
MIGS-28Libraries usedTwo libraries, 20 Kb PacBio library, 2 × 150 bpllumina library
MIGS 29Sequencing platformsPacBio RS II, Illumina Hi-Seq
MIGS 31.2Fold coverage213-and 409-fold
MIGS 30AssemblersHGAP, bowtie2
MIGS 32Gene calling methodGlimmer 3.02 and GeneMark
Locus TagBsLM
Genbank IDCP011101
GenBank Date of ReleaseApril 23, 2015
GOLD IDGp0112736
BIOPROJECTPRJNA277611
MIGS 13Source Material IdentifierCGMCC 1.15213
Project relevanceEnvironmental, Bioremediation
Genome sequencing project information

Growth conditions and genomic DNA preparation

LM 4–2 was inoculated in 200 mL R2A medium and cultivated for 8 h at 30 °C in a shaker with speed of 200 rpm. 1.2 g of harvested cells was suspended in 5 mL TE (pH8.0) with 10 mg/mL lysozymeat 30 °C for 4 h. After centrifugation (12,000 rpm) for 10 min, genomic DNA was extracted by phenol-chloroform methods as described previously [43]. DNA was dissolved in 2 mL sterilized deionized water with a final concentration of 12.67 μg/μL and 2.04 of OD260/OD280 ratio determined by NanoDrop 2000 spectrophotometer (Thermo Scientific, USA). The genomic DNA was stored in −20 °C freezer.

Genome sequencing and assembly

The genome of LM 4–2 was sequenced by a dual sequencing approach that using a combination of PacBio RS II and Genome Analyzer IIx sequence platforms. Approximately 121,583 PacBio and 1637 million Illumina reads were generated from PacBio platform and the Illumina platform (2 × 150 bp paired-end sequencing) with average sequence coverage of 213-and 409-fold.Sequence reads from the PacBio RS II were assembled by using hierarchical genome-assembly process assembler and finally only one self-cycled supper contig was generated. The Illumina reads were quality trimmed with the CLC Genomics Workbench and then utilized for error correction of the PacBio reads by using bowtie2 (version 2.1.0) software [44].

Genome annotation

The Glimmer 3.02 and GeneMark programs were used to predict the positions of open reading frames [45, 46]. Protein function was predicted by the following methods: 1) homology searches in the GenBank and UniProt protein database [47]; 2) function assignment searches in CDD database [48]; and 3) domain or motif searches in the Pfam databases [49]. The KEGG database was used to reconstruct metabolic pathways [50]. Ribosomal RNAs and Transfer RNAs were predicted by using RNAmmer and tRNAscan-SE programs [51, 52]. Transporters were predicted by searching the TCDB database using BLASTP program [27, 53] with expectation value lower than 1e-05.

Genome properties

The complete strain LM 4–2 genome was composed of a circular 4,069,266 bp chromosome with an overall 43.83 % G + C content. Four thousand one hundred forty-nine ORFs, 10 sets of rRNA operons, and 84 tRNAs were predicted in the LM 4–2 genome (Table 3 and Fig. 3). Two thousand seven hundred forty-two of total 4149 predicted ORFs could be functional assignment, 1415 were annotated as hypothetical proteins. When analyzed for biological roles according to COG categories, amino acid transport and metabolism proteins accounted for the largest percent (7.18 %) of all functionally assigned proteins, followed by carbohydrate transport and metabolism proteins (6.89 %), and Transcription proteins (6.43 %). There are 687 transporter-coding and 116 redox protein-coding genes were identified in the LM 4–2 genome. The distribution of genes into COGs functional categories is presented in Table 4.
Table 3

Genome statistics

AttributeValue% of Total
Genome size (bp)4,069,266100.00
DNA coding (bp)3,596,01088.37
DNA G + C (bp)1,811,63744.52
Total genes4265100.00
Protein coding genes414997.28
RNA genes1162.72
rRNA operons100.23
Genes with function prediction274264.29
Genes assigned to COGs311172.94
Genes with Pfam domains365685.72
Genes with signal peptides54112.68
Genes with transmembrane helices77818.24
CRISPR repeats00
Fig. 3

Graphic representation of circular map of the chromosome of strain LM 4–2.The map was generated with the DNAPlotter [54]. From outside to the center: the first two outer circles represent the positions of genes in the chromosome (Circle 1: plus strand, Circle 2: minus strand). Circle 3 represents tRNA genes (blue), Circle 4 represents G + C content, and Circle 5 represents GC skew

Table 4

Number of genes associated with general COG functional categoriesa

CodeValue% ageDescription
J1493.59Translation, ribosomal structure and biogenesis
A00.00RNA processing and modification
K2676.44Transcription
L1142.75Replication, recombination and repair
B10.02Chromatin structure and dynamics
D360.87Cell cycle control, Cell division, chromosome partitioning
V541.30Defense mechanisms
T1273.06Signal transduction mechanisms
M1914.60Cell wall/membrane biogenesis
N601.45Cell motility
U250.60Intracellular trafficking and secretion
O1012.43Posttranslational modification, protein turnover, chaperones
C1664.00Energy production and conversion
G2866.89Carbohydrate transport and metabolism
E2987.18Amino acid transport and metabolism
F821.98Nucleotide transport and metabolism
H1142.75Coenzyme transport and metabolism
I892.14Lipid transport and metabolism
P1684.05Inorganic ion transport and metabolism
Q721.74Secondary metabolites biosynthesis, transport and catabolism
R3648.77General function prediction only
S3478.36Function unknown
-103925.04Not in COGs

aThe total is based on the total number of protein coding genes in the annotated genome

Genome statistics Graphic representation of circular map of the chromosome of strain LM 4–2.The map was generated with the DNAPlotter [54]. From outside to the center: the first two outer circles represent the positions of genes in the chromosome (Circle 1: plus strand, Circle 2: minus strand). Circle 3 represents tRNA genes (blue), Circle 4 represents G + C content, and Circle 5 represents GC skew Number of genes associated with general COG functional categoriesa aThe total is based on the total number of protein coding genes in the annotated genome

Conclusions

Molybdenum pollution has been reported in water and soils all around the world [55]. Some Mo-resistance bacteria can be used to immobilize soluble molybdenum toinsoluble formsalong with reducing the toxicity. In this study we presented the complete genome sequence of LM 4–2, which was isolated from a molybdenum mine in Luoyang city. Due to its strong resistance to molybdate and potential utilization in bioremediation of molybdate-polluted area, we sequence the genome and try to identify the possible molecular mechanism of molybdenum-resistance. Genomic analysis of strain LM 4–2 revealed 687 transporter-coding and 116 redox protein-coding genes were separated in the genome. Three genome islands were identified in the strain LM 4–2 genome, covering 2.71 % of the whole genome. Three gene clusters were involved in the non-ribosomal synthesis of lipopeptides, such as surfactin, fengycin, and dipeptide bacilysin. Additionally, one gene clusters for subtilosin A synthesis and one gene clusters for polyketide synthesis. No CRISPRs were identified in the strain LM 4–2 genome. The complete genome sequence of strain LM 4–2 will facilitate functional genomics to elucidate the molecular mechanisms that underlie molybdenum-resistance and it may facilitate the bioremediation of molybdenum-contaminated areas.
  34 in total

1.  Fast gapped-read alignment with Bowtie 2.

Authors:  Ben Langmead; Steven L Salzberg
Journal:  Nat Methods       Date:  2012-03-04       Impact factor: 28.547

2.  The Genomes OnLine Database (GOLD) v.5: a metadata management system based on a four level (meta)genome project classification.

Authors:  T B K Reddy; Alex D Thomas; Dimitri Stamatis; Jon Bertsch; Michelle Isbandi; Jakob Jansson; Jyothi Mallajosyula; Ioanna Pagani; Elizabeth A Lobos; Nikos C Kyrpides
Journal:  Nucleic Acids Res       Date:  2014-10-27       Impact factor: 16.971

3.  Relationship of Bacillus subtilis clades associated with strains 168 and W23: a proposal for Bacillus subtilis subsp. subtilis subsp. nov. and Bacillus subtilis subsp. spizizenii subsp. nov.

Authors:  L K Nakamura; M S Roberts; F M Cohan
Journal:  Int J Syst Bacteriol       Date:  1999-07

4.  Reduction of Mo6+ with elemental sulfur by Thiobacillus ferrooxidans.

Authors:  T Sugio; Y Tsujita; T Katagiri; K Inagaki; T Tano
Journal:  J Bacteriol       Date:  1988-12       Impact factor: 3.490

5.  Molybdate reduction by Pseudomonas sp. strain DRY2.

Authors:  M Y Shukor; S A Ahmad; M M M Nadzir; M P Abdullah; N A Shamaan; M A Syed
Journal:  J Appl Microbiol       Date:  2009-10-26       Impact factor: 3.772

6.  Reduction of molybdate to molybdenum blue by Enterobacter sp. strain Dr.Y13.

Authors:  M Y Shukor; M F Rahman; N A Shamaan; M A Syed
Journal:  J Basic Microbiol       Date:  2009-09       Impact factor: 2.281

7.  Reclassification of Brevibacillus brevis strains NCIMB 13288 and DSM 6472 (=NRRL NRS-887) as Aneurinibacillus danicus sp. nov. and Brevibacillus limnophilus sp. nov.

Authors:  Keiichi Goto; Rieko Fujita; Yuko Kato; Mika Asahara; Akira Yokota
Journal:  Int J Syst Evol Microbiol       Date:  2004-03       Impact factor: 2.747

8.  TCDB: the Transporter Classification Database for membrane transport protein analyses and information.

Authors:  Milton H Saier; Can V Tran; Ravi D Barabote
Journal:  Nucleic Acids Res       Date:  2006-01-01       Impact factor: 16.971

9.  Hexavalent molybdenum reduction to mo-blue by a sodium-dodecyl-sulfate-degrading Klebsiella oxytoca strain DRY14.

Authors:  M I E Halmi; S W Zuhainis; M T Yusof; N A Shaharuddin; W Helmi; Y Shukor; M A Syed; S A Ahmad
Journal:  Biomed Res Int       Date:  2013-12-09       Impact factor: 3.411

10.  Molybdenum reduction to molybdenum blue in Serratia sp. Strain DRY5 is catalyzed by a novel molybdenum-reducing enzyme.

Authors:  M Y Shukor; M I E Halmi; M F A Rahman; N A Shamaan; M A Syed
Journal:  Biomed Res Int       Date:  2014-03-03       Impact factor: 3.411

View more
  1 in total

Review 1.  Microbiological Reduction of Molybdenum to Molybdenum Blue as a Sustainable Remediation Tool for Molybdenum: A Comprehensive Review.

Authors:  Hafeez Muhammad Yakasai; Mohd Fadhil Rahman; Motharasan Manogaran; Nur Adeela Yasid; Mohd Arif Syed; Nor Aripin Shamaan; Mohd Yunus Shukor
Journal:  Int J Environ Res Public Health       Date:  2021-05-27       Impact factor: 3.390

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.