| Literature DB >> 25685257 |
Tito David Peña-Montenegro1, Lucía Lozano1, Jenny Dussán1.
Abstract
Lysinibacillus sphaericus CBAM5, was isolated from subsurface soil of oil well explorations in the Easter Planes of Colombia. This strain has potential in bioremediation of heavy-metal polluted environments and biological control of Culex quinquefasciatus. According to the phylogenetic analysis of 16S rRNA gene sequences, the strain CBAM5 was assigned to the Lysinibacillus sphaericus taxonomic group 1 that comprises mosquito pathogenic strains. After a combination assembly-integration, alignment and gap-filling steps, we propose a 4,610,292 bp chromosomal scaffold. The whole genome (consisting of 5,146,656 bp long, 60 contigs and 5,209 predicted-coding sequences) revealed strong functional and syntenial similarities to the L. sphaericus C3-41 genome. Mosquitocidal (Mtx), binary (Bin) toxins, cereolysin O, and heavy metal resistance clusters from nik, ars, czc, mnt, ter, cop, cad, and znu operons were identified.Entities:
Keywords: Binary toxins; DNA homology; Heavy metal tolerance; Lysinibacillus sphaericus CBAM5; Mosquitocidal toxins; S-layer proteins
Year: 2015 PMID: 25685257 PMCID: PMC4317669 DOI: 10.1186/1944-3277-10-2
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Figure 1Phylogenetic tree highlighting the position of CBAM5. Phylogenetic analyses included available type strains and other non-assigned species within the families Alicyclobacillaceae and Bacillaceae. Right brackets encompass each homology group (1–7) according to Nakamura’s benchmarks [15]. Nucleotide sequences obtained from GenBank and used in the phylogenetic analyses were as follows: Alicyclobacillus cycloheptanicus 1457 (X51928), Geobacillus stearothermophilus 10 (X57309), Bacillus subtilis 168T (X60646), Bacillus licheniformis DSM 13T (X68416), Bacillus megaterium IAM 13418T (D16273), Bacillus sp. BD-87 (AF169520), Bacillus sp. BD-99 (AF169525), Bacillus sp. NRS-1691 (AF169531), Bacillus sp. NRS-1693 (AF169533), Solibacillus silvestris StLB046 (NR_074954), Lysinibacillus massiliensis 4400831 (NR_043092), Bacillus sp. B-1876 (AF169494), Bacillus sp. NRS-1198 (AF169528), Bacillus sp. B-4297 (AF169507), Bacillus sp. NRS-111 (AF169526), Lysinibacillus sphaericus OT4b.31 (JQ744623), Lysinibacillus sphaericus B-23268T (AF169495), Bacillus sp. B-183 (AF169493), Lysinibacillus sphaericus JG-A12 (AM292655), Bacillus sp. B-14905 (AF169491), Bacillus sp. B-14865 (AF169490), Lysinibacillus fusiformis NRS-350 (AJ310083), Lysinibacillus sphaericus C3-41 (CP000817:16818–18361), Bacillus sp. B-14957 (AF169492), Lysinibacillus sphaericus 2362 (L14011), Bacillus sp. B-23269 (AF169496), Lysinibacillus sphaericus CBAM5 (KK037167:893906–895445). The tree with the highest log likelihood (-6732.2703) is shown. The percentage of trees in which the associated taxa clustered together is shown next to the branches. The tree is drawn to scale, with branch lengths measured in the number of substitutions per site. Lineages with type strain genome sequencing projects registered in GOLD [57] are labeled with one asterisk, those also listed as ‘Complete and Published’ with two asterisks.
Classification and general features of CBAM5 according to the MIGS recommendations[19]
| | Current classification | Domain | TAS [ |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| Strain CBAM5 | TAS [ | ||
| | Gram stain | Positive in vegetative cells, variable in sporulating stages | IDA |
| | Cell shape | Straight rods | IDA |
| | Motility | Motile | IDA |
| | Sporulation | Sporulating | IDA |
| | Temperature range | 15 – 40°C | IDA |
| | Optimum temperature | 30°C | IDA |
| | Carbon source | Complex carbohydrates | TAS [ |
| | Energy metabolism | Heterotroph | TAS [ |
| MIGS-6 | Habitat | Subsurface soil | TAS [ |
| MIGS-6.3 | Salinity | Growth in Luria-Bertani broth (5% NaCl) | IDA |
| MIGS-22 | Oxygen requirement | Aerobic | TAS [ |
| MIGS-15 | Biotic relationship | Free living | TAS [ |
| MIGS-14 | Pathogenicity | Pathogenic toward | TAS [ |
| MIGS-4 | Geographic location | Eastern Planes oil basins, Colombia | TAS [ |
| MIGS-5 | Sample collection time | January 2005 | TAS [ |
| MIGS-4.1 | Latitude | 5.0121944 | TAS [ |
| MIGS-4.2 | Longitude | -72.7109167 | TAS [ |
| MIGS-4.3 | Depth | 20 m | TAS [ |
| MIGS-4.4 | Altitude | 350 m above sea level | TAS [ |
aEvidence codes - IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [30].
Genome sequencing project information
| MIGS-31 | Finishing quality | Improved high-quality draft |
| MIGS-28 | Libraries used | One paired end tags 90:90 bp with 500 bp insert |
| MIGS-29 | Sequencing platforms | Illumina Hi-Seq 2000 |
| MIGS-31.2 | Fold coverage | 100× |
| MIGS-30 | Assemblers | CISA version 1.3, SOAPdenovo version 2.04, Velvet version 1.2.10, ABySS version 1.3.7, CLC Assembly Cell version 4.0.10 |
| MIGS-32 | Gene calling method | Glimmer3, tRNAscan-SE, RNAmmer |
| | Genbank ID | AYKQ00000000 |
| | Genbank Date of Release | February 1, 2014 |
| | GOLD ID | Gi0057485 |
| Project relevance | Biotechnology, metabolic pathway |
Summary of genome
| Chromosomal scaffold | 4,610,292 | Circular | KK037167.1 |
| Extrachromosomal elements | 536,364 | Linear | KK037168.1-KK037224.1 |
Nucleotide content and gene count levels of the genome
| Genome size (bp) | 5,146,656 | 100.00 |
| DNA GC content (bp) | 1,913,947 | 37.19 |
| DNA coding region (bp) | 4,311,603 | 83.77 |
| Number of replicons | 1 | |
| Total genes | 5,620 | 100 |
| RNA genes | 207 | 3.68 |
| tRNA genes | 180 | 3.20 |
| Protein-coding genes | 5,209 | 92.69 |
| Genes in paralog clusters | 151 | 2.69 |
| Genes assigned to COGs | 3701 | 65.85 |
| 1 or more conserved domains | 2,520 | 44.84 |
| 2 or more conserved domains | 834 | 14.84 |
| 3 or more conserved domains | 361 | 6.42 |
| Genes with function prediction | 3,224 | 57.37 |
| Genes assigned Pfam domains | 3,995 | 71.09 |
| Genes with signal peptides | 459 | 8.17 |
| Genes with transmembrane helices | 1,140 | 20.28 |
| CRISPR repeats | 1 |
aThe total is based on either the size of the genome in base pairs or the total number of protein coding genes in the annotated genome.
Figure 2Graphical map of the genome. From outside to the center: Ordered and oriented scaffolds assigned to chromosome in blue and red, extrachromosomal scaffolds in orange and black, Genes on forward strand (color by COG categories), Genes on reverse strand (color by COG categories), RNA genes (tRNAs green, rRNAs gray), GC content and GC skew.
Number of genes associated with the 25 general COG functional categories
| J | 190 | 3.65 | Translation |
| A | 1 | 0.02 | RNA processing and modification |
| K | 337 | 6.47 | Transcription |
| L | 193 | 3.71 | Replication, recombination and repair |
| B | 2 | 0.04 | Chromatin structure and dynamics |
| D | 39 | 0.75 | Cell cycle control, mitosis and meiosis |
| V | 66 | 1.27 | Defense mechanisms |
| T | 191 | 3.67 | Signal transduction mechanisms |
| M | 153 | 2.94 | Cell wall/membrane biogénesis |
| N | 79 | 1.52 | Cell motility |
| U | 23 | 0.44 | Intracellular trafficking and secretion |
| O | 116 | 2.23 | Posttranslational modification, protein turnover, chaperones |
| C | 162 | 3.11 | Energy production and conversión |
| G | 154 | 2.96 | Carbohydrate transport and metabolism |
| E | 396 | 7.60 | Amino acid transport and metabolism |
| F | 111 | 2.13 | Nucleotide transport and metabolism |
| H | 167 | 3.21 | Coenzyme transport and metabolism |
| I | 141 | 2.71 | Lipid transport and metabolism |
| P | 222 | 4.26 | Inorganic ion transport and metabolism |
| Q | 37 | 0.71 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 480 | 9.21 | General function prediction only |
| S | 441 | 8.47 | Function unknown |
| - | 1508 | 28.95 | Not in COGs |
aThe total is based on the total number of protein coding genes in the annotated genome.
Figure 3Dot-plots of amino-acid-based alignments of CBAM5, C3-41 and OT4b.31. Dot-plot of amino-acid-based alignment of a 4.61 Mbp chromosomal scaffold of L. sphaericus CBAM5 (y-axis) to the chromosome of L. sphaericus C3-41 (left) and L. sphaericus OT4b.31 (right). Aligned segments are represented as dots or lines. Forward matches are plotted in green, reverse matches in red. Figure generated by PROmer [41].
Figure 4Mosquitocidal binary toxin gene clusters of strains CBAM5, C3-41 and 2297. Binary toxin BinA and BinB, Mtx2/3 homolog, CDSs for a phage integrase family protein, the GerXB-KA-XC operon, a ribonuclease, a putative peptide synthase, and a chitin-binding protein, hypothetical proteins (blue arrows) and transposases (gray arrows) are indicated. A 1554 bp insertion is located between the GerXB-KA-XC operon and BinA-B coding sequences. A disrupted CDS (red box) includes a mobile element and a hypothetical protein.