| Literature DB >> 24501644 |
Tito David Peña-Montenegro1, Jenny Dussán1.
Abstract
Lysinibacillus sphaericus strain OT4b.31 is a native Colombian strain having no larvicidal activity against Culex quinquefasciatus and is widely applied in the bioremediation of heavy-metal polluted environments. Strain OT4b.31 was placed between DNA homology groups III and IV. By gap-filling and alignment steps, we propose a 4,096,672 bp chromosomal scaffold. The whole genome (consisting of 4,856,302 bp long, 94 contigs and 4,846 predicted protein-coding sequences) revealed differences in comparison to the L. sphaericus C3-41 genome, such as syntenial relationships, prophages and putative mosquitocidal toxins. Sphaericolysin B354, the coleopteran toxin Sip1A and heavy metal resistance clusters from nik, ars, czc, cop, chr, czr and cad operons were identified. Lysinibacillus sphaericus OT4b.31 has applications not only in bioremediation efforts, but also in the biological control of agricultural pests.Entities:
Keywords: DNA homology; Lysinibacillus sphaericus OT4b.31; Sip1A coleopteran toxin; de novo assembly; heavy metal tolerance
Year: 2013 PMID: 24501644 PMCID: PMC3910547 DOI: 10.4056/sigs.4227894
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Figure 1Phylogenetic tree highlighting the position of OT4b.31 relative to the available type strains and other non-assigned species within the families and . was designated as the outgroup species for the analyses. Right brackets encompass each homology group (I–VII) according to Nakamura’s benchmarks [21]. Nucleotide sequences obtained from GenBank and used in the phylogenetic analyses were as follows: 1457 (X51928), 10 (X57309), 168T (X60646), DSM 13T (X68416), IAM 13418T (D16273), BD-87 (AF169520), BD-99 (AF169525), NRS-1691 (AF169531), NRS-1693 (AF169533), StLB046 (NR_074954), 4400831 (NR_043092), NRS-250 (AF169536), B-1876 (AF169494), NRS-1198 (AF169528), B-4297 (AF169507), NRS-111 (AF169526), OT4b.31 (AQPX00000042.1:91-1546), B-183 (AF169493), B-23268T (AF169495), JG-A12 (AM292655), B-14905 (AF169491), ZC1 (NZ_ADJR01000054.1:1-1487), C3-41 (NC_010382.1:16887-18287), B-14865 (AF169490), 2362 (L14011), ATCC-7055 (AJ310083), B-14957 (AF169492) and B-23269 (AF169496). The branches are scaled in terms of the expected number of substitutions per site. Numbers adjacent to the branches represent percentage bootstrap values based on 1,000 replicates. Lineages with type strain genome sequencing projects registered in GOLD [22] are labeled with one asterisk, those also listed as 'Complete and Published' with two asterisks.
Classification and general features of OT4b.31 according to the MIGS recommendations [26]
| | | | |
|---|---|---|---|
| Current classification | Domain | TAS [ | |
| Gram stain | Positive in vegetative cells, variable in sporulating stages | IDA | |
| Cell shape | Straight rods | IDA | |
| Motility | Non-motile | IDA | |
| Sporulation | Sporulating | IDA | |
| Temperature range | Mesophile, grows > 14°, < 37°C | TAS [ | |
| Optimum temperature | 30°C | TAS [ | |
| Carbon source | Complex carbohydrates | TAS [ | |
| Energy metabolism | Heterotroph | TAS [ | |
| MIGS-6 | Habitat | Coleopteran (beetle) larvae | TAS [ |
| MIGS-6.3 | Salinity | Growth in Luria-Bertani broth (5% NaCl) | IDA |
| MIGS-22 | Oxygen requirement | Aerobic | TAS [ |
| MIGS-15 | Biotic relationship | Free living | TAS [ |
| MIGS-14 | Pathogenicity | Unknown | TAS [ |
| MIGS-4 | Geographic location | Tenjo, Cundinamarca, Colombia | TAS [ |
| MIGS-5 | Sample collection time | 1995 | TAS [ |
| MIGS-4.1 | Latitude | 4.88727 | TAS [ |
| MIGS-4.2 | Longitude | -74.132831 | TAS [ |
| MIGS-4.3 | Depth | Surface | TAS [ |
| MIGS-4.4 | Altitude | 2,685 m above sea level | TAS [ |
a) Evidence codes - IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [38].
Figure 2Gram staining of (A) vegetative cells and (B) spores of OT4b.31.
Figure 3Scanning electron micrograph of OT4b.31 at an operating voltage of 20 kV.
Genome sequencing project information
| | | |
|---|---|---|
| MIGS-31 | Finishing quality | Improved high-quality draft |
| MIGS-28 | Libraries used | One paired end tags 90:90 bp with 500 bp insert |
| MIGS-29 | Sequencing platforms | Illumina Hi-Seq 2000 |
| MIGS-31.2 | Fold coverage | 100× |
| MIGS-30 | Assemblers | CLC Assembly Cell version 4.0.10 |
| MIGS-32 | Gene calling method | Glimmer3, tRNAscan-SE |
| Genbank ID | AQPX00000000 | |
| Genbank Date of Release | May 10, 2013 | |
| GOLD ID | Gi39289 | |
| Project relevance | Biotechnology, metabolic pathway |
Summary of genome
| | | | |
|---|---|---|---|
| Chromosomal scaffold | 4,096,672 | Circular | KB933398.1 |
| Extrachromosomal elements | 759,630 | Linear | KB933399.1-KB933469.1 |
Nucleotide content and gene count levels of the genome
| Value | % of totala | |
|---|---|---|
| Genome size (bp) | 4,856,302 | 100.00 |
| DNA GC content (bp) | 1,821,262 | 37.50 |
| DNA coding region (bp) | 3,924,297 | 80.81 |
| Number of replicons | 1 | |
| Extrachromosomal | 0 | |
| Total genes | 4,938 | 100 |
| RNA genes | 46 | 0.93 |
| rRNA operons | 7 | |
| tRNA genes | 38 | 0.77 |
| Pseudogenes | 1,623 | 32.87 |
| Protein-coding genes | 4,846 | 98.14 |
| Genes in paralog clusters | 658 | 13.33 |
| Genes assigned to COGs | 2,946 | 59.66 |
| 1 or more conserved domains | 2,946 | 59.66 |
| 2 or more conserved domains | 529 | 10.71 |
| 3 or more conserved domains | 98 | 1.98 |
| Genes with function prediction | 3,315 | 67.13 |
| Genes assigned Pfam domains | 2,799 | 56.68 |
| Genes with signal peptides | 1,206 | 24.42 |
| Genes with transmembrane helices | 1,206 | 24.42 |
| CRISPR repeats | 0 | 0.00 |
a) The total is based on either the size of the genome in base pairs or the total number of protein coding genes in the annotated genome.
Figure 4Graphical map of the genome. From outside to the center: Ordered and oriented scaffolds assigned to chromosome in blue and red, extrachromosomal scaffolds in orange and black, Genes on forward strand (color by COG categories), Genes on reverse strand (color by COG categories), RNA genes (tRNAs green, rRNAs gray), GC content and GC skew.
Number of genes associated with the 25 general COG functional categories
| | | | |
|---|---|---|---|
| J | 180 | 3.80 | Translation |
| A | 118 | 2.49 | RNA processing and modification |
| K | 354 | 7.48 | Transcription |
| L | 167 | 3.53 | Replication, recombination and repair |
| B | 1 | 0.02 | Chromatin structure and dynamics |
| D | 37 | 0.78 | Cell cycle control, mitosis and meiosis |
| Y | 0 | 0 | Nuclear structure |
| V | 75 | 1.58 | Defense mechanisms |
| T | 293 | 6.19 | Signal transduction mechanisms |
| M | 159 | 3.36 | Cell wall/membrane biogenesis |
| N | 95 | 2.01 | Cell motility |
| Z | 31 | 0.66 | Cytoskeleton |
| W | 28 | 0.59 | Extracellular structures |
| U | 48 | 1.01 | Intracellular trafficking and secretion |
| O | 96 | 2.03 | Posttranslational modification, protein turnover, chaperones |
| C | 169 | 3.57 | Energy production and conversion |
| G | 146 | 3.09 | Carbohydrate transport and metabolism |
| E | 351 | 7.42 | Amino acid transport and metabolism |
| F | 85 | 1.80 | Nucleotide transport and metabolism |
| H | 142 | 3.00 | Coenzyme transport and metabolism |
| I | 133 | 2.81 | Lipid transport and metabolism |
| P | 273 | 5.77 | Inorganic ion transport and metabolism |
| Q | 98 | 2.07 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 450 | 9.51 | General function prediction only |
| S | 234 | 4.95 | Function unknown |
| - | 1,694 | 37.74 | Not in COGs |
a) The total is based on the total number of protein coding genes in the annotated genome.
Figure 5(A) Dot-plot of amino-acid-based alignment of a 4.09 Mbp chromosomal scaffold of OT4b.31 (y-axis) to a 4.6 Mbp chromosome of C3-41 (x-axis). Aligned segments are represented as dots or lines. Forward matches are plotted in red, reverse matches in blue. Figure generated by PROmer [45]. (B) Nucleotide-based alignment of a 4.09 Mbp chromosomal scaffold of OT4b.31 (right) to a 4.6 Mbp chromosome of C3-41 (left). A total of 27 homologous blocks are shown as identically colored regions and linked across the sequences. Regions that are inverted relative to OT4b.31 are shifted to the right of center axis of the sequence. The origin of replication in each sequence is approximately at coordinate 1. Red bars show the limits of each contig in the chromosomal scaffold. Contigs 1 to 26 are numbered in ascending order start in coordinate 1. The figure was generated by Mauve [46].