| Literature DB >> 34207879 |
Daria I Melnikova1, Reindert Nijland2, Timur Yu Magarlamov1.
Abstract
Tetrodotoxin (TTX)-producing bacteria have attracted great interest as a model system for study of the TTX biosynthetic route. Here, we report the complete genome of the TTX-producing bacterium Bacillus sp. 1839. The genome of the strain Bacillus sp. 1839, previously isolated from the TTX-bearing marine ribbon worm Cephalothrix cf. simula, was obtained using second generation Illumina and third generation nanopore sequencing technologies. Phylogenetic analysis has classified this strain as Cytobacillus gottheilii.Entities:
Keywords: TTX-producing bacteria; genome; tetrodotoxin (TTX)
Mesh:
Substances:
Year: 2021 PMID: 34207879 PMCID: PMC8228330 DOI: 10.3390/toxins13060410
Source DB: PubMed Journal: Toxins (Basel) ISSN: 2072-6651 Impact factor: 4.546
The general genome features of Bacillus sp. 1839.
| Feature | Value | |
|---|---|---|
| Chromosome | Plasmid | |
| Genome size (bp) | 4,523,455 | 61,233 |
| G + C content (mol %) | 39 | 34 |
| Genes number (total) | 4458 | 69 |
| Coding sequences total number | 4339 | 69 |
| Coding sequences with protein number | 4300 | 69 |
| Protein-coding genes number | 4300 | 69 |
| rRNAs number | 11, 10, 10 | 0 |
| tRNAs number | 83 | 0 |
| ncRNAs number | 5 | 0 |
| Pseudogenes number | 39 | 0 |
| Insertion sequences number | 92 | 0 |
| Genomic islands number | 9 | 0 |
| Clustered regularly interspaced short palindromic repeats number | 2 | 0 |
| Prophage number | 4 | 0 |
Figure 1Graphic circular map of Bacillus sp. 1839 chromosome (A), and plasmid (B). From outer to inner rings, circle 1: the size of the complete genome; circle 2: the predicted protein-coding genes on the sense and antisense strands (colored according to Clusters of Orthologous Groups categories); circle 3: RNA genes; circle 4: G + C content, with >39% G + C in blue, with ≤39% G + C in orange; circle 5: G + C skew, with G% > C% in green, with G% < C% in violet.
Genes associated with general Clusters of Orthologous Groups (COG) functional categories of Bacillus sp. 1839.
| Code | Value | % 1 | Function Description |
|---|---|---|---|
| INFORMATION STORAGE AND PROCESSING | |||
| B | 1 | 0.02 | Chromatin structure and dynamics |
| J | 171 | 3.9 | Translation, ribosomal structure, and biogenesis |
| K | 269 | 6.2 | Transcription |
| L | 197 | 4.5 | Replication, recombination, and repair |
| METABOLISM | |||
| C | 189 | 4.3 | Energy production and conversion |
| E | 255 | 5.8 | Amino acid transport and metabolism |
| F | 88 | 2 | Nucleotide transport and metabolism |
| G | 213 | 4.9 | Carbohydrate transport and metabolism |
| H | 100 | 2.3 | Coenzyme transport and metabolism |
| I | 88 | 2 | Lipid transport and metabolism |
| P | 248 | 5.7 | Inorganic ion transport and metabolism |
| Q | 29 | 0.7 | Secondary metabolites biosynthesis, transport, and catabolism |
| CELLULAR PROCESSES AND SIGNALING | |||
| D | 36 | 0,8 | Cell cycle control, Cell division, and chromosome partitioning |
| M | 170 | 3.9 | Cell wall/membrane/envelope biogenesis |
| N | 43 | 1 | Cell motility |
| O | 113 | 2.6 | Posttranslational modification, protein turnover, chaperones |
| T | 191 | 4.4 | Signal transduction mechanisms |
| U | 38 | 0.9 | Intracellular trafficking, secretion, and vesicular transport |
| V | 75 | 1.7 | Defense mechanisms |
| POORLY CHARACTERIZED | |||
| S | 994 | 22.8 | Function unknown |
| - | 848 | 19.5 | Not in COGs |
1 The total is based on the total number of protein coding genes in the genome.
Putative gene clusters coding for secondary metabolites in Bacillus sp. 1839.
| Type | From (bp) | To (bp) | Most Similar Known Cluster | % Similarity | Accession Number |
|---|---|---|---|---|---|
| Terpene | 348,477 | 367,415 | 88 | NZ_KV440945 | |
| Thiopeptide-Linear azol(in)e-containing peptides | 447,791 | 476,894 | 82 | ||
| Terpene | 973,108 | 1,001,908 | 89 | ||
| Type III polyketide synthase cluster | 2,777,409 | 2,818,491 | 100 |
Figure 2Neighbor-joining phylogenetic tree of Bacillus sp. 1839 and other closely related strains based on 16S rRNA gene sequences. Evolutionary distances are computed by the maximum likelihood method with 1000 bootstrap replications. Scale bar indicates 0.005 nucleotide substitutions per site. Bacillus sp. 1839 is indicated by the red triangle.
General genome features of the Cytobacillus spp. closely related to Bacillus sp. 1839.
| Organism | NCBI | Assembly Level | Size (bp) | GC (%) | Predicted Coding | No. of Genes | No. of Proteins | No. of RNAs |
|---|---|---|---|---|---|---|---|---|
| NZ_FUVC00000000.1 | Scaffold | 4,719,939 | 39 | 4492 | 4621 | 4452 | 129 | |
| NZ_KV440945 | Genome | 4,584,535 | 39 | 4370 | 4475 | 4310 | 105 |
Average nucleotide identity (ANI) and tetranucleotide frequency correlation coefficient (Tetra) analysis of Bacillus sp. 1839 and closely related Cytobacillus strains.
| ANIb | Aligned | Tetra | ANIb | Aligned | Tetra | ANIb | Aligned | Tetra | |
|---|---|---|---|---|---|---|---|---|---|
| * | 97.55 | 85.95 | 0.99891 | 97.55 | 85.95 | 0.99892 | |||
| 97.32 | 86.16 | 0.99891 | * | 100.00 | 99.49 | 1.0 | |||
| 97.34 | 86.18 | 0.99892 | 100.00 | 99.47 | 1.0 | * | |||
* The asterisk indicates that the strain is compared to itself.
Digital DNA–DNA hybridization (dDDH) analysis of Bacillus sp. 1839 with closely related Cytobacillus strains.
| Query Genome | Reference Genome | Formula 1 | Formula 2 | Formula 3 | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| DDH (%) | Model C.I. (%) | Distance | DDH (%) | Model C.I. (%) | Distance | DDH (%) | Model C.I. (%) | Distance | ||
| 83.7 | 79.8–86.9 | 0.1140 | 79.6 | 76.6–82.2 | 0.0239 | 85.9 | 82.8–88.5 | 0.1351 | ||
| 83.7 | 79.9–86.9 | 0.1137 | 79.6 | 76.6–82.2 | 0.0239 | 85.9 | 82.8–88.6 | 0.1349 | ||