| Literature DB >> 29026451 |
David Sannino1, Esther R Angert1.
Abstract
Paenibacillus thiaminolyticus is the model organism for studying thiaminase I, an enigmatic extracellular enzyme. Originally isolated from the feces of clinical patients suffering from thiamin deficiency, P. thiaminolyticus has been implicated in thiamin deficiencies in humans and other animals due to its ability to produce this thiamin-degrading enzyme. Its close relative, P. apiarius, also produces thiaminase I and was originally isolated from dead honeybee larvae, though it has not been reported to be a honeybee pathogen. We generated draft genomes of the type strains of both species, P. thiaminolyticus NRRL B-4156 and P. apiarius NRRL B-23460, to deeply explore potential routes of thiamin metabolism. We discovered that the thiaminase I gene is located in a highly conserved operon with thiamin biosynthesis and salvage genes, as well as genes involved in the biosynthesis of the antibiotic bacimethrin. Based on metabolic pathway predictions, P. apiarius NRRL B-23460 has the genomic capacity to synthesize thiamin de novo using a pathway that is rarely seen in bacteria, but P. thiaminolyticus NRRL B-4156 is a thiamin auxotroph. Both genomes encode importers for thiamin and the pyrimidine moiety of thiamin, as well as enzymes to synthesize thiamin from pyrimidine and thiazole.Entities:
Keywords: Hydroxymethyl pyrimidine; Paenibacillus apiarius; Paenibacillus dendritiformis; Paenibacillus thiaminolyticus; Thiamin; Thiaminase I
Year: 2017 PMID: 29026451 PMCID: PMC5627462 DOI: 10.1186/s40793-017-0276-9
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Fig. 1Phase-contrast micrographs of 10.1601/nm.5116 10.1601/strainfinder?urlappend=%3Fid%3DNRRL+B-23460 and 10.1601/nm.5156 10.1601/strainfinder?urlappend=%3Fid%3DNRRL+B-4156. a Depicts 10.1601/nm.5116 10.1601/strainfinder?urlappend=%3Fid%3DNRRL+B-23460 cells grown in 10.1601/strainfinder?urlappend=%3Fid%3DTSB+for+24 hr. at 30 °C. b Depicts 10.1601/nm.5156 10.1601/strainfinder?urlappend=%3Fid%3DNRRL+B-4156 cells grown in 10.1601/strainfinder?urlappend=%3Fid%3DTSB+for+24 hr. at 37 °C. Scale bars represent 5 μm
Classification and general features of 10.1601/nm.5116 10.1601/strainfinder?urlappend=%3Fid%3DNRRL+B-23460 [53]
| MIGS ID | Property | Term | Evidence codea |
|---|---|---|---|
| Classification | Domain | TAS [ | |
| Phylum 10.1601/nm.3874 | TAS [ | ||
| Class 10.1601/nm.4854 | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus 10.1601/nm.5109 | TAS [ | ||
| Species | TAS [ | ||
| (Type) strain: 10.1601/strainfinder?urlappend=%3Fid%3DNRRL+B-23460
| |||
| Gram stain | Positive | TAS [ | |
| Cell shape | Rod | TAS [ | |
| Motility | Motile | TAS [ | |
| Sporulation | Endospores with thick coats | TAS [ | |
| Temperature range | 15–40 °C | TAS [ | |
| Optimum temperature | 28 °C | TAS [ | |
| pH range; Optimum | Not reported | ||
| Carbon source | D-glucose, D-galactose, cellobiose, maltose, melibiose, sucrose, trehalose, salicin; can hydrolyize starch, casein | TAS [ | |
| MIGS-6 | Habitat | Soil and honeybee associated | TAS [ |
| MIGS-6.3 | Salinity | 5% NaCl ( | TAS [ |
| MIGS-22 | Oxygen requirement | facultative | TAS [ |
| MIGS-15 | Biotic relationship | free-living | TAS [ |
| MIGS-14 | Pathogenicity | non-pathogen | TAS [ |
| MIGS-4 | Geographic location | Manitoba, Canada | TAS [ |
| MIGS-5 | Sample collection | 1950s | TAS [ |
| MIGS-4.1 | Latitude | Not reported | |
| MIGS-4.2 | Longitude | Not reported | |
| MIGS-4.4 | Altitude | Not reported |
a Evidence codes - IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [60]
Classification and general features of 10.1601/nm.5156 10.1601/strainfinder?urlappend=%3Fid%3DNRRL+B-4156 [53]
| MIGS ID | Property | Term | Evidence codea |
|---|---|---|---|
| Classification | Domain | TAS [ | |
| Phylum 10.1601/nm.3874 | TAS [ | ||
| Class 10.1601/nm.4854 | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus 10.1601/nm.5109 | TAS [ | ||
| Species | TAS [ | ||
| (Type) strain: 10.1601/strainfinder?urlappend=%3Fid%3DNRRL+B-4156
| |||
| Gram stain | Positive | TAS [ | |
| Cell shape | Rod | TAS [ | |
| Motility | Motile | TAS [ | |
| Sporulation | endospores | TAS [ | |
| Temperature range | 20–45 °C | TAS [ | |
| Optimum temperature | 28 °C | TAS [ | |
| pH range; Optimum | Not reported | ||
| Carbon source | D-glucose, D-fructose, D-galactose, D-ribose, lactose, cellobiose, maltose, mannose, melibiose, sucrose, trehalose, salicin; can hydrolyze starch, casein | TAS [ | |
| MIGS-6 | Habitat | Soil, animal associated | TAS [ |
| MIGS-6.3 | Salinity | 5% NaCl (w/v) | TAS [ |
| MIGS-22 | Oxygen requirement | facultative | TAS [ |
| MIGS-15 | Biotic relationship | free-living | TAS [ |
| MIGS-14 | Pathogenicity | non-pathogen (1 case in humans) | NAS [ |
| MIGS-4 | Geographic location | Japan | TAS [ |
| MIGS-5 | Sample collection | 1940s | TAS [ |
| MIGS-4.1 | Latitude | Not reported | |
| MIGS-4.2 | Longitude | Not reported | |
| MIGS-4.4 | Altitude | Not reported |
a Evidence codes - IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [60]
Fig. 2Phylogenetic tree of 10.1601/nm.5109 spp. based on 16S rRNA gene sequences. The maximum likelihood tree was inferred from a comparison of sequences from 10.1601/nm.5109 spp., 10.1601/nm.5168 10.1601/strainfinder?urlappend=%3Fid%3DDSM+5760, and 10.1601/nm.10618 168 using FastTree 2.1 [31]. The sequences generated from the draft genomes of this study are highlighted, with 10.1601/nm.5156 10.1601/strainfinder?urlappend=%3Fid%3DNRRL+B-4156 in blue font, and 10.1601/nm.5116 10.1601/strainfinder?urlappend=%3Fid%3DNRRL+B-23460 in green font
Project information
| MIGS ID | Property | Term (10.1601/nm.5116) | Term (10.1601/nm.5156) |
|---|---|---|---|
| MIGS 31 | Finishing quality | Draft | Draft |
| MIGS-28 | Libraries used | TruSeq | TruSeq |
| MIGS 29 | Sequencing platforms | Illumina MiSeq | Illumina MiSeq |
| MIGS 31.2 | Fold coverage | 150× | 150× |
| MIGS 30 | Assemblers | 10.1601/strainfinder?urlappend=%3Fid%3DSPA+des+5.3 | SPAdes 3.5 |
| MIGS 32 | Gene calling method | IMG and MicroScope | IMG and MicroScope |
| Locus Tag | Ga0138518 | Ga0138519 | |
| Genbank ID |
|
| |
| GenBank Date of Release | 05/31/2017 | 05/31/17 | |
| GOLD ID | Ga0138518 | Ga0138519 | |
| BIOPROJECT |
|
| |
| MIGS 13 | Source Material Identifier | Insect associated | Human associated |
| Project relevance | Metabolic pathways | Metabolic pathways |
Genome and annotation statistics for 10.1601/nm.5116 and 10.1601/nm.5156
| Attribute | 10.1601/nm.5116 NRRL B-23460 | 10.1601/nm.5156 10.1601/strainfinder?urlappend=%3Fid%3DNRRL+B-4156 | ||
|---|---|---|---|---|
| Value | % of Total | Value | % of Total | |
| Genome size (bp) | 5,404,821 | 100.00 | 6,537,496 | 100.00 |
| DNA coding (bp) | 4,642,405 | 85.89 | 5,508,364 | 84.26 |
| DNA G + C (bp) | 2,729,114 | 50.49 | 3,507,168 | 53.65 |
| DNA scaffolds | 51 | 100.00 | 47 | 100.00 |
| Total genes | 4957 | 100.00 | 5880 | 100.00 |
| Protein coding genes | 4822 | 97.28 | 5756 | 97.89 |
| RNA genes | 135 | 2.72 | 124 | 2.11 |
| Pseudo genes | 0 | 0 | 0 | 0 |
| Genes in internal clusters | 1259 | 25.40 | 1709 | 29.06 |
| Genes with function prediction | 3756 | 75.77 | 4458 | 75.82 |
| Genes assigned to COGs | 3092 | 62.38 | 3654 | 62.14 |
| Genes with Pfam domains | 3910 | 78.88 | 4674 | 79.49 |
| Genes with signal peptides | 304 | 6.13 | 350 | 5.95 |
| Genes with transmembrane helices | 1385 | 27.94 | 1658 | 28.20 |
| CRISPR repeats | 0 | 0 | 0 | 0 |
Number of genes associated with general COG functional categories
| Code | Value | % of total | Description |
|---|---|---|---|
| J | 219 | 6.30% | Translation, ribosomal structure and biogenesis |
| A | 0 | 0.00% | RNA processing and modification |
| K | 343 | 9.87% | Transcription |
| L | 98 | 2.82% | Replication, recombination and repair |
| B | 1 | 0.03% | Chromatin structure and dynamics |
| D | 50 | 1.44% | Cell cycle control, Cell division, chromosome partitioning |
| V | 110 | 3.16% | Defense mechanisms |
| T | 194 | 5.58% | Signal transduction mechanisms |
| M | 179 | 5.15% | Cell wall/membrane biogenesis |
| N | 66 | 1.90% | Cell motility |
| U | 29 | 0.83% | Intracellular trafficking and secretion |
| O | 112 | 3.22% | Posttranslational modification, protein turnover, chaperones |
| C | 165 | 4.75% | Energy production and conversion |
| G | 368 | 10.59% | Carbohydrate transport and metabolism |
| E | 317 | 9.12% | Amino acid transport and metabolism |
| F | 103 | 2.96% | Nucleotide transport and metabolism |
| H | 186 | 5.35% | Coenzyme transport and metabolism |
| I | 127 | 3.65% | Lipid transport and metabolism |
| P | 213 | 6.13% | Inorganic ion transport and metabolism |
| Q | 102 | 2.93% | Secondary metabolites biosynthesis, transport and catabolism |
| R | 281 | 8.08% | General function prediction only |
| S | 186 | 5.35% | Function unknown |
| – | 1865 | 37.62% | Not in COGs |
The total is based on the total number of protein coding genes in the genome of 10.1601/nm.5116 10.1601/strainfinder?urlappend=%3Fid%3DNRRL+B-23460
Number of genes associated with general COG functional categories
| Value | % of total | Description | |
|---|---|---|---|
| J | 248 | 6.03% | Translation, ribosomal structure and biogenesis |
| A | 0 | 0.00% | RNA processing and modification |
| K | 431 | 10.47% | Transcription |
| L | 112 | 2.72% | Replication, recombination and repair |
| B | 0 | 0.00% | Chromatin structure and dynamics |
| D | 59 | 1.43% | Cell cycle control, Cell division, chromosome partitioning |
| V | 157 | 3.81% | Defense mechanisms |
| T | 263 | 6.39% | Signal transduction mechanisms |
| M | 224 | 5.44% | Cell wall/membrane biogenesis |
| N | 63 | 1.53% | Cell motility |
| U | 28 | 0.68% | Intracellular trafficking and secretion |
| O | 142 | 3.45% | Posttranslational modification, protein turnover, chaperones |
| C | 199 | 4.83% | Energy production and conversion |
| G | 450 | 10.93% | Carbohydrate transport and metabolism |
| E | 370 | 8.99% | Amino acid transport and metabolism |
| F | 109 | 2.65% | Nucleotide transport and metabolism |
| H | 195 | 4.74% | Coenzyme transport and metabolism |
| I | 139 | 3.38% | Lipid transport and metabolism |
| P | 246 | 5.98% | Inorganic ion transport and metabolism |
| Q | 118 | 2.87% | Secondary metabolites biosynthesis, transport and catabolism |
| R | 334 | 8.11% | General function prediction only |
| S | 202 | 4.91% | Function unknown |
| – | 2226 | 37.86% | Not in COGs |
The total is based on the total number of protein coding genes in the genome of 10.1601/nm.5156 10.1601/strainfinder?urlappend=%3Fid%3DNRRL+B-4156
Fig. 3Predicted thiamin biosynthesis pathway in 10.1601/nm.5116 10.1601/strainfinder?urlappend=%3Fid%3DNRRL+B-23460. Pathways involved in thiazole biosynthesis are highlighted in blue and pathways involved in pyrimidine biosynthesis are shown in red. The steps in black correspond to the coupling of the thiazole and pyrimidine moieties, and the formation of the active cofactor TPP
Fig. 4Putative thiaminase I operons in the three 10.1601/nm.5109 species. Annotations of conserved genes shared amongst the three species are as follows (1) thiM; (2) thiD; (3) thiE; (4) glycosyltransferase; (5) thymidylate synthase; (6) methyltransferase; (7) yzgD NUDIX hydrolase; (8) thiaminase I and (9) HAD hydrolase. Three genes in 10.1601/nm.5116 B23460 that are not conserved include a putative transcriptional regulator and two genes of unknown function. The operons may be under the control of a putative thiamin pyrophosphate binding riboswitch, upstream of thiM. Genes are color-coded based on proposed functions
Fig. 5Predicted thiamin salvage pathways in all three paenibacilli. Pathways involved in thiazole salvage are highlighted in blue and pathways involved in pyrimidine salvage are shown in red. The dotted red boxes are steps unique to 10.1601/nm.5156 10.1601/strainfinder?urlappend=%3Fid%3DNRRL+B-4156 and 10.1601/nm.5127 C454. Biosynthetic pathways are shown with solid arrows and import pathways are indicated with a striped arrow. The putative importer boxed in green is unique to 10.1601/nm.5116 10.1601/strainfinder?urlappend=%3Fid%3DNRRL+B-23460. In all cases, it is not yet understood how THZ enters the cell