| Literature DB >> 31906154 |
Maria B Nowakowska1, François P Douillard1, Miia Lindström1.
Abstract
The botulinum neurotoxin (BoNT) has been extensively researched over the years in regard to its structure, mode of action, and applications. Nevertheless, the biological roles of four proteins encoded from a number of BoNT gene clusters, i.e., OrfX1-3 and P47, are unknown. Here, we investigated the diversity of orfX-p47 gene clusters using in silico analytical tools. We show that the orfX-p47 cluster was not only present in the genomes of BoNT-producing bacteria but also in a substantially wider range of bacterial species across the bacterial phylogenetic tree. Remarkably, the orfX-p47 cluster was consistently located in proximity to genes coding for various toxins, suggesting that OrfX1-3 and P47 may have a conserved function related to toxinogenesis and/or pathogenesis, regardless of the toxin produced by the bacterium. Our work also led to the identification of a putative novel BoNT-like toxin gene cluster in a Bacillus isolate. This gene cluster shares striking similarities to the BoNT cluster, encoding a bont/ntnh-like gene and orfX-p47, but also differs from it markedly, displaying additional genes putatively encoding the components of a polymorphic ABC toxin complex. These findings provide novel insights into the biological roles of OrfX1, OrfX2, OrfX3, and P47 in toxinogenesis and pathogenesis of BoNT-producing and non-producing bacteria.Entities:
Keywords: botulinum neurotoxin; neurotoxin associated proteins; neurotoxin gene cluster; orfX; p47
Mesh:
Substances:
Year: 2019 PMID: 31906154 PMCID: PMC7020563 DOI: 10.3390/toxins12010019
Source DB: PubMed Journal: Toxins (Basel) ISSN: 2072-6651 Impact factor: 4.546
Figure 1Phylogenetic distribution of the PF06597 domain (Clostridium P47) among bacterial species based on Annotree. Blue branches correspond to species carrying the PF06597 domain (Clostridium P47 superfamily). Strains/species belonging to the orders shown in red were further analyzed in the present study. The order of Lactobacillales is highlighted in green as it was not detected in Annotree, although this order has one known E. faecium isolate harboring a neurotoxin gene cluster with the orfX-p47 genes.
Bacterial genomes analyzed in this study.
| Bacterial Species | Strain Name | Taxonomy (Class) | Isolation Source | Accession Number | Reference |
|---|---|---|---|---|---|
|
| Beluga |
| Fermented whale flippers, Canada | NZ_ACSC01000002 | Direct submission |
|
| Kyoto-F |
| Infant feces, Japan | CP001581 | [ |
|
| CDC_297 |
| Liver paste, USA | CP006907 | [ |
|
| 111 |
| Infant feces, Japan | AP014696 | [ |
|
| Loch Maree |
| Duck liver paste, Scotland | CP000962 | [ |
|
| Mauritius |
| Fish, Mauritius | NZ_LFPL01000000 | [ |
|
| Langeland |
| Liver paste, Denmark | CP000728 | Direct submission |
|
| Mfbjulcb3 |
| Retail fish market, India | CP027780 | Direct submission |
|
| Sullivan |
| Adult human feces, USA | CP006905 | [ |
|
| 3G1_DIV0629 |
| Cow feces, USA | NGLI00000000 | Direct submission |
| Pbm |
| Swamp soil, Malaysia | CM017269 | [ | |
|
| DSM 15247 |
| Son-killer of | AUCC00000000 | Direct submission |
| 2SH |
| Alpine fresh spring, Italy | SCNA01000023 | [ | |
|
| 1951 |
| Forage rape seed, New Zealand | RHPK00000000 | [ |
| TSV86 |
| Water, Australia | GCA_001522865 | Direct submission | |
|
| ATCC 49946 |
| Infected apple tree, USA | FN666575 | [ |
| SAG 10367 |
| NZ_CP020557 | [ | ||
|
| BO5 |
| Soil, Russia | GCA_003591545.1 | Direct submission |
|
| - |
| Pill bugs, USA | NZ_AAQJ00000000 | Direct Submission |
|
| DSM 19387 |
| Air sample, South Korea | NZ_ARPG00000000 | Direct submission |
|
| AR67 |
| Sheep rumen, New Zealand | GCA_900112155 | Direct submission |
|
| DSM 21201 |
| Salt mine sediment, China | NZ_AUBC00000000 | Direct submission |
|
| AFS089089 |
| Grainbin dust, USA | NVNL01000046 | [ |
Figure 2Gene arrangement of orfX-p47 clusters. Each chromosomal region was ordered based on the maximum likelihood phylogenetic tree of P47. Values indicated on the tree branches are bootstrapping values (250 iterations).
Figure 3Maximum likelihood phylogenetic tree of P47, OrfX1, OrfX2, OrfX3, and OrfX-T. Values indicated on the tree branches are bootstrapping values (250 iterations). The results retrieved from MEME analysis of all protein sequences were juxtaposed to the corresponding strains (p-values and motif locations). The color-coded blocks correspond to different amino acid motifs detected within the analyzed protein sequences and serve to visualize the similarities between the proteins.
Features of genes contiguous to the orfX-p47 gene cluster. Predicted annotations are based on sequence homology (protein BLAST) and/or structural homology (Phyre2). The E-score corresponds to the identified domain.
| Bacterial Strain | OrfX-assisting Gene | Protein ID | Predicted Annotation | NCBI Conserved Domain Search Output (Accession Number) | E-score |
|---|---|---|---|---|---|
|
| EES49627.1 | BoNT type E | Clostridial neurotoxin, translocation domain (cl06820) | 3.07 × 10−93 | |
| Clostridial neurotoxin zinc protease (cl15546) | 1.10 × 10−79 | ||||
| Clostridial neurotoxin, N-terminal receptor binding (PF07953) | 9.90 × 10−68 | ||||
| Clostridial neurotoxin, C-terminal receptor binding (cl08467) | 3.90 × 10−22 | ||||
|
| EES49602.1 | NTNH protein | Clostridial neurotoxin zinc protease (cl15546) | 1.15 × 10−83 | |
| Laminin G domain (cl22861) | 1.79 × 10−47 | ||||
| Non-toxic non-hemagglutinin C-terminal (cl07187) | 2.43 × 10−40 | ||||
| C7M59_04110 | AVQ52086.1 | Crystal insecticidal protein (Cry)/insecticidal delta-endotoxin | Delta-endotoxin, C-terminal domain (cd04085) | 5.38 × 10−39 | |
| Delta-endotoxin (cl15971) | 5.32 × 10−14 | ||||
| Delta-endotoxin, N-terminal domain (cl04339) | 2.11 × 10−13 | ||||
| NNa | WP_026823093.1 | RHS repeat protein | RHS Repeat (PF05593) | 1.19 × 10−5 | |
| Uncharacterized conserved protein RhaS (COG3209) | 3.84 × 10−3 | ||||
| NN | WP_026823094.1 | RHS repeat protein | - | - | |
| NN | WP_081700660.1 | RHS repeat protein | RHS repeat-associated core domain (cl37315) | 2.68 × 10−12 | |
| Beta-eliminating lyase (cl18945) | 2.98 × 10−3 | ||||
| BNAb | WP_137842862.1 | BoNT/NTNH-like A component (BNA) | Clostridial neurotoxin zinc protease (cl15546) | 3.55 × 10−34 | |
| Clostridial neurotoxin, translocation domain (cl06820) | 1.14 × 10−26 | ||||
| NN | WP_137842861.1 | RHS repeat protein | - | - | |
| NN | WP_137842860.1 | RHS repeat protein | RHS repeat-associated core domain (TIGR03696) | 2.88 × 10−26 | |
| Bacterial SNF2 helicase associated domain (cl07173) | 1.81 × 10−3 | ||||
| EEL31_08340 | TPG68525.1 | Crystal insecticidal protein (Cry)/insecticidal delta-endotoxin | Delta-endotoxin, C-terminal domain (cd04085) | 1.86 × 10−22 | |
| Delta-endotoxin (cl15971) | 1.13 × 10−13 | ||||
| Delta-endotoxin, N-terminal domain (cl04339) | 1.03 × 10−11 | ||||
| EEL31_17680 | TPG70133.1 | Binary toxin/vegetative insecticidal protein (VIP1) | Clostridial binary toxin B/anthrax toxin PA domain 2 (cl38748) | 2.52 × 10−39 | |
| Clostridial binary toxin B/anthrax toxin PA Ca-binding domain (cl09551) | 2.12 × 10−14 | ||||
| Clostridial binary toxin B/anthrax toxin PA domain 3 (cl38749) | 6.89 × 10−12 | ||||
| EEL31_17670 | TPG71603.1 | Vegetative insecticidal protein (VIP2) | VIP2, ADP-ribosyltransferase exoenzyme (cl00173) | 4.71 × 10−51 | |
| Clostridial binary toxin B/anthrax toxin PA domain 2 (cl38748) | 1.60 × 10−39 | ||||
| EEL31_17650 | TPG70130.1 | Binary toxin/vegetative insecticidal protein (VIP1) | Clostridial binary toxin B/anthrax toxin PA Ca-binding domain (cl09551) | 1.44 × 10−14 | |
| Clostridial binary toxin B/anthrax toxin PA domain 3 (cl38749) | 3.11 × 10−8 | ||||
| EEL31_17645 | TPG70129.1 | Vegetative insecticidal protein (VIP2) | VIP2, ADP-ribosyltransferase exoenzyme (cl00173) | 3.92 × 10−66 | |
| WS68_18250 | WP_059573479.1 | Autotransporter protein | Outer membrane autotransporter barrel domain (cl36898) | 5.76 × 10−52 | |
| Autotransport protein MisL (cl36477) | 1.07 × 10−19 | ||||
| Large exoprotein involved in heme utilization or adhesion (COG3210) | 2.08 × 10−6 | ||||
| Extended signal peptide of type V secretion system (PF13018) | 4.04 × 10−5 | ||||
| EAM_RS01885 | WP_004160289.1 | RHS repeat protein | RHS repeat-associated core domain (TIGR03696) | 1.12 × 10−23 | |
| Uncharacterized conserved protein RhaS (COG3209) | 8.37 × 10−8 | ||||
| B7C51_09885 | ARF68072.1 | Binary toxin/vegetative insecticidal protein (VIP1) | Clostridial binary toxin B/anthrax toxin PA domain 2 (cl38748) | 4.05 × 10−40 | |
| Clostridial binary toxin B/anthrax toxin PA Ca-binding domain (cl09551) | 1.82 × 10−10 | ||||
| Clostridial binary toxin B/anthrax toxin PA domain 3 (cl38749) | 6.37 × 10−7 | ||||
| PA14 domain (cl08459) | 3.38 × 10−5 | ||||
| B7C51_09880 | ARF68071.1 | Vegetative insecticidal protein (VIP2) | VIP2, ADP-ribosyltransferase exoenzyme (cl00173) | 1.86 × 10−8 | |
| Anthrax toxin lethal factor (cl08465) | 2.48 × 10−3 | ||||
| B7C51_09875 | NN | Anthrax toxin lethal factor/vegetative insecticidal protein (VIP2) | VIP2, ADP-ribosyltransferase exoenzyme (PF03496) | 3.59 × 10−62 | |
| DQX05_07030 | WP_119792154.1 | Binary toxin/vegetative insecticidal protein (VIP1) | Clostridial binary toxin B/anthrax toxin PA domain 2 (cl38748) | 1.14 × 10−39 | |
| Clostridial binary toxin B/anthrax toxin PA Ca-binding domain (cl09551) | 5.76 × 10−15 | ||||
| Clostridial binary toxin B/anthrax toxin PA domain 3 (cl38749) | 1.63 × 10−7 | ||||
| PA14 domain (cl08459) | 1.49 × 10−4 | ||||
| Ricin-type beta-trefoil lectin domain-like (PF14200) | 4.93 × 10−3 | ||||
| DQX05_07025 | WP_119792152.1 | Binary toxin/vegetative insecticidal protein (VIP1) | Clostridial binary toxin B/anthrax toxin PA domain 2 (cl38748) | 2.27 × 10−41 | |
| Clostridial binary toxin B/anthrax toxin PA Ca-binding domain (cl09551) | 1.32 × 10−15 | ||||
| Clostridial binary toxin B/anthrax toxin PA domain 3 (cl38749) | 2.02 × 10−6 | ||||
| DQX05_07020 | WP_119792150 | Vegetative insecticidal protein (VIP2) | VIP2, ADP-ribosyltransferase exoenzyme (cl00173) | 1.20 × 10−6 | |
| DQX05_07015 | WP_119792149 | Anthrax toxin lethal factor/vegetative insecticidal protein (VIP2) | Anthrax toxin lethal factor (cl08465) | 6.60 × 10−4 | |
| VIP2, ADP-ribosyltransferase exoenzyme (cl00173) | 5.05 × 10−63 | ||||
|
| RICGR_0720 | WP_081441678.1 | Shiga toxin A-chain (rRNA N-glycosidase) | Ribosome inactivating protein (cl08249) | 7.84 × 10−20 |
| NN | WP_019988042.1 | BIG-5 domain containing protein | Bacterial Ig-like domain, BIG5 (PF13205) | 1.62 × 10−16 | |
| NN | WP_019988043.1 | Low affinity iron permease | Low affinity iron permease (PF04120) | 1.20 × 10−70 | |
| SAMN02910406 | WP_074963339.1 | Starch-binding protein | Uncharacterized conserved protein YjdB, contains Ig-like domain (COG5492) | 1.24 × 10−10 | |
| Starch-binding module 26 (PF16738) | 1.92 × 10−8 | ||||
| CON71_23765 | WP_098902378.1 | Crystal insecticidal protein (Cry) | Insecticidal crystal toxin, P42 (cl05149) | 5.68 × 10−11 | |
| CON71_23770 | WP_098902379.1 | RHS repeat protein | RHS repeat-associated core domain (TIGR03696) | 7.79 × 10−25 | |
| Uncharacterized conserved protein RhaS (COG3209) | 7.58 × 10−7 |
a NN, no name/tag assigned in the deposited database. b Name assigned in the present study.
Figure 4Protein sequence analysis of BNA. (a) Domain structure of BNA compared to the canonical domain structure of BoNT and NTNH. The NCBI conserved domain detection tool was used to predict protein domains within BNA, BoNT, and NTNH protein sequences. (b) Maximum likelihood phylogenetic tree of BoNT, NTNH, TeNT, and BNA. Values indicated on the tree branches are bootstrapping values (250 iterations). (c) Structural modeling of BNA based on I-TASSER prediction (left) compared to the structures of NTNH type D (middle) and BoNT type B (right).