| Literature DB >> 31771094 |
Rawana N Alkhalili1, Joel Wallenius2, Björn Canbäck2.
Abstract
The toxin-antitoxin (TA) systems have been attracting attention due to their role in regulating stress responses in prokaryotes and their biotechnological potential. Much recognition has been given to type II TA system of mesophiles, while thermophiles have received merely limited attention. Here, we are presenting the putative type II TA families encoded on the genomes of four Geobacillus strains. We employed the TA finder tool to mine for TA-coding genes and manually curated the results using protein domain analysis tools. We also used the NCBI BLAST, Operon Mapper, ProOpDB, and sequence alignment tools to reveal the geobacilli TA features. We identified 28 putative TA pairs, distributed over eight TA families. Among the identified TAs, 15 represent putative novel toxins and antitoxins, belonging to the MazEF, MNT-HEPN, ParDE, RelBE, and XRE-COG2856 TA families. We also identified a potentially new TA composite, AbrB-ParE. Furthermore, we are suggesting the Geobacillus acetyltransferase TA (GacTA) family, which potentially represents one of the unique TA families with a reverse gene order. Moreover, we are proposing a hypothesis on the xre-cog2856 gene expression regulation, which seems to involve the c-di-AMP. This study aims for highlighting the significance of studying TAs in Geobacillus and facilitating future experimental research.Entities:
Keywords: (p)ppGpp; AbrB-ParE; GNAT-HTH; Geobacillus kaustophilus; Geobacillus sp. ZGt-1; Geobacillus thermodenitrificans; Geobacillus thermoleovorans; MNT-HEPN; XRE-COG2856; c-di-AMP
Mesh:
Year: 2019 PMID: 31771094 PMCID: PMC6929052 DOI: 10.3390/ijms20235869
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Figure 1Workflow chart summarizing the analysis approach carried out in this study for the identification of type II TA families.
TA families, domains, and operons predicted in the current study. Operon prediction was mainly based on the Operon-Mapper tool; exceptions are mentioned. In the last column, the term “Separate” means the toxin and antitoxin are in different operons, while “Shared” means they are located in the same operon. Footnote symbols that are not numbers are consistently used in other tables as well (if present).
| Strain | TA Family Name | Antitoxin Domain | Antitoxin Locus Tag | Antitoxin Length (aa) | Toxin Domain | Toxin Locus Tag | Toxin Length (aa) | TA Operon |
|---|---|---|---|---|---|---|---|---|
|
| GacTA 1 | wHTH ¤¤ | GTNG_1350 | 249 | GNAT ¤ | GTNG_1349 | 143 | Separate ‡‡ |
| GacT, solo toxin | N.A ± | N.A ± | N.A ± | GNAT ¤ | GTNG_1577 | 148 | Shared with GTNG_1578 | |
| GacT, solo toxin | N.A ± | N.A ± | N.A ± | GNAT ¤ | GTNG_1578 | 177 | Shared with GTNG_1577 | |
| GacA, solo antitoxin | HTH ¤ | GTNG_1575 | 67 | N.A ± | N.A ± | N.A ± | Shared with another protein ‡‡ | |
| MazEF | RHH ¤¤ | GTNG_0206 | 93 | PemK/MazF | GTNG_0207 | 116 | Shared | |
|
| GacTA 1 | wHTH ¤¤ | GK1499 | 250 | GNAT ¤ | GK1498 | 144 | Separate ‡‡ |
| MazEF (I) | AbrB/MazE ¤ | GK1647 | 89 | PemK/MazF ¤ | GK1648 | 109 | Shared | |
| MazEF (II) | RHH ¤¤ | GK0232 | 93 | PemK/MazF ¤ | GK0233 | 116 | Shared | |
| ParDE | SpoVT-AbrB (I) ¤ | GK2355 | 135 | ParE ¤ | GK2354 | 111 | Shared | |
| Phd-Doc | SpoVT-AbrB (II) ¤¤ | GK1845 | 84 | Fic/Doc ¤ | GK1846 | 140 | Shared | |
| RelBE | -/XRE # | GK3105 | 86 | RelE ¤ | GK3104 | 86 | Shared | |
| VapBC | UPF20175 ¤¤ | GK1950 | 96 | DUF33368/ | GK1949 | 167 | Shared | |
| XRE-COG2856 | -/HTH ɫ ɫ | GK3185 | 130 | COG2856 ¤ | GK3184 | 264 | Shared with 3rd protein ‡‡ | |
|
| GacTA 1 | wHTH ¤¤ | *_17290 | 249 | GNAT ¤ | *_17280 | 144 | Separate ‡‡ |
| MazEF (I) | MazE ¤ | *_19080 | 89 | PemK/MazF ¤ | *_19090 | 109 | Shared | |
| MazEF (II) | RHH ¤¤ | *_2490 | 93 | PemK/MazF ¤ | *_2500 | 116 | Shared | |
| MNT-HEPN (I) | NT5/ | *_10710 | 51 | DUF786 / | *_10720 | 90 | Shared | |
| MNT solo antitoxin | NT5/ | Unannotated | 54 | N.A ± | N.A ± | N.A ± | Shared with *_10710 and | |
| MNT-HEPN (II) | NT5/KNTase ¤ | *_11510 | 135 | DUF386 ¤/ | *_11500 | 139 | Shared | |
| ParDE | SpoVT-AbrB (I) ¤ | *_26570 | 122 | ParE ¤ | *_26560 | 102 | Shared §§ | |
| Phd-Doc | SpoVT-AbrB (II) ¤¤ | *_21520 | 93 | Fic/Doc ¤ | *_21530 | 140 | Shared operon | |
| RelBE | -/Xre # | *_34820 | 86 | RelE ¤ | *_34810 | 86 | Shared§§ | |
| VapBC | UPF20175 ¤ | *_22490 | 96 | DUF33368/ | *_22480 | 167 | Shared | |
| XRE-COG2856 | -/HTH ɫ ɫ | *_35630 | 130 | COG2856 ¤ | *_35620 | 264 | Shared with 3rd protein ‡‡ §§ | |
| ZG ¶ | GacTA | wHTH ¤¤ | Contig 16_18 | 249 | GNAT ¤ | Contig 16_17 | 144 | Separate ‡‡ |
| MazEF (I) | MazE ¤ | Contig 16_161 | 89 | PemK/MazF ¤ | Contig 16_162 | 109 | Shared | |
| MazEF (II) | RHH ¤¤ | Contig 4_60 | 93 | PemK/MazF ¤ | Contig 4_61 | 116 | Shared | |
| MNT-HEPN (I) | NT5/ | Contig 12_19 | 51 | DUF386 /COG2361 ¤ | Contig 12_20 | 90 | Shared | |
| MNT solo antitoxin | NT5/ | Contig 12_18 | 54 | N.A ± | N.A ± | N.A ± | Shared with | |
| MNT-HEPN (II) | NT5/KNTase ¤ | Contig 12_84 | 135 | DUF386 ¤/COG2445 ¤¤¤ | Contig 12_83 | 139 | Shared | |
| ParDE | SpoVT-AbrB (I) ¤ | Contig 23_243 | 135 | ParE ¤ | Contig 23_242 | 112 | Shared | |
| Phd-Doc | SpoVT-AbrB (II) ¤¤ | Contig 18_126 | 93 | Fic/Doc ¤ | Contig 18_127 | 140 | Shared | |
| RelBE | -/XRE # | Contig 25_196 | 84 | RelE ¤ | Contig 25_195 | 86 | Shared 6 |
¤ The conserved domain was inferred using CDD (Conservation Domain Database) tool; ¤¤ The conserved domain was inferred using InterPro domain analysis tool; ¤¤¤ COG2445 was inferred using Operon-Mapper tool; * Stands for “GTCCBUS3UF5” that is part of the locus tags in G. thermoleovorans CCB_US3_UF5; # There is no conserved domain in the antitoxin, but it is orthologous to XRE family transcriptional regulator, as shown in the KEGG Genes database/the KEGG KOALA BLAST and explained in the text; ɫ ɫ There is no conserved domain in the antitoxin; however, KEGG shows that the protein motif is HTH, as explained in the text; ‡‡ Details are given in Table S3; §§ Operon prediction was based on the ProOpDB. † G. thermodenitrificans NG80-2; ‡ G. kaustophilus HTA426; § G. thermoleovorans CCB_US3_UF5; ¶ Geobacillus sp. ZGt-1; ± N.A stands for “not applicable”.1 Geobacillus acetyltransferase toxin-antitoxin, this TA family name is suggested in this study for Geobacillus. strains having HTA-GNAT domain-harboring proteins; 2 Uncharacterized Protein Family. 3 Domain of Unknown Function, representing protein superfamily; 4 Clusters of Orthologous Genes; 5 Nucleotidyltransferase domain of DNA polymerase beta-like protein superfamily; 6 Operon prediction was inferred manually.
Previously unrecognized toxins and antitoxins that have been identified in the current study as putatively novel ones. These TAs have either been annotated as hypothetical proteins or have not been annotated.
| Strain | Genome Accession Number | Putative T/AT 1 | Locus Tag | Protein ID 2 |
|---|---|---|---|---|
|
| NC_009328 | RHH | GTNG_0206 | WP_008881474 3 |
| AbrB | GK2355 | WP_015375348 3 | ||
| HTH | GK3185 | WP_011232655 3 | ||
| ParE | GK2354 | WP_020278248 3 | ||
| RHH | GK0232 | WP_011229742 3 | ||
| XRE | GK3105 | WP_011232575 3 | ||
| AbrB | *_26570 | WP_014196297 3 | ||
| HTH | *_35630 | WP_014196828 3 | ||
| MNT solo antitoxin | Unannotated | WP_013146011 4 | ||
| ParE | *_26560 | WP_014196296 3 | ||
| Xre | *_34820 | WP_014196753 3 | ||
| ZG ¶ LDPD00000000 | AbrB | Contig 23_243 | WP_015375348 4 | |
| ParE | Contig 23_242 | WP_020278248 3 | ||
| RHH | Contig 4_60 | WP_011229742 3 | ||
| XRE | Contig 25_196 | WP_082218538 4 | ||
†G. thermodenitrificans NG80-2; ‡ G. kaustophilus HTA426. CCB_US3_UF5; ¶ Geobacillus sp. ZGt-1; * Stands for “GTCCBUS3UF5” that is part of the locus tags in G. thermoleovorans CCB_US3_UF5; 1Toxin/Antitoxin; 2 Represents the RefSeq accession number of the putative toxin/antitoxin protein; 3 Accession number belongs to the putative toxin/antitoxin protein as annotated in the RefSeq genome record of the type strain/draft genome sequence of strain ZGt-1; 4 Accession number belongs to the NCBI blastp top hit, e-value < 10−20.
Figure 2Scheme illustrating the hypothesis proposed in the current study on the regulation of the xre-cog2856 expression by the signaling messengers, (p)ppGpp and c-di-AMP. * 3rd protein represents the protein products of GK3183 and *_35610. QS stands for quorum sensing molecule(s). The black up arrows indicate increased levels, while the black down arrows indicate decreased levels. The small black curved arrow indicates the involvement of the QS molecule(s). The big black curved arrow indicates that the cell could go back to normal conditions. Scheme inspired by a figure in Gross et al., 2006 [111].