Literature DB >> 29106666

TADB 2.0: an updated database of bacterial type II toxin-antitoxin loci.

Yingzhou Xie1, Yiqing Wei1, Yue Shen1, Xiaobin Li1, Hao Zhou1, Cui Tai1, Zixin Deng1, Hong-Yu Ou1.   

Abstract

TADB2.0 (http://bioinfo-mml.sjtu.edu.cn/TADB2/) is an updated database that provides comprehensive information about bacterial type II toxin-antitoxin (TA) loci. Compared with the previous version, the database refined and the new data schema is employed. With the aid of text mining and manual curation, it recorded 6193 type II TA loci in 870 replicons of bacteria and archaea, including 105 experimentally validated TA loci. In addition, the newly developed tool TAfinder combines the homolog searches and the operon structure detection, allowing the prediction for type II TA pairs in bacterial genome sequences. It also helps to investigate the genomic context of predicted TA loci for putative virulence factors, antimicrobial resistance determinants and mobile genetic elements via alignments to the specific public databases. Additionally, the module TAfinder-Compare allows comparing the presence of the given TA loci across the close relative genomes. With the recent updates, TADB2.0 might provide better support for understanding the important roles of type II TA systems in the prokaryotic life activities.
© The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

Entities:  

Mesh:

Substances:

Year:  2018        PMID: 29106666      PMCID: PMC5753263          DOI: 10.1093/nar/gkx1033

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

Toxin–antitoxin (TA) systems, initially identified as plasmid addiction modules, are highly abundant on the chromosomes of most free-living bacteria (1). The TA systems are involved in multiple life activities of bacteria, such as nutrition starvation (1,2), programmed cell death (3), protection from bacteriophage (4) and the antimicrobial resistance (5). TA system consists of a stable toxin protein and a labile cognate antitoxin encoded by a bicistronic locus. Depending on the molecular pattern of antitoxin and the mechanism of toxin neutralization, the known TA systems have been sorted into six different groups, namely type I to VI (6,7). The antitoxin molecules are small non-coding RNAs in type I and III TA pairs, while they are proteins in other types. Toxins act on different targets to affect various cellular processes (Supplementary Table S1), such as peptidoglycan synthesis, replication and translation. Recently, two type II toxins containing the Gcn5-related N-acetyltransferase (GNAT) domain, TacT of Salmonella enterica Typhimurium (8) and AtaT of Escherichia coli O157:H7 (9), were reported to transfer the acetyl group from acetyl coenzyme A to the amine group of the tRNAs, resulting in halting translation of bacterial cell. Thus, more diverse mechanisms of how toxin works and gets neutralized by antitoxin are pending to be elucidated (10). The number of TAfinder-predicted type II TA loci in the mobile genetic elements. Panel (A), (B), (C) and (D) shows the type II TA number for 4419 plasmids, 253 integrative conjugative elements (ICEs), 3278 prophages and 3918 genomic islands (GIs) under analysis (Supplementary Table S2), respectively. Among all the six types of TA system, type II is the most extensively studied, with large quantity and high quality of publicly accessible data. With the advent of next-generation sequencing, a vast number of bacterial genome sequences are being generated. The knowledge resource systematically collecting the reported type II TA loci is therefore needed for researchers to sharing data, gaining reference and predicting new TA pairs. There are currently two open-access bioinformatics resources in the field of type II TA loci, namely the online tool RASTA (11) and the web-based database TADB (12). RASTA searches the conserved functional domains of individual toxins or antitoxin proteins; however, the RASTA website announced that its maintenance stopped since 2011 (http://genoweb1.irisa.fr/duals/RASTA-Bacteria/). In the year of 2011, we proposed the web-based open-access database TADB1.1 archiving both the experimentally validated type II TA pairs and the data derived from computationally predicted datasets (13,14). Since then, a vast number of new TA pairs were identified experimentally. The demand for database update and a tool to help predict new TA pairs became urgent. Here, we report the new major release of TADB, version 2.0. It reflects a large increase in the curated dataset of known type II TA pair and reorganization of the data schema. A newly developed online prediction tool for type II TA loci was also integrated. We expect that TADB2.0 will provide a better support for researchers interested in bacterial type II TA systems.

MATERIALS AND METHODS

Data update by text mining and manual curations

The maintenance of TADB core dataset has been performed majorly on addition and proof of the TA loci experimental evidence. The TA pair was tagged as ‘experimentally validated’ in TADB2.0 only if its clear biological function was reported in a peer-reviewed scientific publication. After manual curation of the PubMed searching results with the key word of ‘toxin and antitoxin’, 326 papers published since 2011 were collected and added into TADB2.0, resulting in a total of 586 papers in the database. TADB2.0 built the links of the archived type II TA loci to the corresponding experimental literature. For the TA loci with experimental evidence, in the previous version, the mapping was set between the archived TA loci and the literature on the host strain level. Now, the link was straightforwardly built from the TA loci to the corresponding experimental literature. In TADB2.0, 105 pairs of experimentally validated type II TA loci were collected. In addition, TADB2.0 also recorded 6088 computationally predicted type II TA loci. They were taken from two datasets: set I containing the BLASTp-identified TA pairs reported by Pandey et al. (13); Set II including the TA loci assigned to 44 conserved TA domain pairs proposed by Makarova et al. (14). However, note that, contrary to the previous version, TADB2.0 had not recorded the third in silico dataset obtained from RASTA-Bacteria (11). Namely, RASTA-Bacteria employed RPSBLAST searches to detected the individual toxin or antitoxin proteins; however, the obtained toxin or antitoxin genes were paired by TADB1.1 without the proven accuracy (12). The updated version had thus removed the RASTA-Bacteria dataset and just showed them as the supplemental list on the web pages of browsing individual replicons. The interface of TADB2.0 data presentation is illustrated in Supplementary Figure S1.

New schema of the back-end database

Similarly, to the previous version, TADB2.0 is implemented as a PostgreSQL relational database, but the database tables were constructed with a new data schema. In TADB1.1, the Generic Model Organism Database’s Chado schema (15) was employed to house the data of ∼1000 prokaryotic genomes obtained from the NCBI RefSeq project archive, sequence and annotation included. As the available genome data blasted in the recent years, TADB1.1 turned out to be hard to maintain and update. In the present release, a new local schema of PostgreSQL was designed to reorganize the TA locus data efficiently. It contains two major modules: TA pair (Supplementary Figure S2) and external reference. By this, the number of tables in the back-end of our database has been decreased to 21, compared to 194 in the previous version. The view number also decreased from 83 to 4. The file storage space thus reduced from 26 Gb to ∼31 Mb.

Integration of the type II TA loci prediction tool

Conserved genetic organization of known type II TA systems typically contains two tandem genes coding for cognate protein partners. We integrated a newly developed online tool, named TAfinder, into TADB2.0 to quickly detect the putative type II TA loci in bacterial genome sequences. In the previous release, the prediction of the putative toxin or antitoxin loci was aided by homolog searches by using the TADB-host WU-BLAST tool or the RPSBLAST-based tool RASTA. In TADB2.0, the TAfinder combines the homolog search module and the operon structure detection module, allowing the enhancement of prediction performance for type II TA pair (Supplementary Figures S3 and S4). Briefly, the TAfinder starts at searching the toxin or antitoxin protein homologs by using both NCBI BLASTp (16) and HMMer3 (17). The BLASTp subject dataset contains all the TADB2.0-archived type II TA systems. The 108 HMM-profiles for the conserved toxin domains and 201 for the antitoxin domains are also detected by HMMer3 with the default settings. Then, the short homologs of toxin proteins or antitoxin proteins with the length of 30–200 amino acid residues were kept as candidates. Finally, two flanking toxin and antitoxin candidate genes on the same DNA strand and with the intergenic distances of −20 to 150 bp were paired as an operon structure, thus predicted as a putative Type II TA locus. The Perl/Bioperl scripts have been written to parse HMMer/BLASTp search results and co-localize significant hits efficiently on a local Linux server. The TAfinder tool was designed to contain two functional modules, TAfinder-Predict and TAfinder-Compare. For the TAfinder-Predict module, the input data in multiple types are acceptable, including the amino acid sequence, nucleotide sequence, the annotated GenBank file and even the unannotated sequences of scaffolds or contigs. For input file with nucleotide sequence, the TAfinder users are encouraged to submit the annotated data, preferably with manual curation, for the maximum accuracy. But the raw nucleotide sequence could be also accepted. It would be preprocessed by our quick gene annotation tool CDSeasy (18), and then used as an input to TAfinder for TA prediction. We have also downloaded the genomic data of 5184 completely sequenced replicons. Users could select the data of interest either by ticking in the genome list or if applicable, typing in the RefSeq accession number. The output interface of the TAfinder-Predict module presents the putative TA pairs in tabular form on the web, and provides the hyperlink to other databases publicly available, such as NCBI, to meet the users’ demands. The TAfinder-Predict module also helps to investigate the genomic context of predicted TA loci for virulence factor, antimicrobial resistance determinants and genes related to horizontal transfer via alignments to the specific public databases (Supplementary Figure S3). Additionally, the TAfinder-Compare module allows doing the alignment between the identified TA loci in multiple closely related bacterial genomes. Finally, TAfinder-Compare measures the toxin or antitoxin protein sequence similarities by using BLASTp-based H-value (19,20) (Supplementary Figure S5).

RESULTS AND DISCUSSION

The web-based database TADB1.1 published in 2011 (12) has been offering a comprehensive compilation of both in silico predicted and experimentally supported type II TA locus data and genetic features. As a unique TA database, it has been cited more than 100 times. Recently a number of new type II TA pairs and other type TA systems have been characterized experimentally, suggesting an urgent demand for the updated version. Recent developments present in this study have further improved the data quality of TADB, such as the type II TA loci curation with experimental literature support. Compared to TADB1.1, the updated TADB2.0 offers three major improvements: (i) new type II TA loci dataset via manual curation; (ii) new data schema of the back-end database; (iii) online tool for type II TA pair prediction. Experimentally validated dataset provided by TADB could be applicable to detect putative type II TA loci in a wide range of bacterial species. For example, there is no well documented relBE family of type II TA systems reported in Streptomyces. We thus searched for the relBE locus against eight completely sequenced Streptomyces genomes based on the conserved RelBE domains, and then identified and characterized a new chromosomal relBE locus in Streptomyces cattleya DSM46488 (21). We also examined ten completely sequenced Klebsiella pneumoniae genomes and 212 putative type II TA loci were identified, including 77 toxin proteins containing GNAT conserved domain (22). In this study, extending on our local Perl/Bioperl scripts, we developed user-friendly tool TAfinder as a public resource for detecting type II TA pairs in bacterial genome sequences. It employed the training dataset taken from TADB2.0. With respect to RASTA based on individual toxin or antitoxin domain searches, TAfinder was designed to identify the TA pair to facilitate functional interpretation. It combines homolog searches and operon feature detection (Supplementary Figure S5). To date, TAfinder has been applied in silico analysis and/or the support of the experimental validations in several bacteria by other research groups, including Staphylococcus aureus (23), Acinetobacter baumannii (24) and the IncX plasmids (25). While the TADB2.0 browse module is still functioning (to accommodate the experimentally validated TA loci), for predicting putative type II TA loci, we strongly encourage users to run TAfinder instead of only browsing the TADB2.0 web page. Comprehensive genome in silico analysis revealed that the type II TA systems are diverse and widespread in the prokaryotic organisms (26). After the examination of 2786 species of prokaryotes with the publicly available complete genome sequences (Supplementary Figure S6 and Table S2), TAfinder prediction results showed that 66% of species harbored 1–20 type II TA loci in individual strains while 20% of species carried more than 20 loci. The cross-talks of the intra- and inter-type TA systems can be explored (23). Remarkably, there are no type II TA loci detected in 14% of species, including Prochlorococcus marinus and the small Mycoplasma symbionts. Whether the TA systems might promote the bacterial fitness and facilitate the evolution of free-living organisms awaits investigation within the population. In addition, among all type II TA loci of the organisms under study, the top five conserved domains of the toxin proteins were RelE, PIN, MNT, MazF and VapC (Supplementary Figure S7A). But the distribution biases might change for the different species; for example, the number of the GNAT toxins in S. enterica (Supplementary Figure S7B) and K. pneumoniae (Supplementary Figure S7C) rises dramatically, just following that of the RelE toxins. In their corresponding antitoxins, the DNA binding-associated domains HTH, RHH and DUF1778 were usually found. A few plasmid-encoded TA systems have been well documented, providing competitive advantage to the host. Out of the 4419 (2165 from whole genome sequence data +2254 from independent plasmid sequencing) plasmids with the whole sequences (Supplementary Table S2), 33.9% (1499/4419) plasmids carried 3592 TAfinder-predicted type II TA loci. Additionally, the TA loci have also been increasingly discovered within or close to the mobile elements on bacterial chromosomes, including prophages, genomic islands (GIs) and integrative and conjugative elements (ICEs) (27,28). Via the TAfinder predictions with the input from multiple data sources (Supplementary Table S2), 33.6% (85/253) ICEs had 120 putative type II TA loci, 26.3% (863/3278) prophages carried 1237 type II TA loci and 28.1% (1099/3918) GIs host 1563 type II TA loci (Figure 1 and Supplementary Figures S8). These type II TA systems might play key roles in the stabilization of the self-transmissible integrative elements, which thus contribute to the horizontal transfer of virulence factors, antibiotic-resistant determinants and many other important bacterial traits.
Figure 1.

The number of TAfinder-predicted type II TA loci in the mobile genetic elements. Panel (A), (B), (C) and (D) shows the type II TA number for 4419 plasmids, 253 integrative conjugative elements (ICEs), 3278 prophages and 3918 genomic islands (GIs) under analysis (Supplementary Table S2), respectively.

CONCLUSION

Here, we reported a major upgrade of TADB, where the type II TA loci have been collected in TADB2.0, with an optimized database back-end organization. The unique type II TA system prediction tool has been integrated into this database, aiding researcher to explore new TA pairs in the high-speed growing bacterial genomic data. New records of publications will be mined to update the status of currently archived TA systems. Newly available data on TA loci will be uploaded regularly to keep pace with the rapidly expanding bacterial genome database. Finally, we propose an updated type II TA-specific resource which is expected to facilitate efficient investigation of large numbers of these systems, recognition of patterns corresponding to cellular targets of diverse toxins, and the neutralization mechanism of the cognate antitoxin, and an improved understanding of their biological roles and significance. Click here for additional data file.
  28 in total

1.  AtaT blocks translation initiation by N-acetylation of the initiator tRNAfMet.

Authors:  Dukas Jurėnas; Sneha Chatterjee; Albert Konijnenberg; Frank Sobott; Louis Droogmans; Abel Garcia-Pino; Laurence Van Melderen
Journal:  Nat Chem Biol       Date:  2017-04-03       Impact factor: 15.040

Review 2.  Bacterial persistence and toxin-antitoxin loci.

Authors:  Kenn Gerdes; Etienne Maisonneuve
Journal:  Annu Rev Microbiol       Date:  2012       Impact factor: 15.500

3.  Persister formation in Staphylococcus aureus is associated with ATP depletion.

Authors:  Brian P Conlon; Sarah E Rowe; Autumn Brown Gandt; Austin S Nuxoll; Niles P Donegan; Eliza A Zalis; Geremy Clair; Joshua N Adkins; Ambrose L Cheung; Kim Lewis
Journal:  Nat Microbiol       Date:  2016-04-18       Impact factor: 17.745

Review 4.  Toxin-antitoxin systems in bacterial growth arrest and persistence.

Authors:  Rebecca Page; Wolfgang Peti
Journal:  Nat Chem Biol       Date:  2016-04       Impact factor: 15.040

5.  Extensive genomic diversity in pathogenic Escherichia coli and Shigella Strains revealed by comparative genomic hybridization microarray.

Authors:  Satoru Fukiya; Hiroshi Mizoguchi; Toru Tobe; Hideo Mori
Journal:  J Bacteriol       Date:  2004-06       Impact factor: 3.490

6.  Recent advancements in toxin and antitoxin systems involved in bacterial programmed cell death.

Authors:  Ming-Xi Hu; Xiao Zhang; Er-Li Li; Yong-Jun Feng
Journal:  Int J Microbiol       Date:  2010-12-27

7.  Toxin-antitoxin loci are highly abundant in free-living but lost from host-associated prokaryotes.

Authors:  Deo Prakash Pandey; Kenn Gerdes
Journal:  Nucleic Acids Res       Date:  2005-02-17       Impact factor: 16.971

8.  RASTA-Bacteria: a web-based tool for identifying toxin-antitoxin loci in prokaryotes.

Authors:  Emeric W Sevin; Frédérique Barloy-Hubler
Journal:  Genome Biol       Date:  2007       Impact factor: 13.583

9.  HMMER web server: 2015 update.

Authors:  Robert D Finn; Jody Clements; William Arndt; Benjamin L Miller; Travis J Wheeler; Fabian Schreiber; Alex Bateman; Sean R Eddy
Journal:  Nucleic Acids Res       Date:  2015-05-05       Impact factor: 16.971

Review 10.  Bacterial toxin-antitoxin systems: more than selfish entities?

Authors:  Laurence Van Melderen; Manuel Saavedra De Bast
Journal:  PLoS Genet       Date:  2009-03-27       Impact factor: 5.917

View more
  80 in total

1.  Characterization of DinJ-YafQ toxin-antitoxin module in Tetragenococcus halophilus: activity, interplay, and evolution.

Authors:  Xiaotong Luo; Jieting Lin; Junwei Yan; Xiaoxian Kuang; Hantao Su; Weifeng Lin; Lixin Luo
Journal:  Appl Microbiol Biotechnol       Date:  2021-04-20       Impact factor: 4.813

2.  Presence of toxin-antitoxin systems in picocyanobacteria and their ecological implications.

Authors:  Daniel Fucich; Feng Chen
Journal:  ISME J       Date:  2020-08-19       Impact factor: 10.302

3.  Identification of Type II Toxin-Antitoxin Loci in Levilactobacillus brevis.

Authors:  Ying-Xian Goh; Yang He; Hong-Yu Ou
Journal:  Interdiscip Sci       Date:  2021-10-18       Impact factor: 2.233

Review 4.  Small-Molecule Acetylation by GCN5-Related N-Acetyltransferases in Bacteria.

Authors:  Rachel M Burckhardt; Jorge C Escalante-Semerena
Journal:  Microbiol Mol Biol Rev       Date:  2020-04-15       Impact factor: 11.056

5.  Genome-Wide Screening for Identification of Novel Toxin-Antitoxin Systems in Staphylococcus aureus.

Authors:  Fuminori Kato; Satoshi Yoshizumi; Yoshihiro Yamaguchi; Masayori Inouye
Journal:  Appl Environ Microbiol       Date:  2019-10-01       Impact factor: 4.792

6.  Evaluation of gene expression and protein structural modeling involved in persister cell formation in Salmonella Typhimurium.

Authors:  Negar Narimisa; Fatemeh Amraei; Behrooz Sadeghi Kalani; Faramarz Masjedian Jazi
Journal:  Braz J Microbiol       Date:  2020-10-30       Impact factor: 2.476

7.  Comparative genome analyses of Mycobacteroides immunogenum reveals two potential novel subspecies.

Authors:  Siew Woh Choo; Shusruto Rishik; Wei Yee Wee
Journal:  Microb Genom       Date:  2020-12-09

8.  Edwardsiella piscicida YefM-YoeB: A Type II Toxin-Antitoxin System That Is Related to Antibiotic Resistance, Biofilm Formation, Serum Survival, and Host Infection.

Authors:  Dongmei Ma; Hanjie Gu; Yanjie Shi; Huiqin Huang; Dongmei Sun; Yonghua Hu
Journal:  Front Microbiol       Date:  2021-03-01       Impact factor: 5.640

9.  Identification of Three Type II Toxin-Antitoxin Systems in Model Bacterial Plant Pathogen Dickeya dadantii 3937.

Authors:  Lidia Boss; Marcin Górniak; Alicja Lewańczyk; Joanna Morcinek-Orłowska; Sylwia Barańska; Agnieszka Szalewska-Pałasz
Journal:  Int J Mol Sci       Date:  2021-05-31       Impact factor: 5.923

10.  The higBA-Type Toxin-Antitoxin System in IncC Plasmids Is a Mobilizable Ciprofloxacin-Inducible System.

Authors:  Qin Qi; Muhammad Kamruzzaman; Jonathan R Iredell
Journal:  mSphere       Date:  2021-06-02       Impact factor: 4.389

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.