| Literature DB >> 17678530 |
Emeric W Sevin1, Frédérique Barloy-Hubler.
Abstract
Toxin/antitoxin (TA) systems, viewed as essential regulators of growth arrest and programmed cell death, are widespread among prokaryotes, but remain sparsely annotated. We present RASTA-Bacteria, an automated method allowing quick and reliable identification of TA loci in sequenced prokaryotic genomes, whether they are annotated open reading frames or not. The tool successfully confirmed all reported TA systems, and spotted new putative loci upon screening of sequenced genomes. RASTA-Bacteria is publicly available at http://genoweb.univ-rennes1.fr/duals/RASTA-Bacteria.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17678530 PMCID: PMC2374986 DOI: 10.1186/gb-2007-8-8-r155
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Figure 1Schematic modular pipeline of RASTA-Bacteria. Step 1: provide a nucleic genome sequence in GenBank or raw Fasta format. Step 2: tune the search parameters (optional). Step 3: launch the search; each module calculates a local score, and possibly modifies the dataset (Sx = score at level x; Ny = number of ORFs in dataset; Lz = length distribution of dataset; b1 = bonus). Step 4: output in webpage and/or results files available for download.
Figure 2Screenshot of the results displayed as a webpage. This illustration shows the output results ranked by confidence score. The arrows represent internal links to additional supporting data. The amino acid sequence corresponding to an ORF as annotated by RASTA-Bacteria is shown (1). When a conserved TA domain was predicted, the alignment results can be seen in rpsblast output format (2). Anchor links between co-localized candidates allow checking for possible parity (3).
Figure 3General genetic context of a TA loci. The typical TA loci organization with sizes and distance profiling is shown.
Figure 4Length distribution of Bacterial toxins and antitoxins. The graph represents the length distribution of antitoxins and toxins in 126 organisms (from [39]), depending on their classification (X-axis, length in amino acids; left Y-axis, number of sequences). The black curves represent the probability over the total population (1,378 TA) for a sequence of length X to constitute a TA (right Y-axis), and were used to determine the length-criterion scoring function as described in the text.
List of PSSM profiles selected in TAcddb to verify the presence of a conserved TA-related domain
| PSSMid | CD accession name | Relation/involvement in TA world | Reference |
| 28977 | cd00093-HTH_XRE | XRE-like domain present in HigA and VapB antitoxins | [20], this study |
| 31586 | COG1396-HipB | Involved in production of persister cells (antitoxin) | [20] |
| 31676 | COG1487-VapC | Quality control of gene expression | [57] |
| 31786 | COG1598 | HicB of HicAB system (function undetermined) | [58] |
| 31910 | COG1724 | HicA of HicAB system (function undetermined) | [58] |
| 32033 | COG1848 | PIN domain, present in VapC toxins | [20,59,60] |
| 32185 | COG2002-AbrB | Domain present in of MazE and VapB antitoxins | [20] |
| 32209 | COG2026-RelE | Toxin of cytotoxic translational repressor system | [14,28,29] |
| 32344 | COG2161-StbD | Antitoxin of the RelBE family | [61] |
| 32487 | COG2336-MazE | Growth regulator (antitoxin) | [45] |
| 32488 | COG2337-MazF | Growth inhibitor (toxin) | [45] |
| 32907 | COG3093-VapI | Named from VapI region; corresponds to VapB antitoxins (Plasmid maintenance) | [62], this study |
| 33351 | COG3549-HigB | Toxin of plasmid maintenance system | [23] |
| 33352 | COG3550-HipA | Involved in production of persister cells (toxin) | [20] |
| 33408 | COG3609 | CopG/Arc/MetJ DNA-binding domain, present in RelB, ParD, VapBCand CcdA antitoxins | [20], this study |
| 33452 | COG3654-Doc | Toxin of probable translational inhibitor system | [25,63] |
| 33466 | COG3668-ParE | Toxin of plasmid stabilization system | [22,64] |
| 33870 | COG4113 | PIN domain, present in VapC toxins | [20,59,60] |
| 33875 | COG4118-Phd | Antitoxin to translational inhibitor Doc | [65] |
| 33951 | COG4226-HicB | HicB of HicAB system (predicted) | [58] |
| 34119 | COG4423 | Predicted antitoxin of PIN domain toxins (VapC) | [57,60] |
| 34135 | COG4456-VagC | Antitoxin of plasmid maintenance system | [66] |
| 34307 | COG4691-StbC | Plasmid stability proteins (HigBA family) | [67,68], this study |
| 34891 | COG5302-CcdA | Antitoxin of plasmid stabilization system | [27,69] |
| 35058 | COG5499 | Predicted transcription regulators with HTH domain | [20], this study |
| 41431 | pfam01381-Hth_3 | Present in antitoxins of HigBA and VapBC families | [20], this study |
| 41452 | pfam01402-Hth_4 | Present in CopG repressors (RelBE, ParDE, VapBC, and CcdAB families) | [20], this study |
| 41869 | pfam01845-CcdB | Toxin of plasmid stabilization system | [69] |
| 41874 | pfam01850-PIN | DNA binding PIN domain, present in VapC toxins | [59,60] |
| 42429 | pfam02452-PemK | Toxin of the MazEF family | [70] |
| 43931 | pfam04014-AbrB | Domain present in MazE and VapB antitoxins | [20], this study |
| 44135 | pfam04221-RelB | Antitoxin to translational repressor RelE | [14] |
| 44915 | pfam05012-Doc | Toxin of probable translational inhibitor system | [63] |
| 44918 | pfam05015-Plasmid_killer | Toxins of the HigBA family | [23], this study |
| 44919 | pfam05016-Plasmid_stabil | Toxins of the RelE family | [14], this study |
| 45431 | pfam05534-HicB | Member of the HicAB system | [58] |
| 47246 | pfam07362-CcdA | Antitoxin of plasmid stabilization system | [27,69] |
| 47831 | smart00530-Xre | XRE-like HTH domain present in HigA and VapB | [20], this study |
References to 'this study' correspond to domains found in this study upon sequence analysis of described TA candidates. AbrB, AidB regulator; HTH, helix-turn-helix; PIN, homologues of the pilin biogenesis protein pilT amino-terminal domain; XRE, xenobiotic response element.
Figure 5TA loci features in individual replicons of S. meliloti strain 1021. (a) Repartition of TA loci in the chromosome (new and confirming Pandey et al.'s [39] findings) and in the two megaplasmids. (b) Percentage of TA loci as a function of replicon size. (c) Repartition with respect to leading and lagging strands of replication. (d) Frequency of the three genomic organizations found for TA genes in the three replicons.
Figure 6Maps of TA loci in individual replicons of S. meliloti strain 1021. The maps were created using CGView [53,54]. Green labels represent newly annotated TA genes, and orange labels represent RASTA-Bacteria predicted TA genes previously reported by Pandey et al. [39] On the chromosome, the grey SmeXXX regions correspond to genomic islands as described in the Islander database [55,56].
Results for 14 previously studied organisms
| Organism | Unclass. | Total | ||||||||
| 0 (0) | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 1 (0) | 6 (2) | 2 (0) | 0 | 9 (2) | |
| 0 (0) | 0 (0) | 1 (1) | 3 (3) | 1 (0) | 7 (7) | 5 (3) | 6 (0) | 1 | 24 (14) | |
| 0 (0) | 0 (0) | 1 (1) | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 0 | 1 (1) | |
| 0 (0) | 0 (0) | 1 (1) | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 0 | 1 (1) | |
| 0 (0) | 4 (4) | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 1 (1) | 6 (0) | 0 | 12 (5) | |
| 0 (0) | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 0 | 0 (0) | |
| 0 (0) | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 0 | 0 (0) | |
| 0 (0) | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 1 (0) | 1 (0) | 0 (0) | 0 | 2 (0) | |
| 0 (0) | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 0 | 0 (0) | |
| 0 (0) | 3 (3) | 0 (0) | 1 (1) | 0 (0) | 1 (1) | 2 (2) | 0 (0) | 3 | 10 (7) | |
| 0 (0) | 1 (1) | 2 (2) | 0 (0) | 0 (0) | 4 (3) | 2 (0) | 0 (0) | 0 | 10 (6) | |
| 0 (0) | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 0 | 0 (0) | |
| 0 (0) | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 1 (0) | 6 | 7 (0) | |
| 0 (0) | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 0 | 0 (0) | |
| 0 (0) | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 0 | 0 (0) | |
| 1 (1) | 8 (7) | 5 (5) | 6 (6) | 2 (2) | 10 (10) | 20 (14) | 1 (0) | 4 | 57 (45) | |
| 0 (0) | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 0 | 0 (0) | |
| 0 (0) | 5 (1) | 0 (0) | 1 (1) | 1 (0) | 1 (1) | 0 (0) | 2 (0) | 3 | 13 (3) | |
| 0 (0) | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 0 | 0 (0) | |
| 0 (0) | 6 (3) | 1 (0) | 8 (0) | 3 (0) | 2 (2) | 27 (7) | 1 (0) | 5 | 53 (12) | |
| 0 (0) | 0 (0) | 0 (0) | 0 (0) | 3 (3) | 4 (4) | 29 (25) | 1 (0) | 0 | 37 (32) | |
| 0 (0) | 0 (1) | 0 (0) | 0 (0) | 0 (0) | 1 (0) | 0 (0) | 0 (0) | 0 | 2 (1) |
Numbers stand for TA systems (singleton or doublet) as predicted by: RASTA-Bacteria (numbers in parentheses are as predicted by Pandey et al. [39]). *Plasmids were not included in the analysis by Pandey et al. [39]. Unclass., unclassified.
RASTA-Bacteria predictions for newly sequenced genomes
| Organism | UnClass. | Total | ||||||||
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
| 0 | 0 | 0 | 0 | 0 | 1 | 2 | 0 | 1 | 4 | |
| 0 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 2 | |
| 0 | 2 | 1 | 0 | 0 | 2 | 8 | 5 | 15 | 33 | |
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
| 0 | 2 | 0 | 0 | 0 | 0 | 0 | 3 | 2 | 7 | |
| 0 | 0 | 1 | 0 | 0 | 0 | 2 | 0 | 3 | 6 | |
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
| 0 | 1 | 4 | 0 | 0 | 3 | 3 | 1 | 0 | 12 | |
| 1 | 3 | 1 | 5 | 0 | 1 | 1 | 0 | 0 | 12 | |
| 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 4 | |
| 0 | 0 | 0 | 2 | 0 | 1 | 1 | 0 | 0 | 4 | |
| 1 | 6 | 0 | 4 | 1 | 1 | 4 | 0 | 0 | 17 | |
| 0 | 2 | 1 | 0 | 1 | 0 | 0 | 0 | 1 | 5 | |
| 0 | 2 | 3 | 0 | 0 | 7 | 15 | 0 | 2 | 30 | |
| 2 | 1 | 9 | 2 | 0 | 3 | 48 | 0 | 0 | 65 | |
| 0 | 1 | 2 | 0 | 2 | 1 | 5 | 0 | 3 | 14 | |
| 0 | 4 | 0 | 0 | 0 | 0 | 0 | 1 | 3 | 10 | |
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
| 0 | 2 | 2 | 1 | 1 | 4 | 10 | 4 | 1 | 25 | |
| 0 | 1 | 0 | 0 | 0 | 0 | 2 | 0 | 2 | 5 | |
| 0 | 1 | 0 | 2 | 0 | 0 | 0 | 1 | 3 | 8 | |
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
| 0 | 0 | 2 | 2 | 0 | 0 | 0 | 1 | 1 | 6 | |
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
| 0 | 0 | 0 | 0 | 0 | 7 | 0 | 0 | 0 | 7 |
Unclass., unclassified.