| Literature DB >> 32545455 |
Ksenia M Klimina1,2, Viktoriya N Voroshilova1,3, Elena U Poluektova1, Vladimir A Veselovsky2, Roman A Yunes1, Aleksey S Kovtun1,3, Anna V Kudryavtseva4, Artem S Kasianov1,3, Valery N Danilenko1,5.
Abstract
The human gastrointestinal microbiota (HGM) is known for its rich diversity of bacterial species and strains. Yet many studies stop at characterizing the HGM at the family level. This is mainly due to lack of adequate methods for a high-resolution profiling of the HGM. One way to characterize the strain diversity of the HGM is to look for strain-specific functional markers. Here, we propose using type II toxin-antitoxin systems (TAS). To identify TAS systems in the HGM, we previously developed the software TAGMA. This software was designed to detect the TAS systems, MazEF and RelBE, in lactobacilli and bifidobacteria. In this study, we updated the gene catalog created previously and used it to test our software anew on 1346 strains of bacteria, which belonged to 489 species and 49 genera. We also sequenced the genomes of 20 fecal samples and analyzed the results with TAGMA. Although some differences were detected at the strain level, the results showed no particular difference in the bacterial species between our method and other classic analysis software. These results support the use of the updated catalog of genes encoding type II TAS as a useful tool for computer-assisted species and strain characterization of the HGM.Entities:
Keywords: gut microbiota; metagenomes; strain identification; toxin-antitoxin systems
Year: 2020 PMID: 32545455 PMCID: PMC7354421 DOI: 10.3390/toxins12060388
Source DB: PubMed Journal: Toxins (Basel) ISSN: 2072-6651 Impact factor: 4.546
Type II TAS used in this study.
| System | Toxin | Antitoxin |
|---|---|---|
| BrnT/BrnA | BrnT | BrnA |
| Ccd | CcdB | CcdA |
| HicAB | HicA | HicB |
| HigBA | HigB | HigA |
| HipAB | HipA | HipB |
| MazEF | MazF | MazE |
| MqsR/MqsA | MqsR | MqsA |
| Omega-Epsilon-Zeta (ω-ε-ζ) | Zeta | Epsilon |
| ParDE | ParE | ParD |
| PezA/Pez | PezT | PezA |
| Phd/Doc | Doc | Phd |
| PhoAT/PhoH2 | PhoH2 | PhoAT |
| RelBE | RelE | RelB |
| VapBC | VapC | VapB |
| YafQ/DinJ | YafQ | DinJ |
| YoeB/YefM | YoeB | YefM |
| RatA/RatB | RatA | RatB |
| FitB/FitA | FitB | FitA |
| Kid/Kis | Kid | Kis |
| AtaT/AtaR | ataT | ataR |
Pfam protein families of toxins and antitoxins used in this study.
| Pfam Clan | Pfam Protein Family of Toxins | Pfam Protein Family of Antitoxins |
|---|---|---|
|
| RelE (PF06296) | PhdYeFM_antitox (PF02604) |
| Gp49 (PF05973) | RelB (PF04221) | |
| HigB-like_toxin (PF05015) | PHD_like (PF12910) | |
| ParE-like_toxin (PF15781) | ||
| ParE_toxin (PF05016) | ||
| YafQ_toxin (PF15738) | ||
| MqsR_toxin (PF15723) | ||
| BrnT-toxin (PF04365) | ||
| YoeB_toxin (PF06769) | ||
|
| HicA_toxin (PF07927) | CcdA (PF07362) |
| BrnA_antitoxin (PF14384) | ||
| CopG_antitoxin (PF12441) | ||
| HicB (PF05534) | ||
| HicB-like_2 (PF15970) | ||
| HicB_lk_antitox (PF15919) | ||
| ParD (PF09386) | ||
| ParD_antitoxin (PF03693) | ||
| ParD_like (PF11903) | ||
| VapB_antitoxin (PF09957) | ||
|
| PemK_toxin (PF02452) | |
| CcdB (PF01845) | ||
|
| MazE_antitoxin (PF04014) | |
| MraZ (PF02381) | ||
| PrlF_antitoxin (PF15937) |
The number of genera, species and strains in the final catalog.
| No. | Genus | N of Species | N of Strains |
|---|---|---|---|
| 1 |
| 1 | 5 |
| 2 |
| 13 | 22 |
| 3 |
| 2 | 4 |
| 4 |
| 1 | 1 |
| 5 |
| 4 | 5 |
| 6 |
| 1 | 1 |
| 7 |
| 2 | 6 |
| 8 |
| 1 | 2 |
| 9 |
| 19 | 64 |
| 10 |
| 33 | 79 |
| 11 |
| 3 | 9 |
| 12 |
| 4 | 10 |
| 13 |
| 1 | 2 |
| 14 |
| 42 | 109 |
| 15 |
| 4 | 5 |
| 16 |
| 3 | 5 |
| 17 |
| 11 | 15 |
| 18 |
| 3 | 4 |
| 19 |
| 2 | 7 |
| 20 |
| 1 | 2 |
| 21 |
| 6 | 34 |
| 22 |
| 27 | 82 |
| 23 |
| 4 | 106 |
| 24 |
| 18 | 31 |
| 25 |
| 2 | 9 |
| 26 |
| 5 | 29 |
| 27 |
| 1 | 1 |
| 28 |
| 13 | 43 |
| 29 |
| 1 | 1 |
| 30 |
| 4 | 58 |
| 31 |
| 137 | 225 |
| 32 |
| 13 | 18 |
| 33 |
| 1 | 1 |
| 34 |
| 2 | 3 |
| 35 |
| 6 | 6 |
| 36 |
| 1 | 1 |
| 37 |
| 2 | 2 |
| 38 |
| 1 | 2 |
| 39 |
| 4 | 9 |
| 40 |
| 7 | 14 |
| 41 |
| 15 | 26 |
| 42 |
| 4 | 10 |
| 43 |
| 5 | 15 |
| 44 |
| 4 | 12 |
| 45 |
| 9 | 22 |
| 46 |
| 31 | 208 |
| 47 |
| 1 | 1 |
| 48 |
| 4 | 8 |
| 49 |
| 10 | 12 |
Figure 1Two-dimensional model of the relative localization of TA protein families, constructed with CLANS software.
Value ranges of the selected parameters.
| Parameter Number | Parameter | Value Range |
|---|---|---|
| 1 | Uniqueness | 1–15 |
| 2 | Coverage, b.p. | 30–280 |
| 3 | Coverage, % | 40–100 |
| 4 | Number of significant SNPs | 0–20 |
| 5 | Threshold number of TAS | 2–15 |
| 6 | Number of TAS in summary.txt | 1–10 |
| 7 | Number of TAS in test_results_short.txt | 1–10 |
Comparison of the mean Jaccard index (JI) of nine samples processed using MetaPhlAn2 and TAGMA before and after adjusting the thresholds.
| Samples | 1 | 2 | 3 | 4 | 6 | 7 | 8 | 9 | |
|---|---|---|---|---|---|---|---|---|---|
| Software | |||||||||
| MetaPhlAn2 | 0.58 | 0.55 | 0.75 | 0.79 | 0.59 | 0.62 | 0.24 | 0.58 | |
| TAGMA (before adjustment) | 0.45 | 0.73 | 0.74 | 0.56 | 0.75 | 0.71 | 0.73 | 0.47 | |
| TAGMA (after adjustment) | 0.66 | 0.88 | 0.80 | 0.52 | 0.79 | 0.77 | 0.53 | 0.68 | |
Comparison of the false positive rate (FPR) of nine samples processed using MetaPhlAn2 and TAGMA before and after adjusting the thresholds.
| Samples | 1 | 2 | 3 | 4 | 6 | 7 | 8 | 9 | |
|---|---|---|---|---|---|---|---|---|---|
| Software | |||||||||
| MetaPhlAn2 | 29.73 | 27.27 | 14.29 | 9.52 | 24.00 | 30.30 | 35.71 | 29.73 | |
| TAGMA (before adjustment) | 53.03 | 25.00 | 20.69 | 33.33 | 22.86 | 28.57 | 25.00 | 50.79 | |
| TAGMA (after adjustment) | 13.79 | 0.00 | 0.00 | 27.27 | 0.00 | 4.00 | 4.76 | 10.71 | |
Comparison of the mean JI and the FPR (%) of the results generated using MetaPhlAn2 and TAGMA before and after adjusting the thresholds.
| Indicators | Mean JI | Mean Standard Error for JI | Mean FPR (%) | Standard Error for the Mean Positive Rate (%) | |
|---|---|---|---|---|---|
| Software | |||||
| MetaPhlAn2 | 0.59 | 0.06 | 25.07 | 3.13 | |
| TAGMA (before adjustment) | 0.64 | 0.05 | 32.41 | 4.47 | |
| TAGMA (after adjustment) | 0.70 | 0.05 | 7.57 | 3.35 | |
Description of metagenomes.
| Accession No. | Reference in the Article | Description |
|---|---|---|
| SRX5327357 | A1 | male, 3 years old |
| SRX5327356 | A2 | 3 years old |
| SRX5327362 | A3 | 1 year old |
| SRX5327358 | A4 | 4 years old |
| SRX5327344 | A5 | male, 3 years old |
| SRX5327353 | A6 | male, 7 years old |
| SRX5327352 | A7 | female, 9 years old |
| SRX5327351 | A8 | male, 7 years old |
| SRX5327350 | A9 | female, 3 years old |
| SRX5327347 | A10 | male, 9 years old |
| SRX5327345 | A11 | male, 4 years old |
| SRX5327355 | A12 | male, 3 years old |
| SRX5327354 | A13 | male, 4 years old |
| SRX5327360 | A14 | male, 8 years old |
| SRX5327359 | A15 | male, 3 years old |
| SRX5327349 | B1 | infant (0–1 year) |
| SRX5327346 | B2 | infant (0–1 year) |
| SRX5327343 | B3 | infant (0–1 year) |
| SRX5327348 | B4 | infant (0–1 year) |
| SRX5327361 | B5 | infant (0–1 year) |
The number of species and strains in sample A1 (species over 1%).
| Sample | Species | N Species MetaPhlan2 | N Species TAGMA | N Strain TAGMA |
|---|---|---|---|---|
| A1 |
| 1 | 1 | 1 |
|
| 4 | 3 | 7 | |
|
| 1 | - | - | |
|
| 2 | 5 | 9 | |
|
| - | 1 | 1 | |
|
| - | 5 | 5 | |
|
| - | 1 | 1 | |
|
| 1 | 1 | 2 | |
|
| - | 1 | 1 | |
|
| - | 1 | 1 | |
|
| 1 | 1 | 3 | |
|
| 1 | 1 | 1 | |
|
| 1 | 1 | 1 | |
|
| - | 2 | 2 | |
|
| 1 | - | - | |
|
| 1 | 1 | 1 | |
|
| 1 | 2 | 3 |