| Literature DB >> 32620855 |
Enhui Shen1,2, Kangli Zhu1,2, Danyang Li1,2, Zhenrui Pan1,2, Yun Luo3,4, Qiao Bian5, Liuqing He1,2, Xiaojun Song6, Ying Zhen1,2, Dazhi Jin6,7, Liang Tao8,9.
Abstract
Clostridioides difficile toxins (TcdA and TcdB) are major exotoxins responsible for C. difficile infection (CDI) associated diseases. The previously reported TcdB variants showed distinct biological features, immunoactivities, and potential pathogenicity in disease progression. Here, we performed global comparisons of amino acid sequences of both TcdA and TcdB from 3,269 C. difficile genomes and clustered them according to the evolutionary relatedness. We found that TcdB was much diverse and could be divided into eight subtypes, of which four were first described. Further analysis indicates that the tcdB gene undergoes accelerated evolution to maximize diversity. By tracing TcdB subtypes back to their original isolates, we found that the distribution of TcdB subtypes was not completely aligned with the phylogeny of C. difficile. These findings suggest that the tcdB genes not only frequently mutate, but also continuously transfer and exchange among C. difficile strains.Entities:
Mesh:
Substances:
Year: 2020 PMID: 32620855 PMCID: PMC7335066 DOI: 10.1038/s42003-020-1078-y
Source DB: PubMed Journal: Commun Biol ISSN: 2399-3642
Fig. 1TcdB sequences are clustered into eight subtypes.
a Neighbor-joining cluster analysis of 128 unique amino-acid sequences from 2691 valid TcdB sequences. The shadows with different colors highlighted the eight TcdB subtypes with the minimal cutoff of 5.03% dissimilarity. b Sequence variations between TcdB1 and other TcdB subtypes. The top bar chart represents the structural arrangement of four domains (GTD, CPD, DRBD, and CROPs) in TcdB1. The black vertical lines in bar charts below mark the positions of divergent amino-acid residues in other TcdB subtypes when compared with TcdB1. c The time to death of C57BL/6 mice that were injected intraperitoneally with 1 μg/kg TcdB of divergent subtypes. (n = 6 mice) d Breakdown of genome-sequenced C. difficile strains in North America, East Asia, Europe, and Australia by TcdB subtypes 1–4.
Minimum between-subtype amino-acid differences among TcdB subtypes.
| Subtype | Numbers of strains | Minimum between-subtype differences (%) | Maximum within-subtype difference (%) | Representative strain | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| TcdB1 | TcdB2 | TcdB3 | TcdB4 | TcdB5 | TcdB6 | TcdB7 | TcdB8 | ||||
| TcdB1 | — | 5.33 | 5.75 | 11.15 | 5.70 | 10.06 | 10.48 | 6.72 | 4.14 | 630 | |
| TcdB2 | — | 10.10 | 5.15 | 11.24 | 6.00 | 5.75 | 8.92 | 4.27 | CD196 | ||
| TcdB3 | — | 6.38 | 5.03 | 13.44 | 13.77 | 12.13 | 0.85 | 1470 | |||
| TcdB4 | — | 8.96 | 9.63 | 8.32 | 13.48 | 3.93 | 8864 | ||||
| TcdB5 | — | 14.32 | 14.32 | 12.17 | 0.17 | ES130 | |||||
| TcdB6 | — | 5.92 | 11.33 | — | CD160 | ||||||
| TcdB7 | — | 11.62 | 0.17 | CD10–165 | |||||||
| TcdB8 | — | — | 173070 | ||||||||
Fig. 2TcdB has higher sequence diversity compared with TcdA.
a The classic PaLoc structure in C. difficile genome referred to strain 630. The arrows represent the ORFs in the PaLoc. b Amino-acid sequence diversity of TcdA and TcdB. The vertical lines in the bar charts represent positions of non-conserved amino-acid residues. The gradient color (white to black) of the vertical lines indicate different conservation (100–70%) of amino-acid positions in the sequences. Positions with conservation <70% were marked as black vertical lines. c Riverplot graph showing associations between TcdB subtypes and tcdC alleles. NA means no valid sequence was identified in the C. difficile genome.
PAML analyses for the tcdA and tcdB genes.
| Models | Gene | Hypothesis (model) | Hypothesis-testing | |||
|---|---|---|---|---|---|---|
| M1a versus M2a | Null (M1a) | −11405.514 | <0.001 | Positive selection | 5.889 | |
| Alternative (M2a) | −11394.345 | |||||
| Null (M1a) | −18796.020 | 0.010 | Positive selection | 35.957 | ||
| Alternative (M2a) | −18791.455 | |||||
| M8a versus M8 | Null (M8a) | −11394.667 | <0.001 | Positive selection | 4.869 | |
| Alternative (M8) | −11405.514 | |||||
| Null (M8a) | −18763.111 | 0.0017 | Positive selection | 27.059 | ||
| Alternative (M8) | −18768.022 |
aInL is the log-likelihood score.
bLikelihood ratio test to detect positive selection, p < 0.05 was considered significant.
cRatio of non-synonymous and synonymous substitution.
Fig. 3The distribution of TcdB subtypes in the phylogeny of STs.
A maximum likelihood tree was built for 110 C. difficile MLST types based on seven housekeeping genes (adk, atpA, dxr, glyA, recA, sodA, and tpi). Each color in the nodes represents one subtype of TcdB (TcdB1 in pink, TcdB2 in light blue, TcdB3 in lavender, TcdB4 in yellow, TcdB5 in green, TcdB6 in aubergine, TcdB7 in orange and TcdB8 in brown). TcdB1b and TcdB1c were denoted with one and two dots. C. difficile clade 1–5 were denoted by colored arcs. ST122, ST200, ST360, and ST361 are marked as outliers (red arcs). The red arrows and boxes highlight two C. difficile MLST types that contain either TcdB2 or TcdB4 in different isolates.