| Literature DB >> 28262711 |
Rong Chen1,2, Yu Feng1,2, Xiaohui Wang1,2, Jingyu Yang1,2, Xiaoxia Zhang1,2, Xiaoju Lü1,2, Zhiyong Zong1,2,3.
Abstract
Clostridium difficile consists of six clades but studies on Clade 3 are limited. Here, we report genome sequences of three Clade 3 C. difficile strains carrying genes encoding toxin A and B and the binary toxin. Isolates 103 and 133 (both of ST5) and isolate 106 (ST285) were recovered from three ICU patients. Whole genome sequencing using HiSeq 2500 revealed 4.1-Mb genomes with 28-29% GC content. There were ≥1,104 SNP between the isolates, suggesting they were not of a single clone. The toxin A and B gene-carrying pathogenicity locus (PaLoc) of the three isolates were identical and had the insertion of the transposon Tn6218. The genetic components of PaLoc among Clade 3 strains were the same with only a few nucleotide mutations and deletions/insertions, suggesting that the Tn6218 insertion might have occurred before the divergence within Clade 3. The binary toxin-genes carrying CDT locus (CdtLoc) of the three isolates were identical and were highly similar to those of other Clade 3 strains, but were more divergent from those of other clades. In conclusion, Clade 3 has an unusual clade-specific PaLoc characteristic of a Tn6218 insertion which appears to be the main feature to distinguish Clade 3 from other C. difficile.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28262711 PMCID: PMC5337907 DOI: 10.1038/srep43555
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Schematic representations of PaLoc and flanking genes.
The 5′ flanking gene cdu1, the 3′ flanking gene cdd1, positive regulator gene tcdR, negative regulator gene tcdC, and toxin genes tcdA and tcdB are shown. For Tn6218, the genetic components from left to right are int, xis, rep, xre, merR, oxidoreductase-encoding gene, flavodoxin-encoding gene, orf and σ70-encoding gene.
ANI values (%) of isolates 103, 106 and 133 compared with other Clade 3 strains and representative strains of other clades.
| Clade | Strain | ST | Genome accession no. | Country | Year | 103 ANI value | 106 ANI value | 133 ANI value |
|---|---|---|---|---|---|---|---|---|
| 3 | 103 | 5 | China | |||||
| 106 | 285 | China | 99.52 | |||||
| 133 | 5 | China | 99.51 | 99.25 | ||||
| ZJCDC-S82 | 5 | JYNK01000000 | China | 2013 | 99.44 | 99.18 | 99.38 | |
| OX2183 | 5 | C00005010 | UK | 2009 | 99.43 | 99.16 | 99.36 | |
| OX1485 | 5 | C00004983 | UK | 2008 | 99.37 | 99.20 | 99.39 | |
| OX561 | 22 | C00001506 | UK | 2007 | 99.42 | 99.20 | 99.41 | |
| OX1232 | 25 | C00002723 | UK | 2007 | 99.46 | 99.18 | 99.49 | |
| L058 | 96 | C00002452 | UK | 2008 | 99.48 | 99.23 | 99.37 | |
| Q24 | 162 | C00006439 | Australia | 2008 | 99.43 | 99.23 | 99.28 | |
| VL-0104 | 201 | FAAJ01000000 | Canada | NA | 99.49 | 99.18 | 99.37 | |
| VL-0391 | 201 | FALK01000001 | Canada | NA | 99.48 | 99.28 | 99.36 | |
| CD69 | 221 | AVHE01000000 | USA | 2010 | 99.41 | 99.22 | 99.52 | |
| 1 | E14 | 2 | CAMS01000000 | France | NA | 97.71 | 97.77 | 97.68 |
| CD37 | 3 | AHJJ01000000 | USA | 1980 | 97.66 | 97.81 | 97.73 | |
| P49 | 4 | AVMN01000000 | USA | NA | 97.91 | 97.91 | 97.77 | |
| E25 | 6 | CAMJ01000000 | France | NA | 97.70 | 97.77 | 97.82 | |
| E16 | 7 | CAMH01000000 | France | NA | 97.90 | 97.88 | 97.74 | |
| P15 | 8 | AVLV01000000 | USA | 2005 | 97.92 | 97.90 | 97.90 | |
| P8 | 10 | AVLR01000000 | USA | 2001 | 97.68 | 97.83 | 97.73 | |
| T6 | 13 | CAMR01000000 | France | NA | 97.61 | 97.55 | 97.66 | |
| P64 | 14 | AWZL01000000 | USA | 2009 | 97.66 | 97.71 | 97.61 | |
| 5.3 | 15 | AZSH01000000 | Australia | 2008 | 97.97 | 97.93 | 97.95 | |
| DA00129 | 16 | AVIY01000000 | USA | 2010 | 97.52 | 97.46 | 97.77 | |
| CD8-15 | 17 | LYDP01000000 | Italy | 2015 | 97.83 | 97.52 | 97.60 | |
| P30 | 26 | AVME01000000 | USA | 2009 | 97.63 | 97.50 | 97.57 | |
| CD200 | 28 | AVIF01000000 | USA | 2010 | 97.54 | 97.56 | 97.54 | |
| P38 | 29 | AVMH01000000 | USA | 2009 | 97.67 | 97.55 | 97.55 | |
| P7 | 34 | AVLQ01000000 | USA | NA | 97.83 | 97.53 | 97.62 | |
| P29 | 35 | AVMD01000000 | USA | 2008 | 97.64 | 97.44 | 97.62 | |
| E12 | 42 | CAMZ01000000 | France | NA | 97.94 | 97.93 | 97.84 | |
| DA00126 | 43 | AVIW01000000 | USA | 2010 | 97.72 | 97.78 | 97.77 | |
| CD45 | 44 | AVGY01000000 | USA | 2010 | 97.60 | 97.65 | 97.67 | |
| CD113 | 45 | AVHN01000000 | USA | 2010 | 97.76 | 97.64 | 97.64 | |
| ATCC43255 | 46 | CM000604 | Canada | 1935 | 97.77 | 97.78 | 97.73 | |
| CD22 | 48 | AVGP01000000 | USA | 2010 | 97.73 | 97.82 | 97.60 | |
| T17 | 49 | CAMT01000000 | France | NA | 97.98 | 97.74 | 97.76 | |
| P11 | 53 | AVLT01000000 | USA | NA | 97.89 | 97.82 | 97.80 | |
| CD630 | 54 | CP010905 | Switzerland | 1982 | 97.94 | 97.65 | 97.66 | |
| T19 | 55 | CANA01000000 | France | NA | 98.09 | 97.82 | 97.87 | |
| P46 | 58 | AVML01000018 | USA | 2009 | 97.76 | 97.77 | 97.71 | |
| E7 | 63 | CAMV01000000 | France | NA | 97.75 | 97.73 | 97.63 | |
| Y343 | 83 | AVLG01000000 | USA | 2010 | 97.81 | 97.89 | 97.77 | |
| CD105KSE11 | 107 | FJVK01000000 | UK | NA | 98.05 | 97.77 | 97.22 | |
| DA00216 | 110 | AVJX01000000 | USA | 2010 | 97.82 | 97.65 | 97.71 | |
| DA00211 | 139 | AVJU01000000 | USA | 2010 | 97.87 | 97.47 | 97.52 | |
| 2 | CD196 | 1 | NC_013315 | France | 1985 | 97.89 | 97.66 | 97.76 |
| E19 | 62 | CAMO01000000 | France | 2012 | 97.94 | 97.59 | 97.79 | |
| T23 | 67 | CAMN01000000 | France | NA | 97.79 | 97.71 | 97.84 | |
| E15 | 95 | CAMM01000000 | France | NA | 97.51 | 97.54 | 97.58 | |
| P59 | 123 | AVMQ01000000 | USA | 2009 | 97.57 | 97.58 | 97.59 | |
| 4 | M68 | 37 | NC_017175 | UK | 2006 | 97.59 | 97.58 | 97.48 |
| CF5 | 86 | FN665652 | Belgium | 1995 | 97.78 | 98.08 | 97.85 | |
| CD002 | 170 | CAMG01000000 | France | 2010 | 97.74 | 98.06 | 97.71 | |
| 6503 | 320 | ADEI01000000 | USA | 2009 | 97.76 | 97.84 | 97.38 | |
| 5 | M120 | 11 | NC_017174 | UK | 2007 | 96.31 | 96.33 | 96.32 |
| C-I | CD105KSO8 | 297 | FJUI01000000 | UK | NA | 91.24 | 91.43 | 90.45 |
NA, not available.
General features of the three genomes.
| Isolate | ST | Clean reads | Genome size (bp) | GC content | No. of contigs | No. of contigs ≥1,000 bp | No. of coding sequences | No. of tRNA genes |
|---|---|---|---|---|---|---|---|---|
| 103 | 5 | 5,703,662 | 4,060,612 | 28.56% | 243 | 69 | 3,777 | 43 |
| 106 | 285 | 5,698,996 | 4,080,409 | 28.74% | 303 | 61 | 3,764 | 42 |
| 133 | 5 | 5,696,658 | 4,181,327 | 28.38% | 276 | 75 | 4,000 | 43 |
Figure 2Phylogenetic tree based on SNPs of genome sequences of the three isolates, other 10 strains of Clade 3 and 44 representative strains of other clades.
ST is shown after the strain names and PCR ribotype is indicated in parentheses with NA representing not available. The three isolates are highlighted with boxes. Bar, 0.02 changes per nucleotide position.
Figure 3Phylogenetic tree based on SNP identified in the core genomes of Clade 3 strains.
Bar, 0.01 changes per nucleotide position. Bootstrap values >50% (based on 1,000 resamplings) are shown.
PaLoc of isolates 103, 106 and 133 compared with other Clade 3 strains and a representative strain of each Clade 1, 2 and 5.
| Clade | Strain | Tn | |||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| oxidoreductase | flavodoxin | orf | |||||||||||||
| 1 | CD630 | 12 (6NS) | 92 (48NS) | 5 + 31nt del + 1nt in (7NS + 10 del) | — | — | — | — | — | — | — | — | — | 116 (44NS) | 5 + 318nt del (1NS + 106 del) |
| 2 | CD196 | 12 (6NS) | 446 (175NS) | 3 + 31nt del + 1nt in (6NS + 10 del) | — | — | — | — | — | — | — | — | — | 89 + 63nt del (34NS + 21 del) | 4 + 36nt del (12 del) |
| 5 | M120 | 18 (8NS) | 173 (87NS) | 4 + 42nt in (2NS + 14 in) | — | — | — | — | — | — | — | — | — | 89SNP + 63nt del (44NS + 21 del) | 3 + 15nt del (3NS + 5 del) |
| 3 | ZJCDC-S82 | ID | 6 (3NS) | ID | ID | ID | 5 (3NS) | ID | ID | ID | ID | ID | 11 + 210nt in (5NS + 70 del) | 18 (11NS) | 36nt del (12 del) |
| CD69 | ID | 12 (7NS) | ID | 3 (4NS) | ID | 5 (3NS) | ID | ID | ID | ID | ID | ID | 153 (73NS) | ID | |
| VL-0391 | ID | 2 (2NS) | ID | 7 (4NS) | ID | 3 (3NS) | 1 (1NS) | ID | 1 (1NS) | 1 (1NS) | ID | 1 (0NS) | 11 + 36nt del (5NS + 12 del) | 36nt del (12 del) | |
| VL-0104 | ID | 2 (2NS) | ID | 8 (4NS) | ID | 3 (3NS) | 1 (1NS) | ID | 1 (1NS) | 1 (1NS) | ID | 1 (0NS) | 11 + 36nt del (5NS + 12 del) | 36nt del (12 del) | |
| L058 | 1 (1NS) | 5 (2NS) | ID | 5 (3NS) | ID | 3 (2NS) | ID | ID | 2 (2NS) | ID | ID | ID | 8 (4NS) | ID | |
| OX561 | 1 (1NS) | 6 (3NS) | ID | 9 (3NS) | 1 (1NS) | 2 (2NS) | ID | ID | 1 (1NS) | ID | ID | ID | 7 + 63nt del (5NS + 21 del) | ID | |
| OX1232 | 1 (1NS) | 6 (3NS) | ID | 8 (3NS) | ID | 2 (2NS) | ID | ID | 1 (1NS) | ID | ID | ID | 14 (10NS) | ID | |
| OX1485 | 1 (1NS) | 5 (3NS) | ID | 8 (3NS) | ID | 2 (2NS) | ID | ID | 1 (1NS) | ID | ID | ID | 37 (21NS) | ID | |
| OX2183 | ID | 3 (3NS) | ID | ID | 47nt in (9 in) | 5 (3NS) | ID | ID | ID | ID | ID | ID | 51 (23NS) | 36nt del (12 del) | |
| Q24 | ID | 8 (4NS) | ID | 7 (2NS) | 1 (1NS) | 2 + 108nt in (1NS + 36 in) | 2 (1NS) | 1 (1NS) | 1 (1NS) | 3 (0NS) | 2 (0NS) | ID | 62 (35NS) | 36nt del (12 del) | |
The results shown in the Table are that the PaLoc sequence of isolate 103, 106 and 133 queries those of strains listed in the Strain column. The number of SNP is shown in the column with the resulted amino acid substitutions and deletions being indicated in parentheses. ID, identical; in, insertion; del, deletion; -, absent; nt, nucleotide; NS, non-synonymous SNP. The nucleotide deletions and insertions are consecutive. atcdC of strain NAP08 has a nonsense mutation, resulting in 37 aa deletions of the C-terminal end compared to those of isolates 103, 106 and 133.
Predicted phages in the three isolates.
| Phage region | Region length | Completeness | Total no. of orf | Mostly matched Phage | GC % |
|---|---|---|---|---|---|
| Isolate103 | |||||
| 1 | 27.2 kb | Incomplete | 30 | phiCDHM19_NC_028996(11) | 27.9 |
| 2 | 18.3 kb | Incomplete | 25 | phiC2_NC_009231(6) | 28.3 |
| 3 | 24.7 kb | Questionable | 27 | phiCD505_NC_028764(11) | 28.3 |
| 4 | 17.4 kb | Incomplete | 22 | phiCD27_NC_011398(6) | 27.2 |
| 5 | 15.1 kb | Incomplete | 10 | phiCD505_NC_028764(2) | 28.8 |
| 6 | 7.3 kb | Incomplete | 12 | phiCDMH11_NC_029001(4) | 28.3 |
| 7 | 8 kb | Incomplete | 9 | vB_Bans_Tsamsa_NC_023007(2) | 29.3 |
| 8 | 19.2 kb | Incomplete | 9 | c_st__NC_007581(6) | 27.4 |
| 9 | 23.6 kb | Incomplete | 32 | phiMMP02_NC_019421(10) | 27.7 |
| 10 | 13.8 kb | Incomplete | 21 | CDMH1_NC_024144(8) | 27.5 |
| 11 | 36.1 kb | Intact | 51 | phiMMP03_NC_028959(23) | 32.1 |
| 12 | 39.4 kb | Intact | 61 | CDMH1_NC_024144(16) | 33.8 |
| Isolate106 | |||||
| 1 | 27.2 kb | Incomplete | 30 | phiCDHM19_NC_028996(11) | 28.04 |
| 2 | 26.5 kb | Incomplete | 34 | phiMMP03_NC_028959(11) | 29.86 |
| 3 | 8.3 kb | Incomplete | 9 | vB_Bans_Tsamsa_NC_023007(2) | 29.19 |
| 4 | 21.9 kb | Incomplete | 9 | c_st__NC_007581(3) | 26.62 |
| 5 | 18.5 kb | Incomplete | 22 | CDMH1_NC_024144(12) | 30.13 |
| 6 | 15.1 kb | Incomplete | 11 | phiCD505_NC_028764(2) | 28.86 |
| 7 | 12.5 kb | Incomplete | 21 | phiMMP02_NC_019421(15) | 29.12 |
| 8 | 25.1 kb | Incomplete | 13 | phiARI0746_NC_031907(3) | 39.23 |
| Isolate133 | |||||
| 1 | 27.2 kb | Incomplete | 29 | phiCDHM19_NC_028996(11) | 28 |
| 2 | 32.8 kb | Incomplete | 27 | phiCD27_NC_011398(6) | 28 |
| 3 | 10 kb | Incomplete | 15 | phiMMP02_NC_019421(7) | 28.8 |
| 4 | 22 kb | Incomplete | 10 | c_st__NC_007581(6) | 26.6 |
| 5 | 55.9 kb | Intact | 59 | phiCD6356_NC_015262(40) | 28.4 |
| 6 | 21.2 kb | Incomplete | 10 | phiCD505_NC_028764(2) | 28.8 |
| 7 | 18.1 kb | Incomplete | 24 | phiMMP02_NC_019421(14) | 28.5 |
| 8 | 166.8 kb | Intact | 231 | phiCDHM19_NC_028996(46) | 30 |
aThe phage (phage name_GenBank accession no.) with the highest number of orf most similar to those in the region being indicated in parentheses.
Unique genes of Clade 3 and isolates 103, 106 and 133.
| Locus | Producta | Sizeb | Location | Query Coverage |
|---|---|---|---|---|
| Clade 3c | ||||
| 103_01388 | Transcriptional regulator | 184 | Contig 5c | 100% |
| 103_01956 | Methylthioadenosine/S-adenosylhomocysteine nucleosidase | 181 | Contig 10c | 97% |
| 103_02913d | Hypothetical protein | 133 | Contig 19c | |
| 103_02915d | Putative flavodoxin | 222 | Contig 19c | 100% |
| 103_02916d | Putative oxidoreductase | 286 | Contig 19c | 100% |
| 103_02917d | MerR Transcriptional regulator | 135 | Contig 19c | 100% |
| 103_03125 | Hypothetical protein | 255 | Contig 22c | |
| 103_03128 | Hypothetical protein | 200 | Contig 22c | |
| 103_03130 | Hypothetical protein | 198 | Contig 22c | |
| 103_03132 | Hypothetical protein | 233 | Contig 22c | |
| 103_03133 | Hypothetical protein | 214 | Contig 22c | |
| 103_03507 | Hypothetical protein | 372 | Contig 32c | |
| 103_03634 | Hypothetical protein | 110 | Contig 34c | |
| 103_03635 | MBL fold metallo-hydrolase | 253 | Contig 34c | 100% |
| 103_03735 | Hypothetical protein | 282 | Contig 63 | |
| Isolate 103 | ||||
| 103_01535 | Cobalamin-binding protein | 220 | Contig 7 | 95% |
| 103_01536 | Uroporphyrinogen decarboxylase | 288 | Contig 7 | 75% |
| 103_01537 | Iron-sulfur protein | 574 | Contig 7 | |
| 103_01538 | Methylcobamide–CoM methyltransferase | 344 | Contig 7 | 97% |
| 103_01539 | Uroporphyrinogen-III decarboxylase | 362 | Contig 7 | 98% |
| 103_01540 | Membrane transport protein | 303 | Contig 7 | |
| 103_01541 | Metal ABC transporter substrate-binding protein | 273 | Contig 7 | 99% |
| 103_01542 | Methionine ABC transporter permease | 222 | Contig 7 | 100% |
| 103_01543 | ABC transporter ATP-binding protein | 335 | Contig 7 | 97% |
| 103_01544 | IS | 318 | Contig 7 | 99% |
| Isolate 106 | ||||
| 106_00822 | Multidrug-efflux transporter 1 regulator | 275 | Contig 4 | |
| 106_00823 | Cytidylate kinase | 204 | Contig 4 | 100% |
| 106_00824 | MATE family efflux transporter | 442 | Contig 4 | 100% |
| 106_00827 | RNA polymerase sigma factor FliA | 138 | Contig 4 | 100% |
| 106_01206 | Antirestriction protein ArdA | 167 | Contig 6 | 98% |
| 106_01208 | Hypothetical protein | 188 | Contig 6 | |
| 106_01209 | Nucleotidyltransferase, polymerase | 294 | Contig 6 | 98% |
| 106_01213 | Hypothetical protein | 234 | Contig 6 | |
| 106_02320 | Endonuclease | 367 | Contig 14 | 99% |
| 106_03618 | Ribonuclease/integrase | 407 | Contig 35 | |
| 106_03620 | Transposase | 252 | Contig 35 | 94% |
| 106_03734 | Membrane protein | 716 | Contig 48 | 98% |
| Isolate 133 | ||||
| 133_02100 | GIY-YIG catalytic domain-containing protein | 401 | Contig 11 | |
| 133_02107 | Hypothetical protein | 120 | Contig 11 | |
| 133_02109 | Hypothetical protein | 318 | Contig 11 | |
| 133_02112 | Hypothetical protein | 102 | Contig 11 | |
| 133_02117 | HaeIII-like C-5 cytosine methyltransferase | 340 | Contig 11 | 97% |
| 133_02118 | BspRI-like C-5 cytosine methyltransferase | 350 | Contig 11 | 96% |
| 133_03129 | Flp pilus assembly protein CpaB | 261 | Contig 24 | 91% |
| 133_03131 | ATPase involved in chromosome partitioning/Flp pilus assembly protein, ATPase CpaE | 261 | Contig 24 | 99% |
| 133_03131 | Type II secretion system protein E/Flp pilus assembly protein, ATPase CpaF | 256 | Contig 24 | |
| 133_03133 | Hypothetical protein | 287 | Contig 24 | |
| 133_03134 | Hypothetical protein | 138 | Contig 24 | |
| 133_03137 | Hypothetical protein | 176 | Contig 24 | |
| 133_03174 | Hypothetical protein | 177 | Contig 24 | |
| 133_03175 | Transcriptional regulator | 116 | Contig 24 | 95% |
| 133_03177 | radC-like JAB domain protein | 127 | Contig 24 | |
| 133_03181 | Chromosome/plasmid-partitioning protein ParA | 276 | Contig 24 | 97% |
| 133_03182 | Chromosome/plasmid-partitioning protein ParB | 450 | Contig 24 | |
| 133_03183 | Transcriptional regulator | 194 | Contig 24 | |
| 133_03185 | Adenosine monophosphate/protein transferase VbhT | 112 | Contig 24 | 100% |
aAnnotation was curated using Protein BLAST (https://blast.ncbi.nlm.nih.gov) against non-redundant protein sequences in GenBank. bSize, the number of amino acids. cLoci and locations refer to those of isolate 103. dThese genes are accessory genes of Tn6218.