| Literature DB >> 24482682 |
Hedwig Kurka1, Armin Ehrenreich1, Wolfgang Ludwig1, Marc Monot2, Maja Rupnik3, Frederic Barbut4, Alexander Indra5, Bruno Dupuy2, Wolfgang Liebl1.
Abstract
PCR-ribotyping is a broadly used method for the classification of isolates of Clostridium difficile, an emerging intestinal pathogen, causing infections with increased disease severity and incidence in several European and North American countries. We have now carried out clustering analysis with selected genes of numerous C. difficile strains as well as gene content comparisons of their genomes in order to broaden our view of the relatedness of strains assigned to different ribotypes. We analyzed the genomic content of 48 C. difficile strains representing 21 different ribotypes. The calculation of distance matrix-based dendrograms using the neighbor joining method for 14 conserved genes (standard phylogenetic marker genes) from the genomes of the C. difficile strains demonstrated that the genes from strains with the same ribotype generally clustered together. Further, certain ribotypes always clustered together and formed ribotype groups, i.e. ribotypes 078, 033 and 126, as well as ribotypes 002 and 017, indicating their relatedness. Comparisons of the gene contents of the genomes of ribotypes that clustered according to the conserved gene analysis revealed that the number of common genes of the ribotypes belonging to each of these three ribotype groups were very similar for the 078/033/126 group (at most 69 specific genes between the different strains with the same ribotype) but less similar for the 002/017 group (86 genes difference). It appears that the ribotype is indicative not only of a specific pattern of the amplified 16S-23S rRNA intergenic spacer but also reflects specific differences in the nucleotide sequences of the conserved genes studied here. It can be anticipated that the sequence deviations of more genes of C. difficile strains are correlated with their PCR-ribotype. In conclusion, the results of this study corroborate and extend the concept of clonal C. difficile lineages, which correlate with ribotypes affiliation.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24482682 PMCID: PMC3902958 DOI: 10.1371/journal.pone.0086535
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Listing of accesion number, host and location of isolation of the 27 C. difficile strains sequenced for this study.
| Strain | Accession number | Host | Location |
| E1 | CAMD00000000 (WGS_scaffold) CAMD01000001-CAMD01000212 (contigs) | Human | Austria |
| E10 | CAME00000000 (WGS_scaffold) CAME01000001-CAME01000293 (contigs) | Horse | Slovenia |
| E12 | CAMZ00000000 (WGS_scaffold) CAMZ01000001-CAMZ01000353 (contigs) | Human | UK |
| E13 | CAMF00000000 (WGS_scaffold) CAMF01000001-CAMF01000262 (contigs) | Human | Ireland |
| E14 | CAMS00000000 (WGS_scaffold) CAMS01000001-CAMS01000319 (contigs) | Human | Hungary |
| E15 | CAMM00000000 (WGS_scaffold) CAMM01000001-CAMM01000444 (contigs) | Human | France |
| E16 | CAMH00000000 (WGS_scaffold) CAMH01000001-CAMH01000111 (contigs) | Human | France |
| E19 | CAMO00000000 (WGS_scaffold) CAMO01000001-CAMO01000325 (contigs) | Human | UK |
| E23 | CAMY00000000 (WGS_scaffold) CAMY01000001-CAMY01000264 (contigs) | Human | France |
| E24 | CAMP00000000 (WGS_scaffold) CAMP01000001-CAMP01000063 (contigs) | Human | France |
| E25 | CAMJ00000000 (WGS_scaffold) CAMJ01000001-CAMJ01000441 (contigs) | Human | France |
| E28 | CAMX00000000 (WGS_scaffold) CAMX01000001-CAMX01000274 (contigs) | Human | France |
| E7 | CAMV00000000 (WGS_scaffold) CAMV01000001-CAMV01000409 (contigs) | Human | Austria |
| E9 | CAMU00000000 (WGS_scaffold) CAMU01000001-CAMU01000373 (contigs) | Horse | Canada |
| T10 | CANB00000000 (WGS_scaffold) CANB01000001-CANB01000460 (contigs) | Human | Germany |
| T11 | CAML00000000 (WGS_scaffold) CAML01000001-CAML01000409 (contigs) | Human | France |
| T14 | CANC00000000 (WGS_scaffold) CANC01000001-CANC01000491(contigs) | Human | Ireland |
| T15 | CAMK00000000 (WGS_scaffold) CAMK01000001-CAMK01000631 (contigs) | Human | Belgium |
| T17 | CAMT00000000 (WGS_scaffold) CAMT01000001-CAMT01000449 (contigs) | Human | Hungary |
| T19 | CANA00000000 (WGS_scaffold) CANA01000001-CANA01000247 (contigs) | Human | France |
| T20 | CAMC00000000 (WGS_scaffold) CAMC01000001-CAMC01000210 (contigs) | Human | Ireland |
| T22 | CAMI00000000 (WGS_scaffold) CAMI01000001-CAMI01000332 (contigs) | Human | Hungary |
| T23 | CAMN00000000 (WGS_scaffold) CAMN01000001-CAMN01000293 (contigs) | Human | Italy |
| T42 | CAMQ00000000 (WGS_scaffold) CAMQ01000001-CAMQ01000096 (contigs) | Human | Italy |
| T3 | CAMW00000000 (WGS_scaffold) CAMW01000001-CAMW01000275 (contigs) | Human | Italy |
| T5 | CAMB00000000 (WGS_scaffold) CAMB01000001-CAMB01000227 (contigs) | Human | Italy |
| T6 | CAMR00000000 (WGS_scaffold) CAMR01000001-CAMR01000373 (contigs) | Human | Hungary |
Ribotype, toxinotype and accession number of 21 C. difficile strains with previously reported genome sequences used in this study.
| Strain | Accession number | PCR-ribotype | Toxinotype |
| BI1* | FN668941.1 (chromosome) FN668943.1 (chromosome) FN668942.1 (plasmid) | 027 | III |
| BI9* | FN668944 (chromosome) | 001 | 0 |
| 2007855* | FN665654.1 (chromosome) | 027 | III |
| 630 | AM180355.1 (chromosome) AM180356.1 (plasmid) | 012 | 0 |
| CD196 | FN538970.1 | 027 | III |
| CF5* | FN665652.1 | 017 | VIII |
| M120* | FN665653.1 | 078 | V |
| M68* | FN668375.1 | 017 | VIII |
| R20291 | FN545816.1 | 027 | III |
| ATCC 43255 | CM000604.1 | 087 | 0 |
| CIP-107932 | CM000659 (WGS_scaffold) ABKK02000001-ABKK02000055 (contigs) | 027 | III |
| QCD-23m63 | CM000660 (WGS_scaffold) ABKL02000001-ABKL02000061 (contigs) | 078 | V |
| QCD-32g58 | CM000604 (WGS_scaffold) NZ_AAML04000001-NZ_AAML04000016 (contigs) | 027 | III |
| QCD-37x79 | CM000658 (WGS_scaffold) NZ_ABHG02000001.1 - NZ_ABHG02000031.1 (contigs) | 027 | III |
| QCD-63q42 | CM000637 (WGS_scaffold) ABHD02000001-ABHD02000060 (contigs) | 001 | 0 |
| QCD-66c26 | CM000441 (WGS_scaffold) ABFD02000001-ABFD02000031 (contigs) | 027 | III |
| QCD-76w55 | CM000661 (WGS-scaffold) ABHE02000001-ABHE02000066 (contigs) | 027 | III |
| QCD-97b34 | CM000657 (WGS_scaffold) ABHF02000001-ABHF02000060 (contigs) | 027 | III |
| NAP07 | GG770744-GG770776 (WGS_scaffold) ADVM01000001-ADVM01000100 (contigs) | 078 | V |
| NAP08 | GG770710-GG770733 (WGS_scaffold) ADNX01000001-ADNX01000111 (contigs) | 078 | V |
| CD002 | CAMG00000000 (WGS_scaffold) CAMG01000001-CAMG01000071 (contigs) | 002 | 0 |
Strains marked with an asterisk are not annotated. The ribotypes of these strains were calculated using their GenBank data.
Phenotypic description of the 27 C. difficile strains sequenced for this study.
| Strain | PCR-ribotype (agarose) | PCR-ribotype (WEBRIBO) | Toxinotype | Fluoroquinolone resistance |
| E1 | 126 | 126 | V | S |
| E10 | 033 | 033 | XIa | S |
| E12 | 106 | 106 | 0 | R |
| E13 | 017 | 017 | VIII | R |
| E14 | 014 | 014/0 | 0 | S |
| E15 | 075 | 075 | III | S |
| E16 | 001* | 577 | 0 | S |
| E19 | 036 | 578 | X | S |
| E23 | 001/072 | 241 | 0 | S |
| E24 | 020 | 020 | 0 | S |
| E25 | 005 | 005 | 0 | S |
| E28 | 012 | 012 | 0 | S |
| E7 | 053 | 053 | 0 | R |
| E9 | 009 | 009 | NA | S |
| T10 | 001/072 | 001 | 0 | R |
| T11 | 075 | 075 | III | S |
| T14 | 106 | 106 | 0 | R |
| T15 | 005 | 005 | 0 | I |
| T17 | 025 | 665 | 0 | R |
| T19 | 057 | 237 | 0 | I |
| T20 | 078 | 078 | V | S |
| T22 | No data | No data | 0 | ND |
| T23 | 019 | 019 | IX | S |
| T42 | No data | No data | No data | ND |
| T3 | 012 | 012 | 0 | I |
| T5 | 126 | 126 | V | R |
| T6 | 014 | 014/0 | 0 | I |
The strain names, PCR-ribotypes, toxinotypes, and fluoroquinolone resistance of 27 C. difficile strains used and sequenced for this study are listed. The PCR-ribotype was determined agarose-based [6] and WEBRIBO-based [35]. PCR-ribotype 001* is similar but not identical to PCR ribotype 001/072. The toxinotype of strain E9 could not be determined because it has no ToxinA and ToxinB genes, but genes for the binary toxins.
Characteristics of the 14 conserved genes used for the computation of the distance matrices.
| Conserved gene | Locus tag | Gene ID | Length | Maximal Hamming distance |
| ATPase Alpha (atpA) | CD630_34700 | 4914804 | 1503 | 1.33 |
| ATPase Alpha V-Type (atpA) | CD630_29560 | 4913755 | 1779 | 1.35 |
| ATPase Beta (atpD) | CD630_34680 | 4914802 | 1395 | 1.15 |
| ATPase Beta V-Type (atpB) | CD630_29550 | 4913754 | 1374 | 1.97 |
| RNA polymerase A (rpoA) | CD630_00980 | 4913146 | 948 | 1.59 |
| RNA polymerase B (rpoB) | CD630_00660 | 4914216 | 3717 | 2.21 |
| RNA polymerase C (rpoC) | CD630_00670 | 4914217 | 3486 | 2.84 |
| Gyrase A (gyrA) | CD630_00060 | 4915790 | 2427 | 2.02 |
| Gyrase B (gyrB) | CD630_00050 | 4915789 | 1902 | 2.79 |
| Initiation factor | CD630_13090 | 4914468 | 1941 | 1.2 |
| Recombinase (recap) | CD630_13280 | 4914615 | 1047 | 2.29 |
| Heatshock protein 70 (dnaK) | CD630_24610 | 4916451 | 1848 | 1.25 |
| Heatshock protein 60 (groL) | CD630_01940 | 4915463 | 1629 | 2.71 |
| Elongation factor G (fusA) | CD630_00700 | 4914220 | 2067 | 1.21 |
The locus tag, length values and gene IDs listed in the table were taken from the genome data of C. difficile strain CD630. The maximal hamming distance is a measure for the dissimilarity of the 14 conserved genes in all analysed 48 C. difficile strains.
Figure 1DNA gyrase A gene-based dendrogram.
Neighbor joining dendrogram reflecting the similarity for 48 C. difficile strains based on the gene for DNA gyrase A. The distance matrix was computed using the Hamming distance with the DNA gyrase A genes from the 48 C. difficile isolates containing this gene. The strains always cluster together according to their PCR-ribotype. The PCR-ribotype is indicated in brackets. The strains with ribotype 027 sub-cluster into two different groups.
Figure 2Toxin B gene-based dendrogram.
Neighbor joining dendrogram reflecting the similarity for 46 C. difficile strains based on the gene for Toxin B. The distance matrix was computed using the Hamming distance with the Toxin B genes from the 46 C. difficile isolates containing this gene. The strains always cluster together according to their PCR-ribotype. The PCR-ribotype is indicated in brackets.
Figure 3Virulence associated protein e gene-based dendrogram.
Neighbor joining dendrogram reflecting the similarity for 19 C. difficile strains based on the gene for virulence associated protein E. The distance matrix was computed using the Hamming distance with the virulence associated protein E genes from the 19 C. difficile isolates containing this gene. The strains always cluster together according to their PCR-ribotype. The PCR-ribotype is indicated in brackets. The strains with ribotype 027 sub-cluster into two different groups.