| Literature DB >> 34141631 |
Helena M B Seth-Smith1,2,3, Michael Biggel4, Tim Roloff1,2,3, Vladimira Hinic1, Thomas Bodmer5, Martin Risch5, Carlo Casanova6, Andreas Widmer7, Rami Sommerstein8,9, Jonas Marschall8, Sarah Tschudin-Sutter7, Adrian Egli1,2.
Abstract
Clostridioides difficile causes nosocomial outbreaks which can lead to severe and even life-threatening colitis. Rapid molecular diagnostic tests allow the identification of toxin-producing, potentially hypervirulent strains, which is critical for patient management and infection control. PCR-ribotyping has been used for decades as the reference standard to investigate transmission in suspected outbreaks. However, the introduction of whole genome sequencing (WGS) for molecular epidemiology provides a realistic alternative to PCR-ribotyping. In this transition phase it is crucial to understand the strengths and weaknesses of the two technologies, and to assess their correlation. We aimed to investigate ribotype prediction from WGS data, and options for analysis at different levels of analytical granularity. Ribotypes cannot be directly determined from short read Illumina sequence data as the rRNA operons including the ribotype-defining ISR fragments collapse in genome assemblies, and comparison with traditional PCR-ribotyping results becomes impossible. Ribotype extraction from long read Oxford nanopore data also requires optimization. We have compared WGS-based typing with PCR-ribotyping in nearly 300 clinical and environmental isolates from Switzerland, and in addition from the Enterobase database (n=1778). Our results show that while multi-locus sequence type (MLST) often correlates with a specific ribotype, the agreement is not complete, and for some ribotypes the resolution is insufficient. Using core genome MLST (cgMLST) analysis, there is an improved resolution and ribotypes can often be predicted within clusters, using cutoffs of 30-50 allele differences. The exceptions are ribotypes within known ribotype complexes such as RT078/RT106, where the genome differences in cgMLST do not reflect the ribotype segregation. We show that different ribotype clusters display different degrees of diversity, which could be important for the definition of ribotype cluster specific cutoffs. WGS-based analysis offers the ultimate resolution to the SNP level, enabling exploration of patient-to-patient transmission. PCR-ribotyping does not sufficiently discriminate to prove nosocomial transmission with certainty. We discuss the associated challenges and opportunities in a switch to WGS from conventional ribotyping for C. difficile.Entities:
Keywords: Clostridioides difficile; cgMLST; core genome; molecular epidemiology; ribotyping; single nucleotide polymorphism; whole genome sequencing
Year: 2021 PMID: 34141631 PMCID: PMC8204696 DOI: 10.3389/fcimb.2021.681518
Source DB: PubMed Journal: Front Cell Infect Microbiol ISSN: 2235-2988 Impact factor: 5.293
Figure 1Basel sequenced isolates (n=294): cgMLST colored by ribotype. Nodes are colored by ribotype and size corresponds to the number of isolates at that node. Distances are shown on edges, other than those ≤6, which are shown with the grey shadowing. The seven genomes with blue outlines are those that could not be PCR-ribotyped. The four genomes with red outlines were later found to have questionable ribotype assignments.
Figure 2The Basel collection and selected Enterobase genomes displayed in Ridom Seqsphere+ (n=2094). (A) Nodes colored by ribotype. (B) Nodes colored by ST. Many overlaps of ribotype and ST can be seen, also cases where multiple ribotypes are found within the same cluster (RT106/500) described by one ST (42), or where diverse ribotypes (015) are split into multiple clusters corresponding to several STs (10, 44 and 160).
Figure 3Predicting ribotype from cgMLST through comparison to genomes with known ribotype. (A) RT012 cluster, with two samples with unknown ribotype (max 7 alleles from other isolates) and closest related ribotypes. (B) RT002 cluster, with samples with unknown ribotype (max 16 alleles from other isolates) and closest related ribotype. (C) RT078/126 cluster, with samples with unknown ribotype and closest related ribotype (max 9 alleles from other isolates).