| Literature DB >> 34279213 |
Casper Jamin1, Sien De Koster2, Stefanie van Koeveringe3, Dieter De Coninck4, Klaas Mensaert4, Katrien De Bruyne4, Natascha Perales Selva3, Christine Lammens2, Herman Goossens2,3, Christian Hoebe1,5, Paul Savelkoul1, Lieke van Alphen1.
Abstract
Whole-genome sequencing (WGS) is becoming the de facto standard for bacterial typing and outbreak surveillance of resistant bacterial pathogens. However, interoperability for WGS of bacterial outbreaks is poorly understood. We hypothesized that harmonization of WGS for outbreak surveillance is achievable through the use of identical protocols for both data generation and data analysis. A set of 30 bacterial isolates, comprising of various species belonging to the Enterobacteriaceae family and Enterococcus genera, were selected and sequenced using the same protocol on the Illumina MiSeq platform in each individual centre. All generated sequencing data were analysed by one centre using BioNumerics (6.7.3) for (i) genotyping origin of replications and antimicrobial resistance genes, (ii) core-genome multi-locus sequence typing (cgMLST) for Escherichia coli and Klebsiella pneumoniae and whole-genome multi-locus sequencing typing (wgMLST) for all species. Additionally, a split k-mer analysis was performed to determine the number of SNPs between samples. A precision of 99.0% and an accuracy of 99.2% was achieved for genotyping. Based on cgMLST, a discrepant allele was called only in 2/27 and 3/15 comparisons between two genomes, for E. coli and K. pneumoniae, respectively. Based on wgMLST, the number of discrepant alleles ranged from 0 to 7 (average 1.6). For SNPs, this ranged from 0 to 11 SNPs (average 3.4). Furthermore, we demonstrate that using different de novo assemblers to analyse the same dataset introduces up to 150 SNPs, which surpasses most thresholds for bacterial outbreaks. This shows the importance of harmonization of data-processing surveillance of bacterial outbreaks. In summary, multi-centre WGS for bacterial surveillance is achievable, but only if protocols are harmonized.Entities:
Keywords: antimicrobial resistance; bacterial typing; harmonisation; nosocomial pathogens; ring-trial; whole genome sequencing
Mesh:
Substances:
Year: 2021 PMID: 34279213 PMCID: PMC8477410 DOI: 10.1099/mgen.0.000567
Source DB: PubMed Journal: Microb Genom ISSN: 2057-5858
Metadata of all isolates used in this study
|
Name |
Species |
Origin |
Study |
County |
Accession no. centre 1 |
Accession no. centre 2 |
Accession no. centre 3 |
|---|---|---|---|---|---|---|---|
|
|
|
Hospital |
i-4-1-Health |
Netherlands |
ERS5219870 |
ERS5219871 |
ERS5219872 |
|
|
|
Long-term healthcare facility |
i-4-1-Health |
Netherlands |
ERS5219873 |
ERS5219874 |
ERS5219875 |
|
|
|
Long-term healthcare facility |
i-4-1-Health |
Netherlands |
ERS5219876 |
ERS5219877 |
ERS5219878 |
|
|
|
Hospital |
i-4-1-Health |
Netherlands |
ERS5219879 |
ERS5219880 |
ERS5219881 |
|
|
|
Long-term healthcare facility |
i-4-1-Health |
Netherlands |
ERS5219882 |
ERS5219883 |
ERS5219884 |
|
|
|
Hospital |
i-4-1-Health |
Netherlands |
ERS5219885 |
ERS5219886 |
ERS5219887 |
|
|
|
Hospital |
SoM |
Netherlands |
ERS5219888 |
ERS5219889 |
ERS5219890 |
|
|
|
Hospital |
SoM |
Netherlands |
ERS5219891 |
ERS5219892 |
ERS5219893 |
|
|
|
Hospital |
SoM |
Netherlands |
ERS5219828 |
ERS5219829 |
ERS5219830 |
|
|
|
Hospital |
SoM |
Netherlands |
ERS5219831 |
ERS5219832 |
ERS5219833 |
|
|
|
Hospital |
i-4-1-health |
Netherlands |
ERS5219834 |
ERS5219835 |
ERS5219836 |
|
|
|
Hospital |
i-4-1-Health |
Netherlands |
ERS5219837 |
ERS5219838 |
ERS5219839 |
|
|
|
Broiler |
i-4-1-Health |
Netherlands |
ERS5219840 |
ERS5219841 |
ERS5219842 |
|
|
|
Weaned pig |
i-4-1-Health |
Netherlands |
ERS5219843 |
ERS5219844 |
ERS5219845 |
|
|
|
Long-term healthcare facility |
i-4-1-Health |
Netherlands |
ERS5219846 |
ERS5219847 |
ERS5219848 |
|
|
|
Broiler |
i-4-1-Health |
Netherlands |
ERS5219849 |
ERS5219850 |
ERS5219851 |
|
|
|
Hospital |
i-4-1-Health |
Netherlands |
ERS5219852 |
ERS5219853 |
ERS5219854 |
|
|
|
Hospital |
Belgium |
ERS5219894 |
ERS5219895 |
ERS5219896 | |
|
|
|
Hospital |
Belgium |
ERS5219897 |
ERS5219898 |
ERS5219899 | |
|
|
|
Hospital |
Belgium |
ERS5219900 |
ERS5219901 |
ERS5219902 | |
|
|
|
Hospital |
Belgium |
ERS5219903 |
ERS5219904 |
ERS5219905 | |
|
|
|
Hospital |
i-4-1-Health |
Belgium |
ERS5219912 |
ERS5219913 |
ERS5219914 |
|
|
|
Hospital |
i-4-1-Health |
Belgium |
ERS5219915 |
ERS5219916 |
ERS5219917 |
|
|
|
Hospital |
i-4-1-Health |
Belgium |
ERS5219906 |
ERS5219907 |
ERS5219908 |
|
|
|
Hospital |
i-4-1-Health |
Netherlands |
ERS5219909 |
ERS5219910 |
ERS5219911 |
|
|
|
Hospital |
i-4-1-Health |
Belgium |
ERS5219855 |
ERS5219856 |
ERS5219857 |
|
|
|
Long-term healthcare facility |
i-4-1-Health |
Netherlands |
ERS5219858 |
ERS5219859 |
ERS5219860 |
|
|
|
Long-term healthcare facility |
i-4-1-Health |
Netherlands |
ERS5219861 |
ERS5219862 |
ERS5219863 |
|
|
|
Hospital |
SoM |
Netherlands |
ERS5219864 |
ERS5219865 |
ERS5219866 |
|
|
|
Hospital |
SoM |
Netherlands |
ERS5219867 |
ERS5219868 |
ERS5219869 |
Fig. 1.Distribution of various quality parameters pre and post de novo assembly. Subplots (a–e) show boxplots with interquartile (IQ) range. Whiskers range up to 1.5 times the IQ range. All single datapoints are represented as single dots. Subplots (f-h) show scatterplots of relations between two quality metrics.
Fig. 2.(a) Heatmap of the number of genotype calls for various origins of replication among the isolates subjected to WGS. Genotype calls per locus was summed up for each centre’s isolate if this locus was detected in their dataset. (b) Heatmap of the number of genotype calls for various AMR genes, among the isolates subjected to WGS. Genotype calls per locus was summed up for each centre’s isolate if this locus was detected in their dataset.
Fig. 3.Boxplot of the allelic distance based on wgMLST between the triplicates that were selected for WGS. Boxes show IQ range and whiskers range up to 1.5 times the IQ range. All single pairwise observations were plotted as dots.
Fig. 4.Boxplots of the inter- and intra-assembly difference in de novo assemblies based on SNPs, using SKA for the dataset. De novo assembly method compared to is indicated above each box. (a) Assembly free, (b) SKESA, (c) SPAdes and (d) Megahit. Boxes show IQ range. Whiskers range up to 1.5 times the IQ range. All single pairwise observations were plotted as dots.
Fig. 5.Boxplot of the SNP distance between the triplicates that were selected for WGS. Boxes show IQ range and whiskers range up to 1.5 times the IQ range. All single pairwise observations were plotted as dots. (a) SNP distances using the raw reads as input for SKA. (b) SNP distances based on the de novo assembly using SKESA.