| Literature DB >> 32048983 |
Ronan M Doyle1,2, Denise M O'Sullivan3, Sean D Aller4, Sebastian Bruchmann5, Taane Clark6, Andreu Coello Pelegrin7,8, Martin Cormican9, Ernest Diez Benavente6, Matthew J Ellington10, Elaine McGrath11, Yair Motro12, Thi Phuong Thuy Nguyen13, Jody Phelan6, Liam P Shaw14, Richard A Stabler15, Alex van Belkum8, Lucy van Dorp16, Neil Woodford10, Jacob Moran-Gilad12, Jim F Huggett17,3, Kathryn A Harris2.
Abstract
Antimicrobial resistance (AMR) poses a threat to public health. Clinical microbiology laboratories typically rely on culturing bacteria for antimicrobial-susceptibility testing (AST). As the implementation costs and technical barriers fall, whole-genome sequencing (WGS) has emerged as a 'one-stop' test for epidemiological and predictive AST results. Few published comparisons exist for the myriad analytical pipelines used for predicting AMR. To address this, we performed an inter-laboratory study providing sets of participating researchers with identical short-read WGS data from clinical isolates, allowing us to assess the reproducibility of the bioinformatic prediction of AMR between participants, and identify problem cases and factors that lead to discordant results. We produced ten WGS datasets of varying quality from cultured carbapenem-resistant organisms obtained from clinical samples sequenced on either an Illumina NextSeq or HiSeq instrument. Nine participating teams ('participants') were provided these sequence data without any other contextual information. Each participant used their choice of pipeline to determine the species, the presence of resistance-associated genes, and to predict susceptibility or resistance to amikacin, gentamicin, ciprofloxacin and cefotaxime. We found participants predicted different numbers of AMR-associated genes and different gene variants from the same clinical samples. The quality of the sequence data, choice of bioinformatic pipeline and interpretation of the results all contributed to discordance between participants. Although much of the inaccurate gene variant annotation did not affect genotypic resistance predictions, we observed low specificity when compared to phenotypic AST results, but this improved in samples with higher read depths. Had the results been used to predict AST and guide treatment, a different antibiotic would have been recommended for each isolate by at least one participant. These challenges, at the final analytical stage of using WGS to predict AMR, suggest the need for refinements when using this technology in clinical settings. Comprehensive public resistance sequence databases, full recommendations on sequence data quality and standardization in the comparisons between genotype and resistance phenotypes will all play a fundamental role in the successful implementation of AST prediction using WGS in clinical microbiology laboratories.Entities:
Keywords: antimicrobial resistance; antimicrobial-susceptibility testing; bioinformatics; carbapenem resistance; whole-genome sequencing
Mesh:
Substances:
Year: 2020 PMID: 32048983 PMCID: PMC7067211 DOI: 10.1099/mgen.0.000335
Source DB: PubMed Journal: Microb Genom ISSN: 2057-5858
Inter-laboratory study sample characteristics
|
Study ID |
Isolate species |
Sequencing method |
Carbapenemase gene |
Median depth of coverage |
Comment |
|---|---|---|---|---|---|
|
A-1 |
|
NEBNext Ultra II+NextSeq 150 bp PE |
OXA-48-like |
190.2× |
Exact duplicate of A-2 |
|
A-2 |
|
NEBNext Ultra II+NextSeq 150 bp PE |
OXA-48-like |
190.2× |
Exact duplicate of A-1 |
|
B-1 |
|
NEBNext Ultra II+NextSeq 150 bp PE |
OXA-48-like |
1.4× |
Very low coverage duplicate of B-2 |
|
B-2 |
|
NEBNext Ultra II+NextSeq 150 bp PE |
OXA-48-like |
142.9× |
High coverage duplicate of B-1 |
|
C-1 |
|
Nextera DNA +HiSeq 100 bp PE |
OXA-48-like |
37.4× |
Same original isolate as C-2 |
|
C-2 |
|
NEBNext Ultra II+NextSeq 150 bp PE |
OXA-48-like |
156.4× |
Same original isolate as C-1 |
|
D |
|
NEBNext Ultra II+NextSeq 150 bp PE |
NDM |
83.5× |
|
|
E |
|
Nextera DNA +HiSeq 100 bp PE |
IMP |
20.6× |
|
|
F |
|
NEBNext Ultra II+NextSeq 150 bp PE |
VIM |
32.5× |
|
|
G |
|
NEBNext Ultra II+NextSeq 150 bp PE |
OXA-23-like and OXA-51-like |
22.2× |
|
PE, Paired end.
Summary of bioinformatic tools used for species identification and detecting AMR by each participant
|
Method step |
Lab_1a* |
Lab_1b* |
Lab_2 |
Lab_3 |
Lab_4 |
Lab_5 |
Lab_6 |
Lab_7 |
Lab_8 |
Lab_9 |
Reference |
|---|---|---|---|---|---|---|---|---|---|---|---|
|
Species ID |
Kraken-HLL |
Kraken-HLL |
|
mash and wgsa |
Kraken |
KmerFinder (assembled contigs) |
KmerFinder (raw reads) |
Centrifuge |
Kraken |
Kmerid |
[ |
|
Read assembly |
Shovill (SPAdes) |
Shovill (SPAdes) |
SPAdes |
Unicycler (SPAdes) |
No assembly |
A5-MiSeq |
Bionumerics |
No assembly |
Unicycler (SPAdes) |
No assembly |
[ |
|
AMR identifier |
rgi |
c-SSTAR |
ABRicate |
rgi and Resfinder |
ariba |
rgi |
Bionumerics |
srst2 |
ABRicate |
Genefinder |
[ |
|
Reference database |
card |
Resfinder and arg-annot |
card |
card and Resfinder |
card and arg-annot |
card |
Resfinder |
arg-annot |
Resfinder |
card and Resfinder (manually curated) |
[ |
|
Sequence identity cut-off |
80% |
95% |
75% |
80 % (card) and 90 % (Resfinder) |
90% |
80% |
90% |
90% |
75% |
90% |
|
|
Breadth of coverage cut-off |
0% |
0% |
0% |
0 % (card) and 80 % (Resfinder) |
20% |
0% |
60% |
90% |
0% |
100% |
|
*Lab_1 provided two sets of results with two separate methods for AMR detection; these are referred to as Lab_1a and Lab_1b.
Species identification for each sample by each participant
|
Participant |
A-1 |
A-2 |
B-1 |
B-2 |
C-1 |
C-2 |
D |
E |
F |
G |
|---|---|---|---|---|---|---|---|---|---|---|
|
Reference |
KP |
KP |
ECl |
ECl |
KO |
KO |
KP |
EC |
CF |
AB |
|
Lab_1 |
KP |
KP |
ECl |
ECl |
KO |
KO |
KP |
EC |
CF |
AB |
|
Lab_2 |
KP |
KP |
– |
ECl |
KO |
KO |
KP |
EC |
CF |
AB |
|
Lab_3 |
KP |
KP |
|
ECl |
KO |
KO |
KP |
EC |
|
AB |
|
Lab_4 |
KP |
KP |
ECl |
ECl |
KO |
KO |
KP |
EC |
|
AB |
|
Lab_5 |
KP |
KP |
ECl |
|
KO |
KO |
|
EC |
CF |
AB |
|
Lab_6 |
KP |
KP |
ECl |
ECl |
– |
KO |
|
EC |
CF |
AB |
|
Lab_7 |
KP |
KP |
ECl |
ECl |
KO |
KO |
KP |
EC |
CF |
AB |
|
Lab_8 |
KP |
KP |
ECl |
ECl |
KO |
KO |
KP |
EC |
CF |
AB |
|
Lab_9 |
KP |
KP |
ECl |
ECl |
KO |
KO |
KP |
EC |
CF |
AB |
Missing data represent no results reported. Results highlighted in bold represent discrepancies.
AB, ; CF, ; EC, ; ECl, ; KO, ; KP, .
Fig. 1.Number of AMR-associated genes identified in each sample by each team of participants.
Carbapenemase genes identified for each sample by each participant and the reference laboratory PCR (Ref PCR)
|
Participant |
A-1 |
A-2 |
B-1* |
B-2 |
C-1 |
C-2 |
D |
E |
F |
G |
|---|---|---|---|---|---|---|---|---|---|---|
|
Ref PCR† |
OXA-48-like |
OXA-48-like |
OXA-48-like |
OXA-48-like |
OXA-48-like |
OXA-48-like |
NDM |
IMP |
VIM |
OXA-23-like+OXA-51-like |
|
Lab_1a‡ |
OXA-48 |
OXA-48 |
– |
OXA-48 |
OXA-181 |
OXA-181 |
NDM-1 |
IMP-1 |
VIM-4 |
OXA-23+OXA-66 |
|
Lab_1b‡ |
OXA-48 |
OXA-48 |
OXA-48 |
OXA-48 |
OXA-181 |
OXA-181 |
NDM-1 |
|
VIM-4 |
OXA-23+OXA-66 |
|
Lab_2 |
OXA-48 |
OXA-48 |
– |
OXA-48 |
OXA-181 |
OXA-181 |
NDM-1 |
IMP-1 |
VIM-4 |
OXA-23+OXA-66 |
|
Lab_3 |
OXA-48 |
OXA-48 |
– |
OXA-48 |
OXA-181 |
OXA-181 |
NDM-1 |
IMP-1 |
VIM-4 |
OXA-23+OXA-66 |
|
Lab_4 |
OXA-48 |
OXA-48 |
– |
OXA-48 |
OXA-181 |
OXA-181 |
NDM-1 |
|
VIM-4 |
OXA-23+OXA-66 |
|
Lab_5 |
OXA-48 |
OXA-48 |
– |
OXA-48 |
OXA-181 |
OXA-181 |
NDM-1 |
IMP-1 |
VIM-4 |
|
|
Lab_6 |
OXA-48 |
OXA-48 |
– |
OXA-48 |
OXA-181 |
OXA-181 |
NDM-1 |
|
VIM-4 |
OXA-23+OXA-66 |
|
Lab_7 |
OXA-48 |
OXA-48 |
OXA-48 |
OXA-48 |
OXA-181 |
OXA-181 |
NDM-1 |
|
VIM-4 |
OXA-23+OXA-66 |
|
Lab_8 |
OXA-48 |
OXA-48 |
– |
OXA-48 |
OXA-181 |
OXA-181 |
NDM-1 |
|
VIM-4 |
OXA-23+OXA-66 |
|
Lab_9 |
OXA-48 |
OXA-48 |
|
OXA-48 |
OXA-181 |
OXA-181 |
NDM-1 |
IMP-1 |
VIM-4 |
OXA-23+OXA-66 |
*Missing data represent no results reported. Results highlighted in bold represent discrepancies.
†Specific carbapenemase PCR results for each sample.
‡Lab_1 provided different results using two separate methods; these are referred to as Lab_1a and Lab_1b.
Fig. 2.Presence of AMR-associated genes in each sample by each team of participants. Genes are organized and coloured by the class of antibiotics their resistance is associated with. Genes are only shown here if reported by more than one participant and if they were present in more than one reference database used. MLS, Macrolide lincosamide and streptogramin.
Fig. 3.Concordance between phenotypic AST result and the genotypic prediction from WGS data. Results are presented separately for each participant, sample and antibiotic. Each tile is coloured based on whether both the resistant phenotype and genotype agreed (R/R); both phenotype and genotype predicted sensitive (S/S); major errors where the phenotype was sensitive, but the genotype was resistant (S/R); and very major errors where the phenotype was resistant, but the genotype was sensitive (R/S). Missing cells represent a result not reported.