| Literature DB >> 35705638 |
Amogelang R Raphenya1,2,3, James Robertson4, Casper Jamin5, Leonardo de Oliveira Martins6, Finlay Maguire7,8,9, Andrew G McArthur1,2,3, John P Hays10.
Abstract
Whole genome sequencing (WGS) is a key tool in identifying and characterising disease-associated bacteria across clinical, agricultural, and environmental contexts. One increasingly common use of genomic and metagenomic sequencing is in identifying the type and range of antimicrobial resistance (AMR) genes present in bacterial isolates in order to make predictions regarding their AMR phenotype. However, there are a large number of alternative bioinformatics software and pipelines available, which can lead to dissimilar results. It is, therefore, vital that researchers carefully evaluate their genomic and metagenomic AMR analysis methods using a common dataset. To this end, as part of the Microbial Bioinformatics Hackathon and Workshop 2021, a 'gold standard' reference genomic and simulated metagenomic dataset was generated containing raw sequence reads mapped against their corresponding reference genome from a range of 174 potentially pathogenic bacteria. These datasets and their accompanying metadata are freely available for use in benchmarking studies of bacteria and their antimicrobial resistance genes and will help improve tool development for the identification of AMR genes in complex samples.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35705638 PMCID: PMC9200708 DOI: 10.1038/s41597-022-01463-7
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 8.501
Fig. 1Diagram illustrating the sequence of steps and software involved in generating ‘gold standard’ bacterial whole genome sequence datasets for benchmarking bacterial assembly and prediction software.
Fig. 2Radar plot showing 94 samples analyzed using hAMRonization workflow. There are 579 genes comparing presence or absence for all the 5 tools tested.
Taxonomic composition of the benchmarking dataset.
| Organism | Sample Count |
|---|---|
| 5 | |
| 1 | |
| 4 | |
| 2 | |
| 1 | |
| 1 | |
| 3 | |
| 10 | |
| 2 | |
| 2 | |
| 1 | |
| 18 | |
| 3 | |
| 4 | |
| 56 | |
| 1 | |
| 1 | |
| 6 | |
| 22 | |
| 30 | |
| 1 |
| Measurement(s) | bacterial genomes |
| Technology Type(s) | next generation DNA sequencing |
| Factor Type(s) | None |
| Sample Characteristic - Organism | Bacterium |
| Sample Characteristic - Environment | Varying |
| Sample Characteristic - Location | World |