| Literature DB >> 31447800 |
Baiba Vilne1,2, Irēna Meistere1, Lelde Grantiņa-Ieviņa1, Juris Ķibilds1.
Abstract
Foodborne diseases (FBDs) are infections of the gastrointestinal tract caused by foodborne pathogens (FBPs) such as bacteria [Salmonella, Listeria monocytogenes and Shiga toxin-producing E. coli (STEC)] and several viruses, but also parasites and some fungi. Artificial intelligence (AI) and its sub-discipline machine learning (ML) are re-emerging and gaining an ever increasing popularity in the scientific community and industry, and could lead to actionable knowledge in diverse ranges of sectors including epidemiological investigations of FBD outbreaks and antimicrobial resistance (AMR). As genotyping using whole-genome sequencing (WGS) is becoming more accessible and affordable, it is increasingly used as a routine tool for the detection of pathogens, and has the potential to differentiate between outbreak strains that are closely related, identify virulence/resistance genes and provide improved understanding of transmission events within hours to days. In most cases, the computational pipeline of WGS data analysis can be divided into four (though, not necessarily consecutive) major steps: de novo genome assembly, genome characterization, comparative genomics, and inference of phylogeny or phylogenomics. In each step, ML could be used to increase the speed and potentially the accuracy (provided increasing amounts of high-quality input data) of identification of the source of ongoing outbreaks, leading to more efficient treatment and prevention of additional cases. In this review, we explore whether ML or any other form of AI algorithms have already been proposed for the respective tasks and compare those with mechanistic model-based approaches.Entities:
Keywords: bacterial WGS; bioinformatics analysis pipeline; food-borne disease; machine learning; outbreaks
Year: 2019 PMID: 31447800 PMCID: PMC6691741 DOI: 10.3389/fmicb.2019.01722
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
Figure 1An overview of an example bacterial sequence data analysis workflow.
An non-exhaustive list of the mechanistic model-based vs. ML tools for microbial genome analysis.
| Velvet (Zerbino and Birney, | PERGA (Zhu et al., | |
| 1. Bacterial strain identification | BLASTN (McGinnis and Madden, | PaPrBaG (Deneke et al., |
| 2. Bacterial genome annotation | PROKKA (Seemann, | Woods (Sharma et al., |
| 3. Virulence gene detection | VirulenceFinder (Joensen et al., | BacFier (Iraola et al., |
| 4. Antimicrobial resistance gene detection | ResFinder (Zankari et al., | DeepARG (Arango-Argoty et al., |
| 1. Reference-based SNP methods | CSI Phylogeny (Kaas et al., | |
| 2. Non-reference-based SNP analysis | KSNP (Gardner et al., | |
| 3. Pangenome-based analysis | Roary (Page et al., | |
| 4. Core genome/whole-genome multi-locus sequence typing (MLST) | EnteroBase (Alikhan et al., | BAPS/hierBAPS (Cheng et al., |
| RAxML (Stamatakis et al., | ||
The tool is dedicated to shotgun metagenomics;
the tool dedicated to 16S metataxonomics.