| Literature DB >> 32117101 |
Zhichang Liu1,2,3,4, Dun Deng1,2,3,4, Huijie Lu1,2,3,4, Jian Sun5, Luchao Lv5, Shuhong Li1,2,3,4, Guanghui Peng1,2,3,4, Xianyong Ma1,2,3,4, Jiazhou Li1,2,3,4, Zhenming Li1,2,3,4, Ting Rong1,2,3,4, Gang Wang1,2,3,4.
Abstract
Antimicrobial resistance (AMR) is becoming a huge problem in countries all over the world, and new approaches to identifying strains resistant or susceptible to certain antibiotics are essential in fighting against antibiotic-resistant pathogens. Genotype-based machine learning methods showed great promise as a diagnostic tool, due to the increasing availability of genomic datasets and AST phenotypes. In this article, Support Vector Machine (SVM) and Set Covering Machine (SCM) models were used to learn and predict the resistance of the five drugs (Tetracycline, Ampicillin, Sulfisoxazole, Trimethoprim, and Enrofloxacin). The SVM model used the number of co-occurring k-mers between the genome of the isolates and the reference genes to learn and predict the phenotypes of the bacteria to a specific antimicrobial, while the SCM model uses a greedy approach to construct conjunction or disjunction of Boolean functions to find the most concise set of k-mers that allows for accurate prediction of the phenotype. Five-fold cross-validation was performed on the training set of the SVM and SCM model to select the best hyperparameter values to avoid model overfitting. The training accuracy (mean cross-validation score) and the testing accuracy of SVM and SCM models of five drugs were above 90% regardless of the resistant mechanism of which were acquired resistant or point mutation in the chromosome. The results of correlation between the phenotype and the model predictions of the five drugs indicated that both SVM and SCM models could significantly classify the resistant isolates from the sensitive isolates of the bacteria (p < 0.01), and would be used as potential tools in antimicrobial resistance surveillance and clinical diagnosis in veterinary medicine.Entities:
Keywords: Actinobacillus pleuropneumoniae; Set Covering Machine; Support Vector Machine; antimicrobial resistance; genomics; machine learning
Year: 2020 PMID: 32117101 PMCID: PMC7016212 DOI: 10.3389/fmicb.2020.00048
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
FIGURE 1The phenotype of 96 isolates. (A) Bar plot of phenotype availability for the different drugs. (B) Venn diagram quantifying the number of instances of co-occurrence of resistance between drugs.
FIGURE 2Number of k-mers of resistant genes in the whole genome of 96 isolates of A. pleuropneumoniae. (A) Isolates with k-mers of tetB and tetH genes, (B) Isolates with k-mers of blaROB-1 gene, (C) Isolates with k-mer of sul2 gene, (D) Isolates with k-mers of dfrA14 and dfrA30 genes, (E) Isolates with k-mers in gene of gyrA QRDR without point mutation, (F) Isolates with k-mers in genes of parC and parE QRDR without point mutation.
FIGURE 3Bars with red color show the mean accuracy for the tuned model with five-fold cross-validation on the training dataset. Bars with blue color are the accuracy of the tuned model on the test dataset. The error bars are standard deviations. (A) SVM model, (B) SCM model.
Prediction metrics on test datasets using the best performing SVM and SCM models.
| Tetracycline | 0.95 ± 0.05 | 1.00 ± 0.00 | 0.97 ± 0.03 | 1.00 ± 0.00 | 0.86 ± 0.08 | 1.00 ± 0.00 | 0.92 ± 0.04 | 1.00 ± 0.00 |
| Ampicillin | 1.00 ± 0.00 | 1.00 ± 0.00 | 1.00 ± 0.00 | 1.00 ± 0.00 | 1.00 ± 0.00 | 1.00 ± 0.00 | 1.00 ± 0.00 | 1.00 ± 0.00 |
| Sulfisoxazole | 1.00 ± 0.00 | 1.00 ± 0.00 | 1.00 ± 0.00 | 1.00 ± 0.00 | 1.00 ± 0.00 | 1.00 ± 0.00 | 1.00 ± 0.00 | 1.00 ± 0.00 |
| Trimethoprim | 1.00 ± 0.00 | 1.00 ± 0.00 | 1.00 ± 0.00 | 1.00 ± 0.00 | 1.00 ± 0.00 | 1.00 ± 0.00 | 1.00 ± 0.00 | 1.00 ± 0.00 |
| Enrofloxacin | 1.00 ± 0.00 | 1.00 ± 0.00 | 1.00 ± 0.00 | 1.00 ± 0.00 | 1.00 ± 0.00 | 1.00 ± 0.00 | 1.00 ± 0.00 | 1.00 ± 0.00 |
Correlation of phenotype and model predictions of SVM and SCM models.
| Tetracycline | (58, 38) | (55, 38) | (50, 38) | ||
| Ampicillin | (19, 77) | (19, 77) | (19, 77) | ||
| Sulfisoxazole | (46, 50) | (46, 50) | (46, 50) | ||
| Trimethoprim | (16, 80) | (16, 80) | (16, 80) | ||
| Enrofloxacin | (6, 90) | (6, 90) | (6, 90) | ||