| Literature DB >> 32620856 |
Abu Sayed Chowdhury1, Douglas R Call2,3,4, Shira L Broschat2,3,4.
Abstract
With the ever-increasing availability of whole-genome sequences, machine-learning approaches can be used as an alternative to traditional alignment-based methods for identifying new antimicrobial-resistance genes. Such approaches are especially helpful when pathogens cannot be cultured in the lab. In previous work, we proposed a game-theory-based feature evaluation algorithm. When using the protein characteristics identified by this algorithm, called 'features' in machine learning, our model accurately identified antimicrobial resistance (AMR) genes in Gram-negative bacteria. Here we extend our study to Gram-positive bacteria showing that coupling game-theory-identified features with machine learning achieved classification accuracies between 87% and 90% for genes encoding resistance to the antibiotics bacitracin and vancomycin. Importantly, we present a standalone software tool that implements the game-theory algorithm and machine-learning model used in these studies.Entities:
Mesh:
Substances:
Year: 2020 PMID: 32620856 PMCID: PMC7335159 DOI: 10.1038/s41598-020-67949-9
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Predicted bac AMR sequences for Staphylococcus, Streptococcus, and Listeria using the GTDWFE algorithm.
| NCBI accession number | Protein names | Note |
|---|---|---|
| AAF81096 | Putative undecaprenol kinase | True positive |
| AAO04051 | Undecaprenol kinase | True positive |
| BAE05519 | bacA | True positive |
| BAE19180 | Putative undecaprenol kinase bacitracin resistance protein | True positive |
| CAL27243 | Putative undecaprenol kinase | True positive |
| EEK11594 | Undecaprenyl-diphosphatase UppP | True positive |
| EUJ19660 | Hypothetical protein MAQA_05683 | False positive |
| WP_018370157 | Serine | False positive |
Predicted van AMR sequences for Staphylococcus, Streptococcus, and Listeria using the GTDWFE algorithm.
| NCBI accession number | Protein names | Note |
|---|---|---|
| AAQ17160 | Vancomycin/teicoplanin A-type resistance protein VanA (plasmid) | True positive |
| AAQ17159 | Vancomycin resistance protein VanH (plasmid) | True positive |
| AAQ17157 | Vancomycin response regulator VanR (plasmid) | True positive |
| AAQ17158 | Sensor histidine kinase VanS (plasmid) | True positive |
| AAQ17161 | Vancomycin B-type resistance protein VanX (plasmid) | True positive |
| AAL07292 | D,D-dipeptidase VanXb, partial | True positive |
| AAQ17162 | D-alanyl-D-alanine carboxypeptidase VanY (plasmid) | True positive |
| AAQ17163 | vanZ protein (plasmid) | True positive |
| CDC71755 | Putative uncharacterized protein | False positive |
| WP_018370157 | Serine | False positive |
Figure 1The components of PARGT. Components outlined by dotted lines indicate additional training samples supplied by a user.
Figure 2Illustration of the PARGT GUI with its pop-up menu.