| Literature DB >> 29949970 |
Abstract
Motivation: Antimicrobial resistance (AMR) is becoming a huge problem in both developed and developing countries, and identifying strains resistant or susceptible to certain antibiotics is essential in fighting against antibiotic-resistant pathogens. Whole-genome sequences have been collected for different microbial strains in order to identify crucial characteristics that allow certain strains to become resistant to antibiotics; however, a global inspection of the gene content responsible for AMR activities remains to be done.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29949970 PMCID: PMC6022653 DOI: 10.1093/bioinformatics/bty276
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.Growth rates of the pan-genome sizes, core gene cluster and accessory gene cluster numbers with the increasing number of E. coli genomes. The blue, orange and green lines, respectively, represent core-, accessory- and pan-genome sizes
Fig. 2.Differences in the COGs functional distributions between the core- and accessory-genomes. COG percentages were estimated by dividing COG numbers by the total gene cluster numbers in either the core- or accessory-genome. Only COGs differing by at least 2-fold between the core and accessory parts were included
Fig. 3.Prediction accuracies of the AMR activities [in terms of the area under the ROCs curve (AUC)] based on the presence/absence patterns of (i) all core and accessory gene clusters (core + acc); (ii) all accessory gene clusters (acc); (iii) accessory gene clusters with CARD annotations (acc/card) and (iv) all CARD gene clusters. The boxplots indicate the distribution of the predictive accuracy of 12 selected drugs (Section 2 and Section 3). The four blocks of boxplots represent four different machine learning algorithms, including Adaboost, NB, RF and SVM, used in the prediction process. Dashed red line indicates 0.9 AUC
SVM prediction performances (based on the AUC) measured for ampicillin, gentamicin, trimethoprim/sulfamethoxazole and ciprofloxacin
| Drugs | 68 acc/card | Tyson 2005 | Scoary | Scoary/cardd | GA |
|---|---|---|---|---|---|
| Ampicillin | 0.64 | 0.86 | 0.75 | 0.79 | 0.97 |
| Gentamicin | 0.78 | 0.83 | 0.85 | 0.68 | 0.98 |
| Trim/sulfa | 0.87 | 0.82 | 0.76 | 0.87 | 0.94 |
| Ciprofloxacin | 0.71 | 0.78 | 0.93 | 0.87 | 0.93 |
68 accessory gene clusters with CARD annotations.
Genes established in (Tyson ).
Gene clusters that were associated with phenotypes extracted by Scoary.
Gene clusters that can be mapped to the CARD database extracted by Scoary.
Gene clusters selected by the GA.
Trimethoprim/sulfamethoxazole.
AUC measured from the leave-one-out evaluation process using SVM.
Fig. 4.SVM prediction accuracies of the antimicrobial resistance (AMR) activities (in terms of the area under the receiver operating characteristics curve (AUC)) based on 1) 68 accessory genes with CARD annotations (68 acc/card genes); 2) gene clusters selected for each drug based on the genetic algorithm (GA-selected clusters); 3) gene clusters identified by Scoary; and 4) gene clusters with CARD annotations identified by Scoary (Scoary with CARD). The boxplot indicates the distribution of the prediction accuracies for the 12 selected drugs. Dashed red line indicates 0.9 AUC