| Literature DB >> 35317240 |
Yunxiao Ren1, Trinad Chakraborty2,3, Swapnil Doijad2,3, Linda Falgenhauer3,4,5, Jane Falgenhauer2,3, Alexander Goesmann3,6, Oliver Schwengers3,6, Dominik Heider1.
Abstract
Antimicrobial resistance (AMR) is a global health and development threat. In particular, multi-drug resistance (MDR) is increasingly common in pathogenic bacteria. It has become a serious problem to public health, as MDR can lead to the failure of treatment of patients. MDR is typically the result of mutations and the accumulation of multiple resistance genes within a single cell. Machine learning methods have a wide range of applications for AMR prediction. However, these approaches typically focus on single drug resistance prediction and do not incorporate information on accumulating antimicrobial resistance traits over time. Thus, identifying multi-drug resistance simultaneously and rapidly remains an open challenge. In our study, we could demonstrate that multi-label classification (MLC) methods can be used to model multi-drug resistance in pathogens. Importantly, we found the ensemble of classifier chains (ECC) model achieves accurate MDR prediction and outperforms other MLC methods. Thus, our study extends the available tools for MDR prediction and paves the way for improving diagnostics of infections in patients. Furthermore, the MLC methods we introduced here would contribute to reducing the threat of antimicrobial resistance and related deaths in the future by improving the speed and accuracy of the identification of pathogens and resistance.Entities:
Keywords: AMR, Antimicrobial Resistance; MDR, Multi-Drug Resistance; MLC, Multi-Label Classification; Machine learning; Multi-drug resistance; Multi-label classification
Year: 2022 PMID: 35317240 PMCID: PMC8918850 DOI: 10.1016/j.csbj.2022.03.007
Source DB: PubMed Journal: Comput Struct Biotechnol J ISSN: 2001-0370 Impact factor: 7.271
Overview of the dataset.
| Antibiotics | CIP | CTX | CTZ | GEN |
|---|---|---|---|---|
| Resistant | 366 | 358 | 276 | 188 |
| Susceptible | 443 | 451 | 533 | 621 |
Fig. 1Transformation methods of multi-label classification problems. (A) One multi-label dataset. χi ∈ xis a training instance. (B) Binary relevance (BR) transforms the multi-label dataset with m labels into m independent binary datasets. (C) The process of classifier chain (CC) for multi-label data. (D) The possible number of label orders for ensemble classifier chains (ECC). (E) The transformation of the multi-label dataset by label powerset (LP). Labels with different colors represent the different combinations of labels. (F) The transformation of a multi-label dataset by random label space partitioning with label powerset (RD). Labels with different colors represent the different combinations of labels.
Fig. 2Performance of different MLC methods with RF base classifiers for resistance prediction for each antibiotic. (A) F-scores, (B) Precision, (C) Recall, and (D) Jaccard score of five MLC methods with RF base classifiers for predicting resistance against each antibiotic. ∗ p < 0.05, ∗∗p < 0.01, ∗∗∗p < 0.001, ns: no significance.
Accuracy, hamming loss, and 0/1 loss of five MLC methods with RF base classifier for predicting resistance against four antibiotics. Mean ± standard deviations (significance label of p-value) are shown in table. The statistical significances were compared each group to all (base-mean). ∗p < 0.05, ∗∗p < 0.01, ∗∗∗p < 0.001, ns: no significance.
| MLC | Accuracy | Hamming Loss | 0/1 Loss |
|---|---|---|---|
| BR | 0.51 ± 0.07 (ns) | 0.20 ± 0.03 (ns) | 0.49 ± 0.07 (ns) |
| CC | 0.52 ± 0.07 (ns) | 0.20 ± 0.04 (ns) | 0.48 ± 0.06 (ns) |
| ECC | 0.72 ± 0.13 (ns) | 0.11 ± 0.05 (*) | 0.28 ± 0.13 (ns) |
| LP | 0.53 ± 0.08 (ns) | 0.11 ± 0.05 (ns) | 0.47 ± 0.08 (ns) |
| RD | 0.51 ± 0.09 (ns) | 0.21 ± 0.04 (ns) | 0.49 ± 0.09 (ns) |
Fig. 3Performance of different MLC methods with LR base classifiers for resistance prediction for each antibiotic. (A) F-scores, (B) Precision, (C) Recall, and (D) Jaccard score of five MLC methods with RF base classifiers for predicting resistance against each antibiotic. ∗p < 0.05, ∗∗p < 0.01, ∗∗∗p < 0.001, ns: no significance.
Accuracy, hamming loss, and 0/1 loss of five MLC methods with LR base classifier for predicting resistance against four antibiotics. Mean ± standard deviations (significance label of p-value) are shown in table. The statistical significances were compared each group to all (base-mean). ∗p < 0.05, ∗∗p < 0.01, ∗∗∗p < 0.001, ns: no significance.
| MLC | Accuracy | Hamming Loss | 0/1 Loss |
|---|---|---|---|
| BR | 0.45 ± 0.08 (ns) | 0.24 ± 0.04 (ns) | 0.55 ± 0.08 (ns) |
| CC | 0.47 ± 0.08 (ns) | 0.23 ± 0.04 (ns) | 0.53 ± 0.08 (ns) |
| ECC | 0.65 ± 0.11 (ns) | 0.14 ± 0.05 (*) | 0.35 ± 0.11 (ns) |
| LP | 0.50 ± 0.08 (ns) | 0.23 ± 0.04 (ns) | 0.50 ± 0.08 (ns) |
| RD | 0.47 ± 0.07 (ns) | 0.24 ± 0.05 (ns) | 0.53 ± 0.07 (ns) |
Fig. 4Performance of different MLC methods with SVM base classifiers for resistance prediction for each antibiotic. (A) F-scores, (B) Precision, (C) Recall, and (D) Jaccard score of five MLC methods with RF base classifiers for predicting resistance against each antibiotic. ∗p < 0.05, ∗∗p < 0.01, ∗∗∗p < 0.001, ns: no significance.
Accuracy, hamming loss, and 0/1 loss of five MLC methods with SVM base classifier for predicting resistance against four antibiotics. Mean ± standard deviations (significance label of p-value) are shown in table. The statistical significances were compared each group to all (base-mean). ∗p < 0.05, ∗∗p < 0.01, ∗∗∗p < 0.001, ns: no significance.
| MLC | Accuracy | Hamming Loss | 0/1 Loss |
|---|---|---|---|
| BR | 0.37 ± 0.08 (ns) | 0.28 ± 0.05 (ns) | 0.63 ± 0.08 (ns) |
| CC | 0.39 ± 0.08 (ns) | 0.28 ± 0.05 (ns) | 0.61 ± 0.08 (ns) |
| ECC | 0.57 ± 0.12 (ns) | 0.18 ± 0.07 (ns) | 0.43 ± 0.12 (ns) |
| LP | 0.47 ± 0.07 (ns) | 0.24 ± 0.03 (ns) | 0.53 ± 0.07 (ns) |
| RD | 0.41 ± 0.09 (ns) | 0.26 ± 0.05 (ns) | 0.59 ± 0.09 (ns) |