| Literature DB >> 32027671 |
Tsi-Shu Huang1, Susan Shin-Jung Lee1,2,3, Chia-Chien Lee1, Fu-Chuen Chang4.
Abstract
BACKGROUND: Carbapenem-resistant Klebsiella pneumoniae (CRKP) is emerging as a significant pathogen causing healthcare-associated infections. Matrix-assisted laser desorption/ionisation mass spectrometry time-of-flight mass spectrometry (MALDI-TOF MS) is used by clinical microbiology laboratories to address the need for rapid, cost-effective and accurate identification of microorganisms. We evaluated application of machine learning methods for differentiation of drug resistant bacteria from susceptible ones directly using the profile spectra of whole cells MALDI-TOF MS in 46 CRKP and 49 CSKP isolates.Entities:
Mesh:
Substances:
Year: 2020 PMID: 32027671 PMCID: PMC7004327 DOI: 10.1371/journal.pone.0228459
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Overall flowchart (A) and the representative steps of procedures of dimension reduction and cross validation steps (B).
Top 10 peaks with least sum of 95 ranks produced each time one different spectrum omitted and their percentage of presence in each group.
| m/z | Present in no. (%) of: | No. of times the peak ranked as: | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| CRKP | CSKP | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | >10 | |
| 9478.866 | 38 (82.6%) | 1 (2.2%) | 95 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 9541.405 | 35 (76.1%) | 1 (2.2%) | 0 | 91 | 2 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| 6288.794 | 35 (76.1%) | 12 (26.1%) | 0 | 2 | 67 | 26 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 7705.009 | 37 (80.4%) | 1 (2.2%) | 0 | 1 | 25 | 66 | 1 | 0 | 0 | 0 | 0 | 0 | 2 |
| 7158.634 | 41 (89.1%) | 12 (26.1%) | 0 | 1 | 1 | 1 | 91 | 0 | 0 | 0 | 0 | 0 | 1 |
| 10287.76 | 35 (76.1%) | 2 (4.3%) | 0 | 0 | 0 | 1 | 1 | 92 | 0 | 0 | 0 | 0 | 1 |
| 4768.279 | 25 (54.3%) | 10 (21.7%) | 0 | 0 | 0 | 0 | 2 | 1 | 89 | 3 | 0 | 0 | 0 |
| 2636.88 | 24 (52.2%) | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 2 | 70 | 15 | 6 | 0 |
| 4362.217 | 21 (45.7%) | 10 (21.7%) | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 8 | 58 | 18 | 9 |
| 5379.418 | 25 (54.3%) | 10 (21.7%) | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 5 | 7 | 54 | 29 |
Fig 2Performance of five machine learning algorithms with leave-one-out cross validation using 46 CRKP and 49 CSKP mass spectra, in terms of accuracy, sensitivity and specificity for differentiate CRKP from CSKP.
Accuracy (Accu), sensitivity (Sen) and specificity (Spe) are used to evaluate prediction systems. Panel A, Boxplots (25th to 75th percentiles, Min to max) of accuracy, sensitivity, and specificity for differentiation of CRKP from CSKP when 1–100 peaks selected. In Panel B, Values of accuracy, sensitivity and specificity with number of ranked peaks (k = 1–100) with increasing p-value (X axis) selected for machine learning algorithms. k: number of ranked peaks selected with increasing p-value for classification by using all five machine learning algorithms.
Performance of five machine learning algorithms with L1O cross validation using 46 CRKP and 49 CSKP mass spectra, in terms of accuracy, sensitivity and specificity for differentiate CRKP from CSKP.
| Algorithm | Metric | No. of ranked peaks selected with increasing | |||||
|---|---|---|---|---|---|---|---|
| 50 | 60 | 70 | 80 | 90 | 100 | ||
| Random Forest | Accuracy | 94% | 94% | 95% | 97% | 95% | 94% |
| Sensitivity | 91% | 91% | 91% | 93% | 89% | 87% | |
| Specificity | 96% | 96% | 98% | 100% | 100% | 100% | |
| Logistic Regression | Accuracy | 93% | 92% | 92% | 93% | 91% | 91% |
| Sensitivity | 93% | 93% | 96% | 98% | 96% | 98% | |
| Specificity | 92% | 90% | 88% | 88% | 86% | 84% | |
| Naïve Bayes | Accuracy | 86% | 88% | 87% | 89% | 88% | 89% |
| Sensitivity | 74% | 76% | 74% | 78% | 76% | 78% | |
| Specificity | 98% | 100% | 100% | 100% | 100% | 100% | |
| Nearest Neighbors | Accuracy | 91% | 84% | 87% | 87% | 83% | 85% |
| Sensitivity | 87% | 83% | 87% | 89% | 87% | 91% | |
| Specificity | 94% | 86% | 88% | 86% | 80% | 80% | |
| Support Vector Machine | Accuracy | 87% | 84% | 84% | 86% | 85% | 87% |
| Sensitivity | 96% | 96% | 96% | 100% | 98% | 98% | |
| Specificity | 80% | 73% | 73% | 73% | 73% | 78% | |
Fig 3Random amplified polymorph in DNA fingerprinting (RAPD) types of 46 carbapenem-resistant K. pneumoniae generated by arbitrarily primed PCR.
Lanes S, standard strain included in every experiment as a control. Results for the study strains were shown in the order of the date of isolation. Lane M shows the 1-kb DNA ladder.