| Literature DB >> 33267317 |
Sergio Martínez-Agüero1, Inmaculada Mora-Jiménez1, Jon Lérida-García1, Joaquín Álvarez-Rodríguez2, Cristina Soguero-Ruiz1.
Abstract
The presence of bacteria with resistance to specific antibiotics is one of the greatest threats to the global health system. According to the World Health Organization, antimicrobial resistance has already reached alarming levels in many parts of the world, involving a social and economic burden for the patient, for the system, and for society in general. Because of the critical health status of patients in the intensive care unit (ICU), time is critical to identify bacteria and their resistance to antibiotics. Since common antibiotics resistance tests require between 24 and 48 h after the culture is collected, we propose to apply machine learning (ML) techniques to determine whether a bacterium will be resistant to different families of antimicrobials. For this purpose, clinical and demographic features from the patient, as well as data from cultures and antibiograms are considered. From a population point of view, we also show graphically the relationship between different bacteria and families of antimicrobials by performing correspondence analysis. Results of the ML techniques evidence non-linear relationships helping to identify antimicrobial resistance at the ICU, with performance dependent on the family of antimicrobials. A change in the trend of antimicrobial resistance is also evidenced.Entities:
Keywords: antibiogram; antimicrobial resistance; bacteria; clinical data; correspondence analysis; culture; feature selection; intensive care unit; machine learning
Year: 2019 PMID: 33267317 PMCID: PMC7515087 DOI: 10.3390/e21060603
Source DB: PubMed Journal: Entropy (Basel) ISSN: 1099-4300 Impact factor: 2.524
Data set description.
| Type—Feature | Category | Subcategory | # feat. | Mean ± std |
|---|---|---|---|---|
|
| - | - | 1 | 62 ± 14 |
|
| - | - | 1 | 61 ± 14 |
|
| - | - | 1 | 20 ± 7 |
|
| - | - | ||
|
| - | - | 1 | 12.1 ± 18.4 |
|
| Year, | - | 3 | 2010 ± 4, |
| Month, Day | - | 6 ± 3, 2 ± 2 | ||
|
|
|
|
|
|
|
| Male | - | 1 | 61.37 |
|
| Groups | A, B, C, D, | 8 | 14.9, 11.3, 25.1 |
| E, F, G, pluripathology | 7.6,8.8, 25, 28.9 | |||
|
| - | Surgical, Medical, Trauma | 3 | 100 |
|
| Surgery | Scheduled with (without) complications, Urgent with (without) complications | 4 | 25.6 |
| Respiratory | Chronic acute respiratory insufficiency, respiratory failure, Respiratory other | 3 | 21.2 | |
| Cardiovascular | Heart failure, Ischemic heart disease, Severe arrhythmia, Cardiorespiratory arrest, Hypovolemia, Cardiovascular other | 6 | 16.7 | |
| Infection | Serious infection, Immune-compromised infection | 2 | 16.2 | |
| Other medical | Digestive haemorrhage, Diabetic decompensation, Acute renal failure, Hepatic insufficiency, Voluntary intoxication, Pancreatitis | 6 | 9.7 | |
| Neurology | Stroke, Epilepsy, Alteration of the awareness level, Neuromuscular, Neurological other | 5 | 9.0 | |
| Trauma | Severe trauma | 1 | 0.8 | |
|
| Emergency | - | 1 | 27.4 |
| General surgery | - | 1 | 26.7 | |
| Internal medicine | - | 1 | 11.1 | |
| Others | Anesthesia, Dermatology, Digestive, Gastrointestinal, Hematology, Nefrology, Neumology, Neurology, Oncology, Other hospital, Others, Otorrinolaringology, Psychiatry, Surgery, Traumatology, Urology, Ophthalmology | 19 | 34.8 | |
|
| Exudate | Rectal, Nasal, Axillary, Pharyngeal, Inguinal, Wound, Urethral, Press ulcer | 20 | 91.9 |
| Others | Blood, Catheter, Urine, Feces, Abscess, Abdominal abscess, Respiratory, Drainage, Abdominal drainage, Abdominal fluid, Sputum, Pleural, Bronchoalveolar lavage, Secretion, Peritoneal liquid, Ascitic liquid, Biliary liquid | 29 | 8.1 | |
|
| Others | Amikacin, Gentamicin, Gentamicin high load synergy, Kanamycin high load synergy, Tobramycin, Imipenem, Meropenem, Ertapenem, Ceftazidime, Cefepime, Piperacillin, Ticarcillin, Mezlocillin, Colistin, Ciprofloxacin, Levofloxacin, Norfloxacin, Nalidixic acid, Ofloxacin, Moxifloxacin | 20 | 100 |
Figure 1(a) Bar graph of number of observations per germen when considering six antimicrobial families of interest (aminoglycosides (AMG), carbapenemics (CAR), fourth-generation cephalosporins (CF4), broad-spectrum antibiotics (PAP), polymixines (POL), and quinolones (QUI)). (b) Bar graph of number of observations of the most frequent germen (Pseudomonas) in terms of resistant and susceptibility for each antimicrobial family.
Contingency table used in the correspondence analysis for resistance.
| Fam. Antim. | AMG | CAR | QUI | |
|---|---|---|---|---|
| Bacterial Types | ||||
|
| 802 | 897 | 1065 | |
|
| 712 | 633 | 368 | |
|
| 396 | 250 | 1085 | |
Figure 2Correspondence analysis map of bacterial type and antimicrobial family when considering resistant observations.
Chi-square distances for resistant observations used in correspondence analysis.
| Fam. Antim. | AMG | CAR | QUI | |
|---|---|---|---|---|
| Bacterial Types | ||||
|
| 2.8 | 13.8 | 2.8 | |
|
| 64.9 | 41.0 | 153.7 | |
|
| 35.0 | 122.2 | 208.8 | |
Total number of observations (first row) and number of observations in the minority class (second row; in brackets, percentage) for each family of antimicrobians when Pseudomonas are considered. Label S in brackets stands for susceptible, and label R for resistant.
| AMG | CAR | CF4 | PAP | POL | QUI | |
|---|---|---|---|---|---|---|
|
| 2177 | 1458 | 1582 | 2309 | 570 | 1952 |
|
| 802 | 560 | 642 | 842 | 58 | 884 |
|
| (36% R) | (38% S) | (41% S) | (36% S) | (10% R) | (45% R) |
Figure 3Mean values, ranked in descending order, for the MI between each feature and the target when considering 50 different training sets for: (a) family 1. Pseudomonas—AMG; (b) Family 2. Pseudomonas—CAR; (c) Family 3. Pseudomonas—CF4; (d) Family 4. Pseudomonas—PAP; (e) Family 5. Pseudomonas—POL; and (f) Family 6. Pseudomonas—QUI. The solid lines represent the difference between the importance of a certain feature and the MI of the next one in the figure. The selected features for each family are inset.
Hyperparameters found when considering a five-fold cross-validation strategy on the training set for each model associated to the feature selection (FS) strategy and the antimicrobial family.
| FS | Model | Hyperparameter | AMG | CAR | CF4 | PAP | POL | QUI |
|---|---|---|---|---|---|---|---|---|
|
|
| Penalty coefficient | 0.73 | 0.62 | 0.48 | 0.13 | 0.05 | 0.01 |
|
| N° neighbors | 1 | 1 | 1 | 1 | 5 | 1 | |
|
| Max. depth | 22 | 23 | 31 | 20 | 22 | 20 | |
| Min. samples per leaf | 5 | 4 | 8 | 6 | 5 | 6 | ||
|
| Max. depth | 37 | 25 | 30 | 38 | 14 | 20 | |
| Min. samples per leaf | 5 | 4 | 4 | 6 | 5 | 6 | ||
| N° of tress | 50 | 100 | 30 | 50 | 50 | 100 | ||
|
| Activation function | Relu | Relu | Relu | Relu | Relu | Relu | |
| L2 penalty coefficient | 0.02 | 0.01 | 0.01 | 0.01 | 0.20 | 0.05 | ||
| N° of neurons | 59 | 60 | 59 | 62 | 58 | 61 | ||
|
|
| Penalty coefficient | 0.04 | 1.50 | 0.12 | 1.01 | 0.01 | 0.05 |
|
| N° neighbours | 1 | 1 | 1 | 1 | 7 | 1 | |
|
| Max. depth | 22 | 15 | 41 | 19 | 3 | 4 | |
| Min. samples per leaf | 5 | 4 | 5 | 6 | 1 | 20 | ||
|
| Max. depth | 30 | 18 | 18 | 22 | 26 | 30 | |
| Min. samples per leaf | 5 | 4 | 4 | 6 | 4 | 6 | ||
| N° of trees | 50 | 50 | 50 | 50 | 30 | 30 | ||
|
| Activation function | Relu | Relu | Sigmoid | Relu | Relu | Relu | |
| L2 penalty coefficient | 0.01 | 0.01 | 0.10 | 0.01 | 0.13 | 0.03 | ||
| N° of neurons | 64 | 64 | 62 | 63 | 59 | 57 |
Mean ± standard deviation of the performance (accuracy, specificity, sensitivity, F1-score) provided by five kind of models (second column) on 50 test sets. The goal is to determine resistance of Pseudomonas to six families of antimicrobials (first column). Two FS approaches have been considered: clinical knowledge (FS1) and mutual information (FS2). Bold figures refer to the highest performance per figure of merit and antimicrobial family.
| Family | Model | Accuracy | Specitivity | Sensitivity | F1-score | ||||
|---|---|---|---|---|---|---|---|---|---|
| FS1 | FS2 | FS1 | FS2 | FS1 | FS2 | FS1 | FS2 | ||
| AMG | LR | 78.2 ± 1.2 | 75.3 ± 1.7 | 80.0 ± 1.9 | 77.2 ± 2.7 | 76.5 ± 1.8 | 73.5 ± 2.4 | 77.8 ± 1.2 | 74.8 ± 1.7 |
| 79.3 ± 1.6 |
| 84.0 ± 2.5 |
| 74.5 ± 2.1 |
| 78.1 ± 1.3 |
| ||
| DT | 77.0 ± 1.2 | 78.6 ± 2.3 | 78.0 ± 3.7 | 81.7 ± 2.5 | 75.9 ± 2.6 | 76.6 ± 2.7 | 76.5 ± 1.1 | 78.0 ± 2.6 | |
| RF | 80.1 ± 1.6 | 80.8 ± 1.2 | 80.0 ± 2.3 | 81.3 ± 2.4 | 80.0 ± 2.0 | 80.2 ± 2.0 | 80.0 ± 1.6 | 80.5 ± 1.5 | |
| MLP | 80.8 ± 1.3 | 78.3 ± 1.0 | 83.0 ± 2.1 | 82.0 ± 1.7 | 78.6 ± 1.7 | 75.0 ± 1.2 | 80.1 ± 1.3 | 77.9 ± 1.1 | |
| CAR | LR | 77.3 ± 1.4 | 74.8 ± 2.3 | 76.0 ± 3.0 | 72.0 ± 3.4 | 78.7 ± 2.8 | 77.8 ± 3.1 | 77.6 ± 1.8 | 75.5 ± 2.3 |
| 79.9 ± 1.6 | 81.5 ± 1.4 | 80.0 ± 2.8 | 80.3 ± 2.5 | 80.1 ± 2.5 | 82.7 ± 2.2 | 79.8 ± 1.7 | 81.6 ± 1.7 | ||
| DT | 78.1 ± 2.2 | 79.4 ± 2.0 | 80.0 ± 2.6 |
| 75.8 ± 2.7 | 76.2 ± 3.2 | 77.3 ± 2.7 | 78.9 ± 2.0 | |
| RF |
| 82.2 ± 1.7 | 78.0 ± 4.0 | 82.0 ± 3.1 |
| 82.5 ± 2.6 |
| 82.5 ± 1.6 | |
| MLP | 81.9 ± 1.5 | 79.0 ± 1.9 | 81.0 ± 3.3 | 78.6 ± 2.5 | 82.6 ± 2.9 | 80.2 ± 2.1 | 82.3 ± 1.5 | 79.2 ± 1.8 | |
| CF4 | LR | 68.7 ± 2.0 | 67.8 ± 1.3 | 70.0 ± 3.1 | 68.1 ± 2.3 | 67.9 ± 2.6 | 67.1 ± 2.1 | 68.2 ± 2.3 | 67.3 ± 1.8 |
| 77.7 ± 1.5 | 75.6 ± 1.6 |
| 78.9 ± 2.6 | 75.2 ± 2.4 | 72.9 ± 2.6 | 77.0 ± 1.4 | 74.7 ± 2.0 | ||
| DT | 71.0 ± 1.1 | 74.6 ± 2.2 | 74.4 ± 3.7 | 78.0 ± 3.6 | 67.9 ± 3.2 | 71.8 ± 3.1 | 70.3 ± 1.4 | 73.9 ± 2.4 | |
| RF | 75.8 ± 1.9 |
| 72.0 ± 4.3 | 77.2 ± 3.7 | 79.4 ± 3.0 |
| 76.7 ± 1.7 |
| |
| MLP | 77.0 ± 1.5 | 75.8 ± 1.4 | 81.0 ± 4.5 | 77.4 ± 3.6 | 73.4 ± 3.7 | 75.1 ± 2.7 | 76.6 ± 1.1 | 75.3 ± 1.9 | |
| PAP | LR | 69.0 ± 1.4 | 67.7 ± 1.4 | 68.0 ± 2.8 | 65.9 ± 3.0 | 70.4 ± 2.5 | 70.2 ± 2.5 | 69.1 ± 1.6 | 68.1 ± 1.4 |
| 78.3 ± 1.6 |
|
| 81.3 ± 2.2 | 74.8 ± 2.4 | 76.3 ± 2.7 | 77.4 ± 1.9 | 77.7 ± 1.7 | ||
| DT | 72.6 ± 2.2 | 74.6 ± 2.0 | 74.0 ± 3.5 | 78.1 ± 4.1 | 70.9 ± 2.6 | 71.1 ± 3.5 | 71.7 ± 2.5 | 73.4 ± 2.3 | |
| RF | 75.2 ± 1.5 | 75.6 ± 1.2 | 71.0 ± 3.0 | 73.0 ± 3.0 |
| 78.3 ± 2.4 | 75.7 ± 1.8 | 76.1 ± 1.4 | |
| MLP | 78.2 ± 1.7 | 75.4 ± 1.4 | 78.0 ± 1.8 | 76.3 ± 2.3 | 78.0 ± 3.3 | 74.4 ± 1.9 |
| 75.1 ± 1.8 | |
| POL | LR | 68.1 ± 6.3 | 70.95 ± 6.8 | 63.0 ± 7.7 | 65.1 ± 10.0 | 73.4 ± 6.3 |
| 69.6 ± 6.6 | 71.4 ± 6.9 |
| 63.9 ± 7.2 | 70.5 ± 7.0 | 68.0 ± 12.5 | 74.0 ± 6.7 | 61.4 ± 11.8 | 67.4 ± 7.4 | 63.0 ± 9.3 | 69.2 ± 9.0 | ||
| DT | 60.6 ± 5.2 |
| 65.0 ± 12.2 | 69.3 ± 14.6 | 57.0 ± 11.6 | 75.5 ± 7.9 | 58.3 ± 7.8 |
| |
| RF | 65.3 ± 7.9 | 68.9 ± 6.9 | 58.0 ± 15.0 | 66.4 ± 11.2 | 74.2 ± 11.2 | 73.4 ± 6.6 | 68.4 ± 6.9 | 70.0 ± 6.2 | |
| MLP | 67.7 ± 3.5 | 70.8 ± 6.2 |
| 71.0 ± 12.4 | 60.5 ± 9.2 | 71.5 ± 9.5 | 64.4 ± 2.3 | 71.3 ± 5.8 | |
| QUI | LR | 72.1 ± 2.1 | 71.8 ± 1.5 | 70.0 ± 2.8 | 71.6 ± 2.4 | 74.2 ± 2.5 | 72.2 ± 2.6 | 72.7 ± 2.1 | 71.9 ± 1.9 |
| 86.8 ± 1.1 |
| 86.0 ± 1.6 |
| 87.6 ± 1.7 |
| 86.8 ± 1.3 |
| ||
| DT | 81.8 ± 1.7 | 82.3 ± 1.5 | 83.7 ± 2.8 | 85.0 ± 2.8 | 80.1 ± 2.2 | 79.6 ± 2.2 | 81.4 ± 2.1 | 81.7 ± 1.7 | |
| RF | 82.5 ± 1.8 | 83.6 ± 1.4 | 79.0 ± 2.9 | 84.6 ± 2.4 | 86.0 ± 2.9 | 83.6 ± 2.0 | 82.9 ± 1.7 | 83.7 ± 1.3 | |
| MLP | 87.1 ± 1.4 | 83.4 ± 2.0 | 87.0 ± 0.8 | 82.7 ± 1.8 | 87.0 ± 1.0 | 84.8 ± 1.7 | 86.7 ± 1.7 | 83.7 ± 2.0 | |
Figure 4Decision tree classifier trained on quinolones observations: (a) based on clinical knowledge feature selection procedure; and (b) based on MI feature selection procedure.
Mean ± standard deviation of the performance (accuracy, specificity, sensitivity, F1-score) provided by the k-nn model on 50 test sets. Results are shown for every antimicrobial family (first column) after excluding the scores APACHE II and SAPS 3 from the set of selected features by MI.
| Antimicr. Family | Accuracy | Specificity | Sensitivity | F1-Score |
|---|---|---|---|---|
| AMG | 82.2 ± 1.7 | 86.0 ± 2.3 | 78.7 ± 2.3 | 81.6 ± 1.9 |
| CAR | 79.6 ± 2.1 | 81.0 ± 3.5 | 78.3 ± 2.8 | 79.0 ± 2.0 |
| CF4 | 74.9 ± 2.1 | 77.0 ± 3.7 | 72.6 ± 2.6 | 74.3 ± 2.0 |
| PAP | 77.1 ± 1.7 | 80.0 ± 2.9 | 74.0 ± 2.5 | 76.1 ± 1.8 |
| POL | 68.5 ± 7.0 | 62.0 ± 14.2 | 78.1 ± 12.2 | 70.3 ± 7.2 |
| QUI | 88.1 ± 1.6 | 88.0 ± 2.1 | 88.7 ± 2.1 | 88.0 ± 1.8 |