| Literature DB >> 35372195 |
Çaǧlar Çaǧlayan1, Sean L Barnes2, Lisa L Pineles3, Anthony D Harris3, Eili Y Klein4,5.
Abstract
Background: The rising prevalence of multi-drug resistant organisms (MDROs), such as Methicillin-resistant Staphylococcus aureus (MRSA), Vancomycin-resistant Enterococci (VRE), and Carbapenem-resistant Enterobacteriaceae (CRE), is an increasing concern in healthcare settings. Materials andEntities:
Keywords: Methicillin-resistant Staphylococcus aureus (MRSA); carbapenem-resistant Enterobacteriaceae (CRE); data-centric analytics; healthcare-associated infections (HAIs); machine learning (ML); multidrug-resistant organisms (MDROs); predictive analytics; vancomycin-resistant enterococci (VRE)
Mesh:
Year: 2022 PMID: 35372195 PMCID: PMC8968755 DOI: 10.3389/fpubh.2022.853757
Source DB: PubMed Journal: Front Public Health ISSN: 2296-2565
Figure 1Threshold optimization formulation.
Model parameters for best performing logistic regression models.
|
| ||||
|---|---|---|---|---|
|
|
|
|
|
|
| Cs | 100 | 100 | 100 | 100 |
| class_weight | None | None | None | None |
| cv=StratifiedKFold | n_splits=10 | n_splits=10 | n_splits=10 | n_splits=10 |
| dual | False | False | False | False |
| fit_intercept | True | True | True | True |
| intercept_scaling | 1 | 1 | 1 | 1 |
| max_iter | 100 | 100 | 100 | 100 |
| multi_class | 'ovr' | 'ovr' | 'ovr' | 'ovr' |
| n_jobs | 1 | 1 | 1 | 1 |
| penalty | L1 | L1 | L1 | L1 |
| random_state | None | None | None | None |
| refit | True | True | True | True |
| scoring | roc_auc | roc_auc | roc_auc | roc_auc |
| solver | liblinear | liblinear | liblinear | liblinear |
| tol | 0.0001 | 0.0001 | 0.0001 | 0.0001 |
| verbose | 0 | 0 | 0 | 0 |
| Threshold Bound | 0.15 | 0.025 | 0.20 | 0.50 |
Figure 2Threshold value for converting predicted probabilities to binary predictions.
Performance summary of the machine learning models for VRE, CRE, MRSA, and MDRO colonization predictions.
|
|
|
|
| |||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
| |||||||||||||
|
|
|
|
|
| ||||||||||||
| Training AUC | 0.76 | 0.77 | 0.77 | Training AUC | 0.70 | 0.76 | 0.78 | Training AUC | 0.65 | 0.66 | 0.66 | Training AUC | 0.72 | 0.86 | 0.88 | 0.75 |
| Testing AUC | 0.80 | 0.77 | 0.77 | Testing AUC | 0.78 | 0.72 | 0.71 | Testing AUC | 0.66 | 0.66 | 0.69 | Testing AUC | 0.70 | 0.87 | 0.89 | 0.76 |
| Testing sensitivity | 0.99 | 0.97 | 1.00 | Testing sensitivity | 1.00 | 0.73 | 0.82 | Testing sensitivity | 1.00 | 0.88 | 1.00 | Testing sensitivity | 0.92 | 0.97 | 1.00 | 0.93 |
| Testing specificity | 0.09 | 0.33 | 0.15 | Testing specificity | 0.31 | 0.68 | 0.37 | Testing specificity | 0.02 | 0.22 | 0.00 | Testing specificity | 0.30 | 0.39 | 0.21 | 0.51 |
|
|
|
|
| |||||||||||||
|
|
|
|
|
| ||||||||||||
| Training AUC | 0.76 | 0.77 | 0.76 | Training AUC | 0.70 | 0.76 | 0.80 | Training AUC | 0.65 | 0.66 | 0.67 | Training AUC | 0.72 | 0.86 | 0.88 | 0.76 |
| Testing AUC | 0.80 | 0.77 | 0.77 | Testing AUC | 0.78 | 0.72 | 0.71 | Testing AUC | 0.66 | 0.66 | 0.71 | Testing AUC | 0.70 | 0.87 | 0.89 | 0.81 |
| Testing sensitivity | 0.89 | 0.79 | 0.88 | Testing sensitivity | 0.82 | 0.73 | 0.73 | Testing sensitivity | 0.76 | 0.71 | 0.82 | Testing sensitivity | 0.68 | 0.82 | 0.90 | 0.93 |
| Testing specificity | 0.49 | 0.57 | 0.48 | Testing specificity | 0.57 | 0.77 | 0.51 | Testing specificity | 0.45 | 0.53 | 0.45 | Testing specificity | 0.59 | 0.74 | 0.69 | 0.58 |
|
|
|
|
| |||||||||||||
|
|
|
|
|
| ||||||||||||
| Training AUC | 0.76 | 0.77 | 0.76 | Training AUC | 0.70 | 0.76 | 0.80 | Training AUC | 0.65 | 0.66 | 0.66 | Training AUC | 0.72 | 0.86 | 0.87 | 0.76 |
| Testing AUC | 0.80 | 0.77 | 0.78 | Testing AUC | 0.78 | 0.72 | 0.73 | Testing AUC | 0.66 | 0.66 | 0.68 | Testing AUC | 0.70 | 0.87 | 0.89 | 0.81 |
| Testing sensitivity | 0.80 | 0.73 | 0.78 | Testing sensitivity | 0.73 | 0.55 | 0.73 | Testing sensitivity | 0.64 | 0.67 | 0.73 | Testing sensitivity | 0.65 | 0.75 | 0.85 | 0.89 |
| Testing specificity | 0.66 | 0.65 | 0.59 | Testing specificity | 0.69 | 0.83 | 0.63 | Testing specificity | 0.60 | 0.60 | 0.57 | Testing specificity | 0.63 | 0.82 | 0.79 | 0.65 |
|
|
|
|
| |||||||||||||
|
|
|
|
|
| ||||||||||||
| Training AUC | 0.76 | 0.77 | 0.77 | Training AUC | 0.70 | 0.76 | 0.79 | Training AUC | 0.65 | 0.66 | 0.67 | Training AUC | 0.72 | 0.86 | 0.87 | 0.76 |
| Testing AUC | 0.80 | 0.77 | 0.77 | Testing AUC | 0.78 | 0.72 | 0.71 | Testing AUC | 0.66 | 0.66 | 0.69 | Testing AUC | 0.70 | 0.87 | 0.89 | 0.79 |
| Testing sensitivity | 0.66 | 0.63 | 0.75 | Testing sensitivity | 0.73 | 0.36 | 0.64 | Testing sensitivity | 0.48 | 0.56 | 0.71 | Testing sensitivity | 0.57 | 0.69 | 0.82 | 0.79 |
| Testing specificity | 0.76 | 0.75 | 0.66 | Testing specificity | 0.73 | 0.89 | 0.67 | Testing specificity | 0.75 | 0.71 | 0.58 | Testing specificity | 0.73 | 0.86 | 0.83 | 0.64 |
|
|
|
|
| |||||||||||||
|
|
|
|
|
| ||||||||||||
| Training AUC | 0.76 | 0.77 | 0.77 | Training AUC | 0.70 | 0.76 | 0.79 | Training AUC | 0.65 | 0.66 | 0.66 | Training AUC | 0.72 | 0.86 | 0.88 | 0.76 |
| Testing AUC | 0.80 | 0.77 | 0.77 | Testing AUC | 0.78 | 0.72 | 0.71 | Testing AUC | 0.66 | 0.66 | 0.69 | Testing AUC | 0.70 | 0.87 | 0.89 | 0.80 |
| Testing sensitivity | 0.66 | 0.65 | 0.61 | Testing sensitivity | 0.64 | 0.36 | 0.64 | Testing sensitivity | 0.70 | 0.45 | 0.67 | Testing sensitivity | 0.56 | 0.70 | 0.85 | 0.77 |
| Testing specificity | 0.78 | 0.72 | 0.74 | Testing specificity | 0.78 | 0.91 | 0.74 | Testing specificity | 0.55 | 0.78 | 0.59 | Testing specificity | 0.75 | 0.84 | 0.79 | 0.70 |
|
|
|
|
| |||||||||||||
|
|
|
|
|
| ||||||||||||
| Training AUC | 0.76 | 0.77 | 0.77 | Training AUC | 0.70 | 0.76 | 0.79 | Training AUC | 0.65 | 0.65 | 0.66 | Training AUC | 0.72 | 0.86 | 0.88 | 0.77 |
| Testing AUC | 0.80 | 0.77 | 0.78 | Testing AUC | 0.78 | 0.72 | 0.72 | Testing AUC | 0.66 | 0.66 | 0.70 | Testing AUC | 0.70 | 0.87 | 0.89 | 0.80 |
| Testing sensitivity | 0.61 | 0.59 | 0.63 | Testing sensitivity | 0.55 | 0.27 | 0.64 | Testing sensitivity | 0.48 | 0.24 | 0.76 | Testing sensitivity | 0.58 | 0.70 | 0.84 | 0.89 |
| Testing specificity | 0.82 | 0.77 | 0.74 | Testing specificity | 0.82 | 0.94 | 0.79 | Testing specificity | 0.75 | 0.89 | 0.59 | Testing specificity | 0.72 | 0.85 | 0.79 | 0.62 |
Performance summary of the supervised machine learning models with the highest Youden's index.
|
|
| |||||||
|---|---|---|---|---|---|---|---|---|
|
|
|
|
| |||||
| Training AUC | 0.76 | 0.77 | 0.77 | Training AUC | 0.65 | 0.66 | 0.66 | |
| Testing AUC | 0.80 | 0.77 | 0.77 | Testing AUC | 0.66 | 0.66 | 0.70 | |
| Testing sensitivity | 0.80 | 0.73 | 0.75 | Testing sensitivity | 0.70 | 0.67 | 0.76 | |
| Testing specificity | 0.66 | 0.65 | 0.66 | Testing specificity | 0.55 | 0.60 | 0.59 | |
| Youden index | 0.46 | 0.39 | 0.41 | Youden index | 0.24 | 0.27 | 0.34 | |
| Threshold opt. bound | 0.15 | 0.15 | 0.20 | Threshold opt. bound | 0.20 | 0.10 | 0.30 | |
|
|
|
|
|
| ||||
| Training AUC | 0.70 | 0.76 | 0.79 | Training AUC | 0.72 | 0.86 | 0.87 | 0.76 |
| Testing AUC | 0.78 | 0.72 | 0.72 | Testing AUC | 0.70 | 0.87 | 0.89 | 0.81 |
| Testing sensitivity | 0.73 | 0.73 | 0.64 | Testing sensitivity | 0.56 | 0.75 | 0.82 | 0.89 |
| Testing specificity | 0.73 | 0.77 | 0.79 | Testing specificity | 0.75 | 0.82 | 0.83 | 0.65 |
| Youden index | 0.45 | 0.50 | 0.42 | Youden index | 0.30 | 0.57 | 0.65 | 0.54 |
| Threshold opt. bound | 0.025 | 0.015 | 0.05 | Threshold opt. bound | 0.50 | 0.30 | 0.40 | 0.30 |
Top five common predictors for VRE, CRE, MRSA, and MDRO colonization identified by the machine learning models.
|
| |||||
|---|---|---|---|---|---|
|
|
| ||||
|
|
|
|
| ||
| Long-term care facility stay | Yes | 1 | 3 | 1 | |
| Recent 1-digit ICD10 procedure | Other procedures | 2 | 1 | 2 | |
| Current diagnosis CCS class | Skin and subcutaneous tissue | 3 | 2 | 3 | |
| Recent 2-digit ICD10 procedure | Medical/surgical anatomical | 6 | 5 | 8 | |
| Recent 2-digit ICD10 procedure | Administration circulatory | 8 | 4 | 6 | |
|
| |||||
|
|
| ||||
|
|
|
|
| ||
| Current diagnosis CCS class | Skin and subcutaneous tissue | 2 | 2 | 1 | |
| Recent 1-digit ICD10 procedure | Other procedures | 3 | 3 | 2 | |
| Prior ICU stay | > 20 days | 4 | 6 | 5 | |
| Long-term care facility stay | Yes | 5 | 6 | 6 | |
| Number of current diagnosis PoA | > 30 and ≤ 50 | 6 | 8 | 3 | |
|
| |||||
|
|
| ||||
|
|
|
|
| ||
| Recent 1-digit ICD10 procedure | Other procedures | 1 | 2 | 1 | |
| Current diagnosis CCS class | Skin and subcutaneous tissue | 2 | 9 | 2 | |
| Current diagnosis CCS class | Injury and poisoning | 7 | 1 | 8 | |
| Current diagnosis CCS class | Infectious and parasitic | 9 | 8 | 5 | |
| Recent 1-digit ICD10 procedure | Administration | −3 | 3 | 14 | |
|
| |||||
|
|
| ||||
|
|
|
|
|
| |
| Recent 1-digit ICD10 procedure | Other procedures | 7 | 1 | 1 | 2 |
| Current diagnosis CCS class | Skin and subcutaneous tissue | 16 | 2 | 2 | 14 |
| Current diagnosis CCS class | Mental illness | 35 | 6 | 3 | 12 |
| Current diagnosis CCS class | Infectious and parasitic | 57 | 12 | 4 | 16 |
| Sex | Female | 89 | 3 | 5 | 9 |
Model parameters for best performing random forest models.
|
| ||||
|---|---|---|---|---|
|
|
|
|
|
|
| cv=StratifiedKFold | n_splits=10 | n_splits=10 | n_splits=10 | n_splits=10 |
| estimator=RandomForestClassifier | Yes | Yes | Yes | Yes |
| bootstrap | True | True | True | True |
| max_depth | None | None | None | None |
| max_leaf_nodes | None | None | None | None |
| min_impurity_decrease | 0 | 0 | 0 | 0 |
| init_min_samples_leaf | 1 | 1 | 1 | 1 |
| init_min_samples_split | 2 | 2 | 2 | 2 |
| n_estimators | 200 | 200 | 200 | 200 |
| n_jobs | 4 | 4 | 4 | 4 |
| param_grid={'min_samples_leaf'} | [5, 10,..., 250] | [5, 10,..., 250] | [5, 10,..., 250] | [5, 10,..., 250] |
| param_grid={pre_dispatch} | 2*n_jobs | 2*n_jobs | 2*n_jobs | 2*n_jobs |
| param_grid={scoring} | roc_auc | roc_auc | roc_auc | roc_auc |
| optimal_min_samples_leaf | 5 | 30 | 10 | 5 |
| Threshold Bound | 0.20 | 0.05 | 0.30 | 0.40 |
Predictors and coefficients (i.e., odds ratios) of the best performing logistic regression models.
|
|
|
|
|
|
|---|---|---|---|---|
| Prior diagnosis CCS class | Neoplasms | 2.00 | - | - |
| Blood and blood-forming organs | - | 1.36 | - | |
| Infectious and parasitic | - | 1.18 | - | |
| Mental illness | 0.84 | - | - | |
| Symptoms, signs, ill-defined conditions | 0.79 | - | - | |
| Circulatory system | - | - | 0.81 | |
| Current diagnosis CCS class | Skin and subcutaneous tissue | 1.92 | 1.55 | 1.52 |
| Nervous system and sense organs | 1.26 | |||
| Respiratory system | 1.25 | |||
| Injury and poisoning | 1.15 | |||
| Infectious and parasitic | 1.07 | |||
| Genitourinary system | - | 1.18 | - | |
| Mental illness | - | 1.02 | ||
| Circulatory system | - | - | 0.85 | |
| Endocrine, nutritional, metabolic, immunity | 0.88 | - | 0.65 | |
| Neoplasms | 0.75 | - | 0.83 | |
| Recent 1-digit ICD 10 procedure | Other procedures | 1.76 | 1.80 | 1.76 |
| Extracorporeal/systemic | 1.30 | - | - | |
| Administration | - | - | 0.68 | |
| Medical and surgical | - | - | 0.90 | |
| Prior ICU stay | > 0 Days and <5 Days | 1.20 | 1.02 | - |
| 10–20 Days | - | 1.39 | - | |
| > 20 Days | 1.73 | - | - | |
| Prior 2-digit ICD 10 procedure | Medical/surgical gastrointestinal | 1.35 | - | 1.05 |
| Medical/surgical upper veins | - | 1.03 | - | |
| Medical/surgical respiratory | - | - | 1.33 | |
| Recent 2-digit ICD 10 procedure | Medical/surgical gastrointestinal | - | - | 1.32 |
| Medical/surgical anatomical | - | 1.33 | - | |
| Medical/surgical heart and vessels | 1.21 | - | - | |
| Administration circulatory | 1.27 | 1.28 | - | |
| Medical/surgical hepatobiliary | - | 1.13 | - | |
| Prior antibiotics use | Yes | 1.25 | - | - |
| Prior antibiotics Fluoro use | 1.28 | - | - | |
| Prior antibiotics Ceph use | - | 1.09 | - | |
| Number of different types used = 3 | 1.19 | - | - | |
| Number of recent procedures | ≤ 2 | 0.92 | - | 0.87 |
| > 2 and ≤ 5 | - | 0.91 | - | |
| > 5 and ≤ 10 | - | 0.89 | 0.96 | |
| Number of prior diagnosis | ≤ 10 | - | - | 0.91 |
| > 10 and ≤ 20 | - | 1.01 | - | |
| > 50 and ≤ 100 | - | 1.04 | - | |
| Number of prior procedures | > 20 | - | 1.31 | - |
| Number of current diagnosis | ≤ 10 | - | 0.72 | 0.67 |
| > 10 and ≤ 20 | 0.74 | 0.89 | - | |
| > 30 and ≤ 50 | 1.38 | 1.03 | - | |
| Admission type or source | Elective | 0.96 | 0.78 | - |
| Home or self referral | 0.73 | 0.83 | - | |
| Physician referral | 0.65 | 0.70 | - | |
| Race/ethinicity | Black | 0.91 | 0.94 | - |
| Sex | Female | - | - | 0.89 |
| Age group | Age 30–40 | - | 0.97 | - |
| Age 40–50 | 0.93 | - | - | |
| Long-term care facility stay | Yes | 1.69 | 1.95 | - |
Model parameters for best performing XGBoost models.
|
| ||||
|---|---|---|---|---|
|
|
|
|
|
|
| colsample_bytree | 0.8 | 0.8 | 0.8 | 0.8 |
| gamma | 0 | 0 | 0 | 0 |
| learning_rate | 0.05 | 0.05 | 0.05 | 0.05 |
| max_depth | 5 | 5 | 5 | 5 |
| min_child_weight | 1 | 1 | 1 | 1 |
| n_estimators | 200 | 200 | 200 | 200 |
| nthread | 4 | 4 | 4 | 4 |
| objective | binary:logistic | binary:logistic | binary:logistic | binary:logistic |
| seed | 1337 | 1337 | 1337 | 1337 |
| subsample | 0.8 | 0.8 | 0.8 | 0.8 |
| Threshold Bound | 0.15 | 0.015 | 0.10 | 0.30 |