| Literature DB >> 21887234 |
Ying-Jui Chang1, Min-Li Yeh, Yu-Chuan Li, Chien-Yeh Hsu, Chao-Cheng Lin, Meng-Shiuan Hsu, Wen-Ta Chiu.
Abstract
BACKGROUND: Hospital-acquired infections (HAI) are associated with increased attributable morbidity, mortality, prolonged hospitalization, and economic costs. A simple, reliable prediction model for HAI has great clinical relevance. The objective of this study is to develop a scoring system to predict HAI that was derived from Logistic Regression (LR) and validated by Artificial Neural Networks (ANN) simultaneously. METHODOLOGY/PRINCIPALEntities:
Mesh:
Year: 2011 PMID: 21887234 PMCID: PMC3160843 DOI: 10.1371/journal.pone.0023137
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Variables Used for Statistical Analysis.
| Category | Variables | Remark |
| Demographic | Age | |
| Gender | ||
| Underlying health status | Diagnosis number at admission | Represented as complexity of disease |
| General anesthesia | Major surgical procedures and interventions | |
| Hemodialysis | Underlying healthy status | |
| Procedural | Arterial line | |
| Central venous catheterization | ||
| Endotracheal intubation | ||
| Tracheostomy | ||
| Nasogastric tube | ||
| Foley catheterization | ||
| Draining tubes | Chest tube, draining tube, double-lumen tube…etc. | |
| Therapeutical | Chemotherapeutic agents | Used for more than 3 days |
| Systemic Glucocorticosteroids | Used for more than 5 days | |
| Stress ulcer prophylaxes | H2 antagonists, sucralfate, and proton pump inhibitors used for more than 3 days | |
| Non-steroid anti-inflammatory drugs | Used for more than 3 days |
Univariate Analyses for Demographic and Clinical Data of Infection and Non-Infection Sets (N = 1,852).
| Infection | Non-infection | ||
| Variables | Coding | (N = 476) | (N = 1,376) |
| Age, years | 65.32±13.40 | 51.82±18.55 | |
| Diagnosis numbers at admission | 1.67±1.01 | 1.41±0.76 | |
| Gender | Male | 274 (57.6%) | 685 (49.8%) |
| Female | 202 (42.4%) | 691 (50.2%) | |
| General anesthesia | Yes | 222 (46.6%) | 368 (26.7%) |
| No | 254 (53.4%) | 1,008 (73.3%) | |
| Hemodialysis | Yes | 75 (17.3%) | 42 (3.1%) |
| No | 401 (82.7%) | 1,334 (96.9%) | |
| Arterial line | Yes | 216 (45.4%) | 59 (4.3%) |
| No | 260 (54.6%) | 1317 (95.7%) | |
| CVC | Yes | 296 (62.2%) | 89 (6.5%) |
| No | 180 (37.8%) | 1,287 (93.5%) | |
| Endotracheal intubation | Yes | 378 (79.4%) | 133 (9.7%) |
| No | 98 (20.6%) | 1,243 (90.3%) | |
| Tracheostomy | Yes | 107 (22.5%) | 17 (1.2%) |
| No | 369 (77.5%) | 1,359 (98.8%) | |
| NG tube | Yes | 420 (88.2%) | 151 (11.0%) |
| No | 56 (11.8%) | 1,225 (89.0%) | |
| Foley catheterization | Yes | 355 (74.6%) | 202 (19.8%) |
| No | 121 (25.4%) | 1,104 (80.2%) | |
| Draining tubes | Yes | 58 (12.2%) | 22 (1.6%) |
| No | 418 (87.8%) | 1,354 (98.4%) | |
| Chemotherapy | Yes | 24 (5.0%) | 19 (1.4%) |
| No | 452 (95.0%) | 1,357 (98.6%) | |
| Systemic Glucocorticosteroids | Yes | 143 (30.0%) | 42 (3.1%) |
| No | 333 (70.0%) | 1,334 (96.9%) | |
| Stress ulcer prophylaxes | Yes | 331 (69.5%) | 130 (9.4%) |
| No | 145 (30.5%) | 1,246 (90.6%) | |
| NSAID | Yes | 136 (28.6%) | 430 (31.3%) |
| No | 340 (71.4%) | 946 (68.8%) | |
Statistics of each variable between infection and non-infection sets,
Mean±SD,
*p<0.05.
Abbreviations: CVC = central venous catheter, NG = nasogastric, NSAID: non-steroid anti-inflammatory drug.
Definition of Variable Groups for Analysis.
| Setting | Variables | Remark |
| Group 1 | All 7 variables | Selected by final LR model |
| Group 2 | Foley catheter, NG tube and steroids | High odds ratio variables |
| Group 3 | Foley catheter, CVC, arterial line and NG tube | Medical devices |
| Group 4 | CVC, arterial line and stress ulcer prophylaxes | Low odds ratio variables |
| Group 5 | Hemodialysis, stress ulcer prophylaxes and steroids | Underlying condition and medications |
Abbreviations: CVC = central venous catheter, NG = nasogastric, LR = logistic regression.
Coefficients of the Logistic Regression Model (N = 927).
| Coefficient (β) | SE | OR | 95% CI | p value | |
| NG tube | −2.594 | 0.317 | 13.389 | 7.193–24.924 | <0.0001 |
| Steroids | −1.975 | 0.415 | 7.207 | 3.195–16.256 | <0.0001 |
| Foley catheter | −1.812 | 0.282 | 6.125 | 3.524–10.645 | <0.0001 |
| Hemodialysis | −1.527 | 0.528 | 4.606 | 1.637–12.958 | 0.004 |
| Stress ulcer prophylaxes | −1.368 | 0.303 | 3.928 | 2.171–7.108 | <0.0001 |
| Arterial line | −0.950 | 0.432 | 2.586 | 1.109–6.032 | 0.028 |
| CVC | −0.753 | 0.374 | 2.123 | 1.021–4.415 | 0.044 |
| Constant | 6.518 | 0.754 | - | - | - |
Abbreviations: SE = standard error, OR = odds ratio, CI: confidence interval, CVC = central venous catheter, NG = nasogastric.
Figure 1The Optimal Network Architecture of The Artificial Neural Network.
A multilayer perceptron with 16 input nodes and 13 hidden nodes in the network.
Figure 2Comparison of The Area Under the Receiver Operating Characteristic Curves (AUCs) in Training Set (N = 927).
All variables were included in artificial neural network (ANN) model and 7 variables were included in logistic regression (LR) model. The AUCs for ANN and LR are 0.995±0.003 and 0.966±0.008 respectively (p<0.001).
Comparison of ANN and LR in Training Set, % (N = 927).
| Models | Accuracy | Sens. | Spec. | PPV | NPV | AUC | p value | p value | p value |
| Group 1 | |||||||||
| ANN | 95.04 | 97.06 | 96.52 | 90.6 | 99.0 | 0.995±0.003 | - | - | |
| LR | 91.05 | 93.7 | 91.0 | 78.2 | 97.7 | 0.966±0.008 | <0.001 | - | |
| Group 2 | |||||||||
| ANN | 89.97 | 92.44 | 89.7 | 75.6 | 97.2 | 0.949±0.010 | - | <0.001 | |
| LR | 89.32 | 92.44 | 89.7 | 75.6 | 97.2 | 0.947±0.010 | 0.796 | 0.005 | |
| Group 3 | |||||||||
| ANN | 90.51 | 91.60 | 87.37 | 71.5 | 96.8 | 0.948±0.010 | - | <0.001 | |
| LR | 90.40 | 90.34 | 89.40 | 74.7 | 96.4 | 0.953±0.010 | 0.183 | 0.054 | |
| Group 4 | |||||||||
| ANN | 86.62 | 85.29 | 85.78 | 67.4 | 94.4 | 0.884±0.015 | - | <0.001 | |
| LR | 86.73 | 84.45 | 86.65 | 68.6 | 94.2 | 0.886±0.015 | 0.490 | <0.001 | |
| Group 5 | |||||||||
| ANN | 85.33 | 82.35 | 86.36 | 67.6 | 93.4 | 0.867±0.016 | - | <0.001 | |
| LR | 85.33 | 82.35 | 86.36 | 67.6 | 93.4 | 0.867±0.016 | 1.000 | <0.001 |
Mean±SE,
comparison with same variables set,
comparison with ANN model,
comparison with LR model.
Group 1: all variables.
Group 2: high odds ratio variables (Foley, nasogastric tube and steroids).
Group 3: medical devices as variables (Foley, CVC catheter, arterial line and nasogastric tube).
Group 4: low odds ratio variables (CVC catheter, arterial line and stress ulcer prophylaxes).
Group 5: underlying condition and medications as variables (hemodialysis, stress ulcer prophylaxes and steroids).
Abbreviations: ANN = artificial neural network, LR = logistic regression, Sens. = sensitivity, Spec. = specificity, PPV = positive predictive value, NPV = negative predictive value, AUC = area under the receiver operating characteristic curve.
Comparison of ANN and LR in Internal Validation, % (N = 461).
| Models | Accuracy | Sens. | Spec. | PPV | NPV | AUC | p value | p value | p value |
| Group 1 | |||||||||
| ANN | 90.24 | 96.64 | 85.96 | 70.6 | 98.7 | 0.964±0.012 | - | - | |
| LR | 91.54 | 92.44 | 91.52 | 79.1 | 97.2 | 0.969±0.011 | 0.507 | - | |
| Group 2 | |||||||||
| ANN | 90.02 | 90.76 | 86.55 | 70.1 | 96.4 | 0.949±0.014 | - | 0.177 | |
| LR | 87.64 | 90.76 | 86.55 | 70.1 | 96.4 | 0.952±0.014 | 0.574 | 0.024 | |
| Group 3 | |||||||||
| ANN | 91.54 | 92.44 | 86.84 | 71.0 | 97.1 | 0.949±0.014 | - | 0.205 | |
| LR | 91.76 | 96.64 | 85.96 | 70.6 | 98.7 | 0.959±0.013 | 0.095 | 0.295 | |
| Group 4 | |||||||||
| ANN | 86.98 | 81.51 | 89.18 | 72.4 | 93.3 | 0.873±0.022 | - | <0.001 | |
| LR | 87.64 | 80.67 | 90.06 | 73.8 | 93.1 | 0.876±0.022 | 0.553 | <0.001 | |
| Group 5 | |||||||||
| ANN | 85.47 | 71.43 | 90.35 | 72.0 | 90.1 | 0.829±0.025 | - | <0.001 | |
| LR | 85.47 | 71.43 | 90.35 | 72.0 | 90.1 | 0.829±0.025 | 1.000 | <0.001 |
Mean±SE,
comparison with same variables set,
comparison with ANN model,
comparison with LR model.
Group 1: all variables.
Group 2: high odds ratio variables (Foley, nasogastric tube and steroids).
Group 3: medical devices as variables (Foley, CVC catheter, arterial line and nasogastric tube).
Group 4: low odds ratio variables (CVC catheter, arterial line and stress ulcer prophylaxes).
Group 5: underlying condition and medications as variables (hemodialysis, stress ulcer prophylaxes and steroids).
Abbreviations: ANN = artificial neural network, LR = logistic regression, Sens. = sensitivity, Spec. = specificity, PPV = positive predictive value, NPV = negative predictive value, AUC = area under the receiver operating characteristic curve.
Figure 3Comparison of The Area Under the Receiver Operating Characteristic Curves (AUCs) in Internal Validation Set (N = 461).
The comparison of AUCs between different variable groups of artificial neural network (ANN-1, ANN-3, ANN-5 for Group 1, 3, 5 respectively) and logistic regression (LR-1, LR-2, LR-3 respectively) in internal validation.
Comparison of ANN, LR and Scoring in External Validation, % (N = 2,500).
| Models | Accuracy | Sens. | Spec. | AUC | p value |
| Group 1 | |||||
| ANN | 96.12 | 82.76 | 78.15 | 0.850±0.045 | - |
| LR | 98.76 | 82.76 | 80.90 | 0.870±0.043 | 0.447 |
| Score | 91.24 | 68.97 | 91.50 | 0.871±0.043 | 0.362 |
| Group 3 | |||||
| ANN | 95.44 | 72.41 | 84.66 | 0.820±0.048 | - |
| LR | 98.52 | 75.86 | 81.63 | 0.831±0.047 | 0.521 |
| Score | 92.24 | 62.07 | 92.59 | 0.830±0.047 | 0.524 |
| Group 5 | |||||
| ANN | 94.28 | 68.97 | 86.16 | 0.791±0.050 | - |
| LR | 98.84 | 68.97 | 86.16 | 0.792±0.050 | 0.929 |
| Score | 84.44 | 68.97 | 84.62 | 0.791±0.050 | 0.967 |
Mean±SE,
comparison with ANN model.
Group 1: all variables.
Group 3: medical devices as variables (Foley, CVC catheter, arterial line and nasogastric tube).
Group 5: underlying condition and medications as variables (hemodialysis, stress ulcer prophylaxes and steroids).
Abbreviations: ANN = artificial neural network, LR = logistic regression, Sens. = sensitivity, Spec. = specificity, AUC = area under the receiver operating characteristic curve.
Figure 4Comparison of The Area Under the Receiver Operating Characteristic Curves (AUCs) in External Validation Set (N = 2,500).
The comparison of AUCs between different variable groups of artificial neural network (ANN-1, ANN-3, ANN-5 for Group 1, 3, 5 respectively), logistic regression (LR-1, LR-2, LR-3 respectively) and scoring system (Score-1, Score-3, Score-5 respectively) in external validation.