| Literature DB >> 31581716 |
Kyoung Hwa Lee1, Jae June Dong2, Su Jin Jeong3, Myeong-Hun Chae4, Byeong Soo Lee5, Hong Jae Kim6, Sung Hun Ko7, Young Goo Song8.
Abstract
An adequate model for predicting bacteraemia has not yet been developed. This study aimed to evaluate the performance of an artificial neural network (ANN)-based prediction model in comparison with previous statistical models. The performance of multi-layer perceptron (MLP), a representative ANN model, was verified via comparison with a non-neural network model. A total of 1260 bacteraemia episodes were identified in 13,402 patients. In MLP with 128 hidden layer nodes, the area under the receiver operating characteristic curve (AUC) of the prediction performance was 0.729 (95% confidence interval [CI]; 0.712-0.728), while in MLP with 256 hidden layer nodes, it was 0.727 (95% CI; 0.713-0.727). In a conventional Bayesian statistical method, the AUC was 0.7. The aforementioned two MLP models exhibited the highest sensitivity (0.810). The ranking of clinical variables was used to describe the influential power of the prediction. Serum alkaline phosphatase was one of the most influential clinical variables, and one-out search was the best ranking method for measuring the influence of the clinical variables. Furthermore, adding variables beyond the 10 top-ranking ones did not significantly affect the prediction of bacteraemia. The ANN model is not inferior to conventional statistical approaches. Bacteraemia can be predicted using only the top 10 clinical variables determined by a ranking method, and the model can be used in clinical practice by applying real-time monitoring.Entities:
Keywords: artificial intelligence; artificial neural network; bacteraemia; machine learning; prediction
Year: 2019 PMID: 31581716 PMCID: PMC6832527 DOI: 10.3390/jcm8101592
Source DB: PubMed Journal: J Clin Med ISSN: 2077-0383 Impact factor: 4.241
Figure 1Flow chart of the study design and analysis. ANN: artificial neural network; LRP, layer-wise relevance propagation; MLP, multi-layer perceptron; RF, random forest; and SVM, support vector machine.
Comparison of the clinical characteristics of two groups classified according to bacteremia.
| Bacteremia, Yes ( | Bacteremia, No ( | ||
|---|---|---|---|
| Age, years | 64.4 ± 15.3 | 60.1 ± 17.3 | <0.001 |
| Sex, male | 558 (44.3%) | 533 (42.3%) | 0.306 |
| Vital sign | |||
| SBP, mmHg | 91.6 ± 23.2 | 99.4 ± 22.8 | <0.001 |
| Body temperature (min), °C | 36.1 ± 0.4 | 36.1 ± 0.3 | 0.578 |
| Body temperature (max), °C | 38.1 ± 0.9 | 37.6 ± 0.9 | <0.001 |
| Heart rate, beats/min | 115.7 ± 29.2 | 105.7 ± 28.5 | <0.001 |
| Respiratory rate, /min | 25.5 ± 8.6 | 23.6 ± 8.9 | <0.001 |
| Laboratory data | |||
| Creatinine, mg/dL | 1.6 ± 1.5 | 1.2 ± 1.3 | 0.317 |
| Albumin, g/dL | 2.8 ± 0.6 | 3.2 ± 0.7 | <0.001 |
| CRP, mg/L | 1.6 ± 1.5 | 1.2 ± 1.3 | <0.001 |
| Alkaline phosphatase, IU/L | 191.3 ± 213.7 | 136.0 ± 141.0 | <0.001 |
| WBC count (min), /μL | 10.3 ± 6.2 | 9.3 ± 5.5 | <0.001 |
| WBC count (max), /μL | 14.8 ± 8.9 | 11.9 ± 7.3 | <0.001 |
| Platelet count, /μL | 185.9 ± 134.8 | 218.7 ± 129.8 | <0.001 |
| Prothrombin time, s | 18.5 ± 8.9 | 15.9 ± 6.6 | <0.001 |
| Clinical information | |||
| Hospital day to blood culture, days | 24.6 ± 75.3 | 10.3 ± 17.6 | <0.001 |
| ICU care, yes | 410 (32.6%) | 235 (18.7%) | <0.001 |
| Central venous catheter, yes | 114 (9.1%) | 0 (0%) | <0.001 |
| Steroid therapy, yes | 477 (37.9) | 429 (34.0) | 0.045 |
| Antibiotic therapy, yes | 591 (46.9%) | 526 (41.7%) | 0.009 |
Data are expressed as mean ± standard deviation or n (%). Abbreviations: SBP, systolic blood pressure; DBP, diastolic blood pressure; CRP, C-reactive protein; WBC, white blood cell; and ICU, intensive care unit.
Results and performance of the bacteremia predictions using various machine learning techniques.
| Model | Algorithms | AUC (95% CI) | Sensitivity | Specificity |
|---|---|---|---|---|
| Previous | Naive Bayesian | 0.7 | - | - |
| A | MLP (128) | 0.729 (0.712–0.728) | 0.810 | 0.589 |
| B | MLP (256) | 0.727 (0.713–0.727) | 0.810 | 0.532 |
| C | RF | 0.732 (0.722–0.733) | 0.682 | 0.655 |
| D | SVM | 0.699 (0.687–0.700) | 0.692 | 0.553 |
| E | MLP (128) + MLP (256) + RF | 0.732 (0.728–0.735) | 0.793 | 0.571 |
Standard errors and 95% confidences were obtained from 10-fold cross-validation. Abbreviations: AUC, area under the receiver operating characteristic curve; SE, standard error; CI, confidence interval.
Figure 2Area under the receiver operating characteristic curve of the bacteremia predictions. A: MLP model with 128 nodes of the hidden layer, B: MLP model with 256 nodes of the hidden layer, C: RF model, D: support vector machine model, and E: combination model of the MLP and RF.
Influence of the ranking of the clinical variables on bacteremia predictions.
| Rank | One-Out Search | Gini Importance |
|---|---|---|
| 1 | ALP | Age |
| 2 | Platelet | Prothrombin time |
| 3 | Body temperature (max) | Heart rate |
| 4 | Systolic BP | Hospital day to blood culture |
| 5 | WBC count (min) | Albumin |
| 6 | ICU stay | ALP |
| 7 | CRP | Platelet |
| 8 | Central venous catheter | Body temperature (max) |
| 9 | Prothrombin time | WBC count (max) |
| 10 | Albumin | Creatinine |
| 11 | Steroid use | systolic BP |
| 12 | Sex | WBC count (min) |
| 13 | Antibiotic | Respiratory rate |
| 14 | Body temperature (min) | ICU stay |
| 15 | Hospital day to blood culture | Steroid |
| 16 | WBC count (max) | Sex |
| 17 | Heart rate | Body temperature (min) |
| 18 | Respiratory rate | Antibiotics |
| 19 | Age | CRP |
| 20 | Creatinine | Central venous catheter |
Abbreviations: ALP, alkaline phosphatase; BP, blood pressure; CRP, C-reactive protein.
Figure 3Changing trend of the area under the receiver operating characteristic curve for variables added or removed by ranking. (A) Analysis with removed variables according to ranking. (B) Analysis with added variables according to ranking. (C) Non-ANN analysis with removed variables according to ranking. (D) Non-ANN analysis with added variables according to ranking. GI, Gini importance; LRP, Layer-wise Relevance Propagation; OOS, one-out search. ANN, artificial neural network; MLP, multi-layer perceptron; RF, random forest; and SVM, support vector machine.