| Literature DB >> 33776495 |
Jie Song1, Yuan Gao2, Pengbin Yin3, Yi Li1, Yang Li2, Jie Zhang4, Qingqing Su1, Xiaojie Fu2, Hongying Pi5.
Abstract
PURPOSE: Build machine learning models for predicting pressure ulcer nursing adverse event, and find an optimal model that predicts the occurrence of pressure ulcer accurately. PATIENTS AND METHODS: Retrospectively enrolled 5814 patients, of which 1673 suffer from pressure ulcer events. Support vector machine (SVM), decision tree (DT), random forest (RF) and artificial neural network (ANN) models were used to construct the pressure ulcer prediction models, respectively. A total of 19 variables are included, and the importance of screening variables is evaluated. Meanwhile, the performance of the prediction models is evaluated and compared.Entities:
Keywords: adverse event; machine learning; pressure ulcer; risk management
Year: 2021 PMID: 33776495 PMCID: PMC7987326 DOI: 10.2147/RMHP.S297838
Source DB: PubMed Journal: Risk Manag Healthc Policy ISSN: 1179-1594
Figure 1Flowchart of the model construction. The data of patients suffering from pressure ulcer was selected within 48 hours before pressure ulcer occurred. For patients without pressure ulcer, the data within 48 hours between 24 hours after admission and before discharge was randomly selected. Then, the two sets of data were fully mixed and randomly divided into two parts, namely train set (n=2883) and Test set (n=2931). Two sets. The model learns features in the train set, without knowing the actual pressure ulcer occurrence. Cross-validation was performed in the train set and the model performance was evaluated on the Test set.
Figure 2Flow diagram of data inclusion. The figure shows the data sources, data selection process, inclusion and exclusion criteria for patients with and without pressure ulcer.
Hyperparameter Tuning in Models
| Model | Hyperparameter | Range | Increment | Final Setting |
|---|---|---|---|---|
| Minimum of number of instances per leaf | 2–18 | 2 | 2 | |
| Gamma | 0.01 | 0.01 | 0.02 | |
| Number of hidden neurons | 3–12 | 1 | 8 | |
| Mtry | 2–11 | 1 | 8 | |
| Number of trees | 100–500 | 100 | 500 |
General Information and Maternal Characteristics of Pressure Ulcer Patients
| Variables | Pressure Ulcer Group (n=1673) n/ X̄±s | No Pressure Ulcer Group (n=4141) n/ X̄±s | |
|---|---|---|---|
| Male | 1031 | 2415 | 0.020 |
| Female | 642 | 1726 | |
| 64.34±18.30 | 51.89±17.55 | <0.001 | |
| 165.82±13.15 | 163.54±16.09 | 0.003 | |
| 65.62±12.32 | 63.96±14.24 | 0.014 | |
| 1577.66±1917.74 | 1508.15±2221.84 | <0.001 | |
| 1786.26±1046.73 | 2105.70±1058.91 | <0.001 | |
| 36.78±0.53 | 36.62±0.64 | <0.001 | |
| 72.42±10.70 | 73.12±10.00 | <0.001 | |
| 9.22±2.44 | 8.61±2.39 | <0.001 | |
| 8.15±2.87 | 7.89±3.64 | <0.001 | |
| 279 (16.68%) | 215 (5.19%) | <0.001 | |
| 147 (8.79%) | 198 (4.78%) | <0.001 | |
| 108 (6.46%) | 188 (4.54%) | 0.003 | |
| 127 (7.59%) | 192 (4.64%) | <0.001 | |
| 72 (4.30%) | 113 (2.73%) | 0.002 | |
| 252 (15.06%) | 57 (1.38%) | <0.001 | |
| Total Score | 13.78±3.14 | 18.74±2.38 | <0.001 |
| Incontinence score | 3.38±0.90 | 3.96±0.31 | <0.001 |
| Activity score | 1.74±1.04 | 3.52±0.90 | <0.001 |
| Mind score | 3.75±0.70 | 3.94±0.33 | <0.001 |
| 1.33±0.95 | 0.27±0.57 | <0.001 | |
| 1019 (60.91%) | 1282 (30.96%) | <0.001 |
Logistic Regression Analysis on Important Variables
| Variable | Regression Coefficients | SE(b) | Wald value | OR value | 95% CI of OR value | ||
|---|---|---|---|---|---|---|---|
| Lower Limit | Upper Limit | ||||||
| −9.560 | 4.307 | 4.927 | 0.026 | ||||
| 0.019 | 0.004 | 28.776 | <0.001 | 1.019 | 1.012 | 1.027 | |
| 0.012 | 0.005 | 6.157 | 0.013 | 1.012 | 1.003 | 1.022 | |
| 0.626 | 0.238 | 6.921 | 0.009 | 1.870 | 1.173 | 2.981 | |
| 0.933 | 0.188 | 24.513 | <0.001 | 2.543 | 1.757 | 3.679 | |
| 0.381 | 0.188 | 4.092 | 0.043 | 1.463 | 1.012 | 2.116 | |
| −0.736 | 0.151 | 23.632 | <0.001 | 0.479 | 0.356 | 0.645 | |
| Total score | 2.138 | 0.308 | 48.081 | <0.001 | 8.486 | 4.637 | 15.532 |
| Incontinence score | −0.833 | 0.101 | 68.063 | <0.001 | 0.435 | 0.357 | 0.530 |
| Activity score | −1.145 | 0.069 | 274.010 | <0.001 | 0.318 | 0.278 | 0.364 |
| Mind score | 0.403 | 0.114 | 12.401 | <0.001 | 1.496 | 1.196 | 1.873 |
| −0.504 | 0.021 | 579.498 | <0.001 | 0.604 | 0.580 | 0.629 | |
| 0.539 | 0.079 | 46.286 | <0.001 | 1.714 | 1.467 | 2.002 | |
| 0.577 | 0.232 | 6.172 | 0.013 | 1.780 | 1.129 | 2.805 | |
| −0.595 | 0.183 | 10.540 | 0.001 | 0.552 | 0.385 | 0.790 | |
| 0.215 | 0.109 | 3.927 | 0.048 | 1.240 | 1.002 | 1.534 | |
| 0.011 | 0.004 | 6.536 | 0.011 | 1.011 | 1.002 | 1.019 | |
| 0.053 | 0.017 | 9.569 | 0.002 | 1.054 | 1.019 | 1.090 | |
| 0.435 | 0.190 | 5.254 | 0.022 | 1.545 | 1.065 | 2.241 | |
| 1.250 | 0.165 | 57.475 | <0.001 | 3.490 | 2.526 | 4.821 | |
Comparison of the Prediction Performance of the Four Pressure Ulcer Prediction Models
| Model (n=2931) | TP | TN | FP | FN | Accuracy | Recall | Precision | F1 Value | AUC | AUPRC |
|---|---|---|---|---|---|---|---|---|---|---|
| SVM | 770 | 2.070 | 41 | 50 | 94.94% | 93.90% | 96.90% | 94.42% | 0.9940 | 0.9103 |
| DT | 804 | 2.094 | 17 | 16 | 97.93% | 98.05% | 98.87% | 97.99% | 0.9960 | 0.9607 |
| RF | 814 | 2.075 | 1 | 1 | 99.88% | 99.88% | 99.93% | 99.88% | 0.9999 | 0.9910 |
| ANN | 648 | 2.016 | 172 | 95 | 79.02% | 87.21% | 90.89% | 82.92% | 0.9590 | 0.8542 |
| Norton scale | 437 | 1261 | 490 | 850 | 47.14% | 33.95% | 55.89% | 39.48% | 0.5709 | 0.8339 |
Abbreviations: TP, true positive; FN, false negative; TN, true negative; FP, false positive; AUC, area under the curve; AUPRC, area under the precision recall curve.
Figure 3Performance metrics of pressure ulcer prediction models on the test data set. Based on the prediction results of the model, the ROC curves are drawn, including SVM (A), DT (B), RF (C), and ANN (D). The results are compared with the ROC curve of Norton scale (E). Norton scale is inferior to the machine learning model in terms of ROC curve or AUC value. Graphically, DT and RF achieve similar performance, but RF obtains higher prediction accuracy in terms of AUC value.
Figure 4Histograms and reliability diagrams for models. (A–C) are the histograms and reliability diagrams before model calibration. The column (D) is the histogram of the model after calibration.