Literature DB >> 33776495

The Random Forest Model Has the Best Accuracy Among the Four Pressure Ulcer Prediction Models Using Machine Learning Algorithms.

Jie Song¹, Yuan Gao², Pengbin Yin³, Yi Li¹, Yang Li², Jie Zhang⁴, Qingqing Su¹, Xiaojie Fu², Hongying Pi⁵.

Abstract

PURPOSE: Build machine learning models for predicting pressure ulcer nursing adverse event, and find an optimal model that predicts the occurrence of pressure ulcer accurately. PATIENTS AND METHODS: Retrospectively enrolled 5814 patients, of which 1673 suffer from pressure ulcer events. Support vector machine (SVM), decision tree (DT), random forest (RF) and artificial neural network (ANN) models were used to construct the pressure ulcer prediction models, respectively. A total of 19 variables are included, and the importance of screening variables is evaluated. Meanwhile, the performance of the prediction models is evaluated and compared.
RESULTS: The experimental results show that the four pressure ulcer prediction models all achieve good performance. Also, the AUC values of the four models are all greater than 0.95. Besides, the comparison of the four models indicates that RF model achieves a higher accuracy for the prediction of pressure ulcer.
CONCLUSION: This research verifies the feasibility of developing a management system for predicting nursing adverse event based on big data technology and machine learning technology. The random forest and decision tree model are more suitable for constructing a pressure ulcer prediction model. This study provides a reference for future pressure ulcer risk warning based on big data.

Entities: Chemical

Keywords: adverse event; machine learning; pressure ulcer; risk management

Year: 2021 PMID： 33776495 PMCID： PMC7987326 DOI： 10.2147/RMHP.S297838

Source DB: PubMed Journal: Risk Manag Healthc Policy ISSN： 1179-1594

Introduction

Pressure ulcer, also known as pressure injury, refers to the localized injury of the skin and/or subcutaneous tissue, which usually occurs at the bone protuberance, and the part in contact with medical equipment or other equipment. It can be expressed as intact skin or open ulcers and may be accompanied with pain.1 According to related research reports, the incidence of pressure ulcer in hospital is usually 2%~5%, and the incidence of tape avulsion in elderly patients is as high as 15%.2,3 Pressure ulcer can destroy the integrity of the skin, increase the risk of infection, and is difficult to heal. The high incidence, serious hazards and complex causes have made pressure ulcer a challenging issue, which attracts continuous attention in clinical care. In addition to the serious medical complications, people suffering from pressure ulcer are also faced with 229 kinds of physiological, social and psychological effects, which significantly affects the quality of life. The clinical intervention of pressure ulcer focuses on prevention. Research suggests that through dynamic monitoring and effective management, the occurrence of pressure ulcer can be successfully prevented. Therefore, providing an effective early warning model of pressure ulcer to assist clinicians and nurses in making timely predictions and taking corresponding measures is of great value for the prevention of pressure ulcer. It is identified that pressure ulcer is related to many risk factors, including continuous local pressure, hospital stay, long-term bed rest, neurological changes, etc.4 Meanwhile, there are newly discovered factors every year. The Braden-Norton-Waterlow pressure ulcer assessment scale has been widely used in clinics and has made significant contributions to the management of pressure ulcer. However, the specificity and sensitivity of these scales are low, and there is still no evidence that these scales can effectively predict the occurrence of pressure ulcer. Recently, the big data and machine learning technology have undergone a fast-growing. These new technologies are able to directly extract data from the medical system for real-time analysis, and the accuracy of analysis is improved by huge amount of data.5 In this case, these technologies are expected to solve many problems clinics, such as the pressure ulcer management. This study aims to predict the pressure ulcer adverse events of inpatients through the machine learning technology. Firstly, the data of pressure ulcer in inpatients are analyzed and are associated with the electronic case data system. A logistic regression model is then used to identify 19 independent risk factors. These risk factors are input into four commonly machine learning algorithms to construct the prediction model. Finally, experimental results of the machine learning algorithms are investigated to find the algorithm with the best prediction performance.

Patients and Methods

Population

A total of 1839 patients are retrospectively included in the First Medical Center of the Chinese People’s Liberation Army General Hospital during the period of hospitalization from January 1, 2013 to December 31, 2016. The included patients were over 18 years old and met the 2009 NPUAP pressure ulcer diagnostic criteria (the criteria adopted by the hospital). The false-positive data and suspicious cases as well as pressure ulcer events that occurred before hospitalization and within 24 hours after admission were excluded. In this study, the standard for defining false-positive cases is that after the nurse reports the patient’s pressure ulcer event, the pressure ulcer management team composed of three doctors contact the patient to diagnose the pressure ulcer, and give the final diagnosis result for pressure ulcer. If the result is consistent with that reported by the first-line nurse, the case is considered as pressure ulcer, otherwise, the case is considered as false positive. If the result cannot be defined, the case is considered as suspected. Finally, 1673 patients with pressure ulcer and 4141 patients without pressure ulcer were included.

Primary Outcome

The diagnostic definition of pressure ulcer in the adverse event reporting system of Chinese medicine hospital is based on the 2009 NPUAP Quick Reference Guide, where pressure ulcer refers to the local damage of the skin or/and subcutaneous tissue, usually located at the bone protrusion.6 This kind of damage is generally caused by pressure or pressure combined with shear force. However, considering recent in-depth research on pressure ulcer, NPUAP has updated the definition of pressure ulcer. After comparing the diagnostic criteria of pressure ulcer in 2009 and 2019, all included cases meet the latest diagnostic criteria.

Variables

The predictor of pressure ulcer mainly includes general demographic data, basic vital signs data, medical care measures, disease-related data, and nursing evaluation items. Indicators are measured every day, all using the average value within 48 hours before the pressure ulcer occurs. In this study, the time of the pressure ulcer occurrence is randomly distributed. To match the time randomness of the pressure ulcer occurrence, the average data within 48 hours are randomly selected for analysis by computer (see Figure 1) during the period from 24 hours after admission to the discharge of patients in the control group.

Figure 1

Flowchart of the model construction. The data of patients suffering from pressure ulcer was selected within 48 hours before pressure ulcer occurred. For patients without pressure ulcer, the data within 48 hours between 24 hours after admission and before discharge was randomly selected. Then, the two sets of data were fully mixed and randomly divided into two parts, namely train set (n=2883) and Test set (n=2931). Two sets. The model learns features in the train set, without knowing the actual pressure ulcer occurrence. Cross-validation was performed in the train set and the model performance was evaluated on the Test set. The electronic medical system in the hospital records complete general demographic data, including gender, age (y), height (cm), and weight (kg). The basic vital signs review the data recorded in the “nursing workstation”, which is developed by the hospital to inspect patients and performs nursing records. The inspection includes total intake (mL/day), total output (mL/day), body temperature (°C), systolic blood pressure (mmHg), and blood glucose (mmol/L). In the electronic medical system, the medical care measures taken by the patient before the occurrence of pressure ulcer were found, including the length of stay (day), whether to stay in bed, whether to use restraint bands and whether to undergo surgery. These measures are very related to the pressure ulcer of hospitalized patients caused by medical equipment. According to the disease diagnosis in the electronic medical record, we paid attention to whether the patient has diarrhea, diabetes, fractures and other related records. These variables are all related to the occurrence of pressure ulcer, and we incorporated these variables into the model for better accuracy. Inpatients will undergo daily pressure ulcer assessments, passive turn-over status records and nutritional assessments. These data will be recorded in the medical system in the form of electronic documents. The data is extracted and the average value of pressure ulcer within 48 hours before the occurrence of pressure ulcer is used. The evaluation of pressure ulcer in this study uses Norton pressure ulcer assessment (Norton scale). The sub-items of the evaluation scale consist of nutritional status score, mentality score, activity score, walking score, urinary and incontinence score. Further retrospective analysis found that some items have significance for predictions. Therefore, the incontinence score, activity score, mentality score, and the total score of the pressure ulcer assessment scale are used in the predictors (all scores). This study also counts the total scores of patients’ nutritional risks, and the nutritional assessment follows the NRS2002 Nutritional Risk Assessment Scale (Nutritional Assessment). Since position changes can reduce the risk of pressure ulcer, the patients that are assessed by clinical nurses to have risk of pressure ulcer will receive nursing services to change their positions every 2 hours, and the patients receiving care are recorded. According to the documents and literature investigated by the hospital, 108 features were obtained from the hospital digital medical record database, including 5 general demographic data, 10 basic vital signs data, 34 medical care measures, 21 disease-related data, 8 nursing evaluation items, and 30 types of drugs. From these data, 19 features related to the occurrence of pressure ulcer were obtained through the logistic regression model.

Data Processing

Missing Value Filling

The data preparation in this study includes data elimination, missing value filling, as well as data format and unit unification, which was performed by two professionals good at biomedical information data processing. 7356 cases of data were extracted from the digital medical record database SQL server, including 1839 patients in the pressure ulcer group (PU) and 5517 patients in the non-pressure ulcer group (No-PU) (see Figure 2). To verify the accuracy of the data extracted from the digital medical record database, the value and timestamps in the extracted data were compared with that displayed in the clinician’s electronic health record manually. As for implementation of the fully developed query for all manually validated cases, consistent values and timestamps (within 10 minutes) were obtained for all 7356 cases (100% agreement). Besides, the individual variables were cleaned by Stata 13 software (StataCorp LLC), and some non-conforming data were further excluded, including cases without data on the day of occurrence of pressure ulcer, repeated reported cases, unstructured data that cannot be processed and non-related data, as well as cases with more than 10% missing data. A total of 1184 unqualified data were eliminated, including 298 cases with pressure ulcer and 886 cases without pressure ulcer. In addition, the missing values were filled in the data. Specifically, on the day when the pressure ulcer event is reported, the average value of the corresponding structural data in the current natural day was taken; if there is no data on the day of reporting, the average value of the continuous data or the mode of non-continuous data in the previous three days was taken; if there no appropriate value can be used, the case is eliminated. A total of 358 cases were eliminated in the second step. Also, 316 cases with missing values were filled and 6.3% of the data was repaired. Finally, the data format and unit were unified.

Figure 2

Flow diagram of data inclusion. The figure shows the data sources, data selection process, inclusion and exclusion criteria for patients with and without pressure ulcer.

Data Segmentation

The data of 1673 patients with pressure ulcer event and 4141 patients without pressure ulcer event were mixed and merged. For machine learning algorithm using cross-validation as the evaluation method, the training data are split to realize a uniform sampling in the factor analysis.7 Therefore, 50% of the patient data were randomly selected for training (n = 2883), and the other 50% were used for testing (n = 2931) (shown in Figure 1).

Model Fitting and Development

The construction of the model includes three stages: 1) Create a data set with no missing values; 2) Find 19 important variables through logistic regression; 3) Build a machine learning model based on the 19 variables. The selection of important variables in stage 2 is fundamental. Though the predictive variables can be analyzed based on clinical experience, it is impossible to determine the variables that are actually important. Therefore, this study used a logistic regression model to verify the importance of the variables at the statistical level (selected P<0.05 variables), thus the best set of predictor variables can be found. By weighing the number of main variables, a balance is achieved between the model complexity (mainly dependent on the number of variables) and the model reliability. Four methods are used to build prediction models, including support vector machine (SVM), decision tree (DT), random forest (RF) and artificial neural network (ANN) models. The performance of these four methods is compared. The DT-based model in this study uses the C5.0 algorithm with a minimum number of leaf nodes, which avoids the problem of too many branches in the ID3 algorithm. Also, pruning is performed during the construction of the decision tree to discretize continuous data, and the limit is set to the maximum number of leaf nodes. SVM-based model uses Gaussian inner product as the kernel function (SVM-Kernel). Through the iterative solution of sub-problems, the prediction of large-scale problems is finally completed. The gamma parameter in the model is set to 0.024. ANN is divided into an input layer, an output layer and a hidden layer. The information is collected through the input layer, and the data is input into the hidden layer for analysis and processing.8 This study uses a multi-layer perceptron (MLP) model with a single hidden layer, and the initial learning rate is 0.3. In a random forest (RF) model, the entire random forest is composed of 500 decision trees (ntree = 500), and each decision tree randomly selects 8 variables (mtry = 8) from 40 variables to build a decision tree. illustrates the source code.

Validation

As mentioned before, all data is divided into training set (n=2883) and test set (n=2931). In the training stage, k-fold cross-validation was used to train the model. The training set was divided into 10 parts through non-repetitive sampling. Each time, 9 parts were extracted for model training, and the remaining 1 part was used for model verification. The process was repeated for 10 times, obtaining 10 different models. The test set (n=2931) was used for testing to get the average of the output results as the final indicator (shown in Figure 1). Then, another 50% of the test set (n=2931) was entered to test the performance of the model. The prediction output by the model was compared with the actual diagnosis result to obtain the final result. The prediction performance of models can be significantly influenced by the hyperparameter setting, such as DT’s minimum of number of instances per leaf, SVM’s gamma, ANN’s number of hidden neurons and RF’s mtry and number of trees. The details of the parameter tuning for each model are listed in Table 1.

Table 1

Hyperparameter Tuning in Models

Model	Hyperparameter	Range	Increment	Final Setting
DT	Minimum of number of instances per leaf	2–18	2	2
SVM	Gamma	0.01	0.01	0.02
ANN	Number of hidden neurons	3–12	1	8
RF	Mtry	2–11	1	8
RF	Number of trees	100–500	100	500

Hyperparameter Tuning in Models

Model Performance

The Norton scale was used to evaluate the risk of the data in the test group through a two-person team. Specifically, the high risk had a low Norton score (≤15) and an intermediate Norton score (16–18), and the low risk had a high Norton score (≥19).9,10 The scoring method is based on previous research. The score ≤18 is considered as predictive positive, and the score >18 is considered as predictive negative. The predictive result of Norton scale is calculated following the same performance evaluation method used by the machine learning model. The result is compared with that of the four machine prediction models developed in this research. The common performance indicators in machine learning including accuracy, recall, precision, F1 value, and ROC curve area (Area Under Curve, AUC) are used in this study. The accuracy of the model was evaluated by calculating the confusion matrix and comparing the ROC curve. Then, we performed model calibration and evaluated the performance of the model after calibration by comparing histograms and reliability diagrams. SPSS V.22.0 software (IBM, Armonk, New York, USA) was used for descriptive statistics.1 All model analysis was also performed in R language (version 2.9.0 for Windows, ).

Results

Basic Features and Potential Predictors

Information of the included patients is shown in Table 1. 1673 patients (28.78%) had pressure ulcer, and their average age was 64.34. The proportion of wards vs ICU was 1504 to 169. Their average length of stay was 8.15 days. 4141 patients (71.22%) did not have pressure ulcer, and their average age was 51.89. The proportion of wards vs ICU was 3557 to 564. The average length of stay was 7.89 days (see Table 2). Multivariate logistic regression analysis indicated that a total of 19 important variables of pressure ulcer risk factors exhibited significant statistical differences. These factors are listed in Table 3.

Table 2

General Information and Maternal Characteristics of Pressure Ulcer Patients

Variables	Pressure Ulcer Group (n=1673) n/ X̄±s	No Pressure Ulcer Group (n=4141) n/ X̄±s	P
Gender
Male	1031	2415	0.020
Female	642	1726
Age (year)	64.34±18.30	51.89±17.55	<0.001
Height (cm)	165.82±13.15	163.54±16.09	0.003
Weight (kg)	65.62±12.32	63.96±14.24	0.014
Total intake (mL)	1577.66±1917.74	1508.15±2221.84	<0.001
Total output (mL)	1786.26±1046.73	2105.70±1058.91	<0.001
Body temperature (°C)	36.78±0.53	36.62±0.64	<0.001
Systolic blood pressure (mmHg)	72.42±10.70	73.12±10.00	<0.001
Blood glucose	9.22±2.44	8.61±2.39	<0.001
Length of stay (days)	8.15±2.87	7.89±3.64	<0.001
Stay in bed	279 (16.68%)	215 (5.19%)	<0.001
Restraint bands	147 (8.79%)	198 (4.78%)	<0.001
Surgery	108 (6.46%)	188 (4.54%)	0.003
Diarrhea	127 (7.59%)	192 (4.64%)	<0.001
Diabetes	72 (4.30%)	113 (2.73%)	0.002
Fracture	252 (15.06%)	57 (1.38%)	<0.001
Norton pressure ulcer assessment
Total Score	13.78±3.14	18.74±2.38	<0.001
Incontinence score	3.38±0.90	3.96±0.31	<0.001
Activity score	1.74±1.04	3.52±0.90	<0.001
Mind score	3.75±0.70	3.94±0.33	<0.001
Nutritional Assessment	1.33±0.95	0.27±0.57	<0.001
Acceptance of passive turning over	1019 (60.91%)	1282 (30.96%)	<0.001

Table 3

Logistic Regression Analysis on Important Variables

Variable	Regression Coefficients	SE_(b)	Wald value	P value	OR value	95% CI of OR value
Variable	Regression Coefficients	SE_(b)	Wald value	P value	OR value	Lower Limit	Upper Limit
Constant	−9.560	4.307	4.927	0.026
Age	0.019	0.004	28.776	<0.001	1.019	1.012	1.027
Weight	0.012	0.005	6.157	0.013	1.012	1.003	1.022
Diarrhea	0.626	0.238	6.921	0.009	1.870	1.173	2.981
Stay in bed	0.933	0.188	24.513	<0.001	2.543	1.757	3.679
Restraint bands	0.381	0.188	4.092	0.043	1.463	1.012	2.116
Surgery	−0.736	0.151	23.632	<0.001	0.479	0.356	0.645
Norton pressure ulcer assessment
Total score	2.138	0.308	48.081	<0.001	8.486	4.637	15.532
Norton pressure ulcer assessment
Incontinence score	−0.833	0.101	68.063	<0.001	0.435	0.357	0.530
Norton pressure ulcer assessment
Activity score	−1.145	0.069	274.010	<0.001	0.318	0.278	0.364
Norton pressure ulcer assessment
Mind score	0.403	0.114	12.401	<0.001	1.496	1.196	1.873
Acceptance of passive turning over	−0.504	0.021	579.498	<0.001	0.604	0.580	0.629
Nutritional assessment	0.539	0.079	46.286	<0.001	1.714	1.467	2.002
Total intake	0.577	0.232	6.172	0.013	1.780	1.129	2.805
Total output	−0.595	0.183	10.540	0.001	0.552	0.385	0.790
Body temperature	0.215	0.109	3.927	0.048	1.240	1.002	1.534
Systolic blood pressure	0.011	0.004	6.536	0.011	1.011	1.002	1.019
Blood glucose	0.053	0.017	9.569	0.002	1.054	1.019	1.090
Diabetes	0.435	0.190	5.254	0.022	1.545	1.065	2.241
Fracture	1.250	0.165	57.475	<0.001	3.490	2.526	4.821

General Information and Maternal Characteristics of Pressure Ulcer Patients Logistic Regression Analysis on Important Variables

Performance Evaluation of Four Prediction Models

The test results of the four models are listed in Table 3. The SVM model achieves an accuracy of 94.94%, a recall of 93.90%, a precision of 96.90%, and an F1 value of 94.42%. Compared with the SVM model, the DT model obtains better performance in all indicators. Especially, the F1 value is 3.57% higher than that of the SVM model. The RF model achieves the best performance, with an accuracy of 99.88%, a recall of 99.88%, a precision of 99.93% and an F1 value of 99.88%. The excessive high value of the indicators may be related to the overfitting of this model. The indicators of ANN obtain the lowest value among the four models, showing an accuracy of 79.02%, a recall of 87.21%, a precision of 90.89%, and an F1 value of 82.92%. All four machine learning models exhibited an improved performance compared with Norton scale, suggesting that the accuracy of Norton scale for predicting pressure ulcer is worse than that of machine learning model. Indeed, the accuracy and precision of Norton scale are both low. In addition, the AUCs of all four prediction models are higher than 0.95, indicating good fitting effect (see Table 4).

Table 4

Comparison of the Prediction Performance of the Four Pressure Ulcer Prediction Models

Model (n=2931)	TP	TN	FP	FN	Accuracy	Recall	Precision	F1 Value	AUC	AUPRC
SVM	770	2.070	41	50	94.94%	93.90%	96.90%	94.42%	0.9940	0.9103
DT	804	2.094	17	16	97.93%	98.05%	98.87%	97.99%	0.9960	0.9607
RF	814	2.075	1	1	99.88%	99.88%	99.93%	99.88%	0.9999	0.9910
ANN	648	2.016	172	95	79.02%	87.21%	90.89%	82.92%	0.9590	0.8542
Norton scale	437	1261	490	850	47.14%	33.95%	55.89%	39.48%	0.5709	0.8339

Abbreviations: TP, true positive; FN, false negative; TN, true negative; FP, false positive; AUC, area under the curve; AUPRC, area under the precision recall curve.

Comparison of the Prediction Performance of the Four Pressure Ulcer Prediction Models Abbreviations: TP, true positive; FN, false negative; TN, true negative; FP, false positive; AUC, area under the curve; AUPRC, area under the precision recall curve. ROC curves of the four models are shown in Figure 3. According to the results, it can be seen that the prediction performance of the four models is quite different, while all the models exhibit acceptable ROC curves and prediction efficiency. Meanwhile, compared with the ANN model, the other three models show higher predictive accuracy and diagnostic value. Besides, the RF model achieves the best prediction performance among the four models. According to the ROC curves, the four machine learning models achieve higher prediction accuracy than the conventional Norton scale (shown in Figure 3). This may be attributed to the multi-variate nature of machine learning model and its training process. Meanwhile, different machine learning models show consistent accuracies, indicating that the prediction performance of machine learning can be improved by data processing. In addition, traditional evaluations are inferior to the algorithms based on feature values, and the reason may be that the scale was developed earlier and did not incorporate multiple variables. The machine learning technology has advantages in adjustable algorithm and real-time data.

Figure 3

Performance metrics of pressure ulcer prediction models on the test data set. Based on the prediction results of the model, the ROC curves are drawn, including SVM (A), DT (B), RF (C), and ANN (D). The results are compared with the ROC curve of Norton scale (E). Norton scale is inferior to the machine learning model in terms of ROC curve or AUC value. Graphically, DT and RF achieve similar performance, but RF obtains higher prediction accuracy in terms of AUC value. The result of model calibration is shown in Figure 4. The reliability diagrams of ANN, DT, SVM, and RF all show an S-shape, which are significantly improved after calibration. After calibration, the predicted value of output tends to be distributed in two levels of 0 and 1. Model calibration can significantly improve the performance of pressure ulcer prediction models. Moreover, the RF model still has the best performance after calibration. Model calibration plays an important role in improving the performance of the model, which will make the model more useful in clinical applications, especially as a clinical decision support tool for ulcer risk scoring.

Figure 4

Histograms and reliability diagrams for models. (A–C) are the histograms and reliability diagrams before model calibration. The column (D) is the histogram of the model after calibration.

Discussion

This study presents a prediction model constructed based on retrospective data. The characteristics of high-risk factors within 48 hours before the occurrence of pressure ulcer are taken into consideration. After weighting these high-risk variables, the model training is constructed and implemented, and an optimal model is selected. The output result gives the probability of pressure ulcer occurring within 48 hours of the data time point (shown in Figure 1). Therefore, when the model captures the data of high-risk variables, it can predict whether the target patient will suffer from pressure ulcer in the next 48 hours. The related variables related to the occurrence of pressure ulcer in patients are found in this study, including age, weight, total intake, total output, body temperature, systolic blood pressure, blood glucose, diarrhea, stay in bed, restraint bands, surgery, total score of pressure ulcer assessment, acceptance of passive turning over, nutritional assessment, diabetes, and fracture. Several key variables such as age, weight, total intake, total output, body temperature, systolic blood pressure and blood glucose are consistent with those reported in other studies.11–14 Systolic blood pressure is highly correlated with pressure ulcers, which may be related to the patient’s hemodynamic factors.15 Diarrhea is an important factor, and it is closely related to pressure ulcer, especially the one occurred around anus.16 It promotes flushing, edema, and pain of the perianal skin, and can even cause skin ulcers and infections. Bedridden patients are easy to suffer from localized damage to the skin and subcutaneous tissues.17 This study also found a correlation between bed rest and pressure ulcer. According to a multicenter survey, 31.4% of long-term bedridden patients suffer from pressure ulcer among hospitalized patients.18 The incidence of pressure ulcer in bedridden patients ranges from 13.3% to 57.6%.19–23 The average time of stay in surgery is about 8.6 days (slightly less than the hospital level of 9.1 days in China), which is consistent with the researches.24–26 This study found that surgery is related to pressure ulcer, and Aloweni et al confirmed this result. Other studies have confirmed that the limitation and irregular use of the restraint belt are related to pressure ulcer.27 The analysis of acceptance of passive turning over, pressure ulcer evaluation, and nutritional evaluation helps us predict the occurrence of pressure ulcer from the perspective of prevention. Conventional scales such as Norton, Breslow, and Waterlow have all evaluated the nutritional status. However, patients receiving treatment in the hospital will receive nutritional intervention treatment and some nursing measures to prevent pressure ulcer, such as turning over.28 The use of these measures will affect the occurrence of pressure ulcer, making it necessary to consider these positive and related measures for predicting pressure ulcer. The advantages of machine learning are related to the database, which makes the response time of variable analysis very fast. Based on the important variables investigated by this research, nurses can save the link of scale evaluation during the pressure ulcer evaluation, which contributes to a reduced evaluation time and more accurate result for pressure ulcer prediction. The selection of important variables related to pressure ulcer is crucial, because it directly affects the accuracy of the feature extraction and output results of the machine learning model. More variables related to pressure ulcer can be exploited to upgrade the model in future research. Meanwhile, the pressure ulcer prediction models constructed based on these 19 factors have achieved satisfactory prediction performance, and the RF model performs the best. The machine learning model shows a comprehensive predictive ability for pressure ulcer, a complex multi-factor disease. Compared with Norton scale, the proposed machine learning model exhibited an improved prediction performance. It may be that the individual differences of pressure ulcer-related variables cannot reflected by the assessment scales. However, these problems can be handled by machine learning, through obtaining more variable data, performing multivariable tasks, screening and processing the multi-factor characteristics of pressure ulcer, and outputting prediction results highly correlated with pressure ulcer. Additionally, machine learning adopts more optimized algorithms and more explicit feature extraction, which leads to more optimized data fitting than traditional tools. In the process of constructing the machine learning prediction model in this study, clarifying the related factors of pressure ulcer is helpful for the model training and make the parameters of the pressure ulcer prediction model more accurate. Three methods are used in this study to improve the prediction performance of the model, including controlling the number of variables, improving the test process, and selecting the appropriate model.29 Logistic regression is exploited to refine the characteristics of risk, reduce low-related variables, and find enough important variables. The low prediction accuracy of the prediction model may be related to the less included factors.30 It is impossible to accurately determine whether the patient is at risk of nursing adverse events with limited factors. But more variables will make the generated machine learning model over-fitted. In this case, a preliminary selection of predictive variables is conducted based on the clinical knowledge. In stage 2 of the model construction, a logistic regression model was used to determine the variables (variables with P<0.05 were selected), so that the number of variables was limited. These variables have a high correlation with pressure ulcer, and the prediction accuracy of the machine learning model is greatly improved. As a current mainstream method, K-fold cross-validation is used in the train set to improve the efficiency of model training.31 For small sample data, the model performance cannot be improved by only setting train and test data sets. The nested use of K-fold cross-validation in the train set can achieve the expected prediction effect.32 Also, the verification in the train set can maximize the use of data to a certain extent, for the amount of pressure ulcer data in this study is not very huge. The comparison between models is also an important way to improve model’s prediction performance. Different machine learning algorithms are suitable for different clinical problems, and pressure ulcer prediction on digital medical record data is often regarded as a linear fitting problem or classification problem. Previous studies usually selected the best model by comparing the performance of multiple models.33,34 Because the machine learning model has different degrees of adaptation when processing different data types. The most common machine learning models are compared to find the subtle differences in the models. Through evaluations of the four models for predicting pressure ulcer adverse event, it was found that the accuracy and recall of the four models were high. RF model showed superior performance than the traditional prediction models. This study adopted a similar method for constructing the pressure ulcer model and obtained similar findings.35,36 Alderden et al also demonstrated that the machine learning model achieved high accuracy for predicting pressure ulcer, and random forest model obtained the best performance (AUC=0.79).37 However, more factors are considered in this study, which can better analyze the risk of pressure ulcer. The random forest model proposed by Hu et al provided a good inspiration for pressure ulcer prediction. In this study, the average precision of DT, logistic regression and RF is 0.969, 0.799, and 0.998, respectively.38 It indicates that the random forest algorithm is more efficient in processing classification problems. In addition, other studies use decision tree models to predict pressure ulcer.39 However, DT is not as good as RF in fitting. The low predictive ability of the ANN model shows that there may be an “overfitting” phenomenon. Also, the 19 variables may bring a large computational burden for the ANN model. The performance of ANN on categorical variables is not as good as that of other models. Therefore, not all machine learning models are suitable for pressure ulcer prediction, suggesting that researchers need to be more cautious in choosing machine learning models for making disease predictions in the future.

Limitations of the Study

At the time of writing this paper, the electronic medical system contains text description rather than image of the pressure ulcer of the patient, and the description content could not be unified. In this case, it can be determined whether the patient has a pressure ulcer, but the classification and prognosis of pressure ulcer are hard. In the future, the description of pressure ulcer event needs to be standardized in the electronic form. At the hospital where the study is located, the incidence of pressure ulcer is only 2.5%, while the average incidence reported is 0.4%～38%. Due to heavy workload, nurses may neglect the care for patients, especially some patients with mild pressure ulcer or patients about to be discharged. This reduces the model’s ability to identify mild pressure ulcers, and this issue remains to be confirmed. Therefore, single-center research may make the processing of diverse data a challenge, requiring more case data from multiple centers. This challenge could further limit the initial clinical application of the proposed model. Additionally, there is no consensus on the risks of pressure ulcer so far. Although many factors are included in this study, the risks taken into consideration for constructing the pressure ulcer prediction model cannot cover all potential pressure ulcer risks. Limited to the hospital digital medical record database, only a strong correlation exists between the model’s inference and the characteristics, but it is still impractical to draw a complete causal relationship. Finally, it is essential to build a model based on a new pressure ulcer medical record database that covers all factors in the future, and the use of multi-center data may significantly promote the clinical application of the model constructed in this study.

Conclusion

Four machine learning models for predicting the pressure ulcer adverse events are studied in this paper. Experimental results based on the common evaluation indictors show that the four models achieve high capabilities in predicting pressure ulcer adverse events. Also, the horizontal comparisons of the models show that the random forest and the decision tree model have better prediction performance and are more suitable for predicting pressure ulcer adverse events. The construction of the pressure ulcer adverse event prediction model provides new early warning tools for pressure ulcer risk in the clinic and improves the feasibility of personalized care. Although the machine learning has been widely used to predict many diseases, there are few cases for pressure ulcer prediction. The pressure ulcer prediction model based on machine learning can be used as a pressure ulcer prevention tool with broad prospects.

37 in total

1. Risk factors for the development of pressure sores in hospitalized elderly patients: results of a prospective study.

Authors: A Bianchetti; O Zanetti; R Rozzini; M Trabucchi
Journal: Arch Gerontol Geriatr Date: 1993 May-Jun Impact factor: 3.250

2. Pressure Injury in the Pediatric Population: Cohort Study Using the Braden Q Scale.

Authors: Marcelli Cristine Vocci; Cassiana Mendes Bertoncello Fontes; Luciana Patricia Fernandes Abbade
Journal: Adv Skin Wound Care Date: 2018-10 Impact factor: 2.347

3. Enhanced Recovery after Surgery in Gynecology: A Review of the Literature.

Authors: Stacey A Scheib; May Thomassee; Jamaan L Kenner
Journal: J Minim Invasive Gynecol Date: 2018-12-20 Impact factor: 4.137

4. An economic analysis of repositioning for the prevention of pressure ulcers.

Authors: Zena Moore; Seamus Cowman; John Posnett
Journal: J Clin Nurs Date: 2013-08 Impact factor: 3.036

5. Prevalence of pressure ulcer and associated risk factors in middle- and older-aged medical inpatients in Norway.

Authors: Tove E Børsting; Christine R Tvedt; Ingrid J Skogestad; Tove I Granheim; Caryl L Gay; Anners Lerdal
Journal: J Clin Nurs Date: 2017-10-23 Impact factor: 3.036

6. Predicting Pressure Injury in Critical Care Patients: A Machine-Learning Model.

Authors: Jenny Alderden; Ginette Alyce Pepper; Andrew Wilson; Joanne D Whitney; Stephanie Richardson; Ryan Butcher; Yeonjung Jo; Mollie Rebecca Cummins
Journal: Am J Crit Care Date: 2018-11 Impact factor: 2.228

7. Machine Learning For Tuning, Selection, And Ensemble Of Multiple Risk Scores For Predicting Type 2 Diabetes.

Authors: Yujia Liu; Shangyuan Ye; Xianchao Xiao; Chenglin Sun; Gang Wang; Guixia Wang; Bo Zhang
Journal: Risk Manag Healthc Policy Date: 2019-11-05

8. Applying of Decision Tree Analysis to Risk Factors Associated with Pressure Ulcers in Long-Term Care Facilities.

Authors: Mikyung Moon; Soo-Kyoung Lee
Journal: Healthc Inform Res Date: 2017-01-31

9. Comparison of machine learning models for seizure prediction in hospitalized patients.

Authors: Aaron F Struck; Andres A Rodriguez-Ruiz; Gamaledin Osman; Emily J Gilmore; Hiba A Haider; Monica B Dhakar; Matthew Schrettner; Jong W Lee; Nicolas Gaspard; Lawrence J Hirsch; M Brandon Westover
Journal: Ann Clin Transl Neurol Date: 2019-06-27 Impact factor: 4.511

10. Machine learning algorithm validation with a limited sample size.

Authors: Andrius Vabalas; Emma Gowen; Ellen Poliakoff; Alexander J Casson
Journal: PLoS One Date: 2019-11-07 Impact factor: 3.240

2 in total

1. Machine Learning-Based Pressure Ulcer Prediction in Modular Critical Care Data.

Authors: Petr Šín; Alica Hokynková; Nováková Marie; Pokorná Andrea; Rostislav Krč; Jan Podroužek
Journal: Diagnostics (Basel) Date: 2022-03-30

2. A Machine Learning Model for Predicting Unscheduled 72 h Return Visits to the Emergency Department by Patients with Abdominal Pain.

Authors: Chun-Chuan Hsu; Cheng-C J Chu; Ching-Heng Lin; Chien-Hsiung Huang; Chip-Jin Ng; Guan-Yu Lin; Meng-Jiun Chiou; Hsiang-Yun Lo; Shou-Yen Chen
Journal: Diagnostics (Basel) Date: 2021-12-30

2 in total