| Literature DB >> 35627755 |
Teresa Angela Trunfio1, Anna Borrelli2, Giovanni Improta3,4.
Abstract
The proximal fracture of the femur and hip is the most common reason for hospitalization in orthopedic departments. In Italy, 115,989 hip-replacement surgeries were performed in 2019, showing the economic relevance of studying this type of procedure. This study analyzed the data relating to patients who underwent hip-replacement surgery in the years 2010-2020 at the "San Giovanni di Dio e Ruggi d'Aragona" University Hospital of Salerno. The multiple linear regression (MLR) model and regression and classification algorithms were implemented in order to predict the total length of stay (LOS). Lastly, using a statistical analysis, the impact of COVID-19 was evaluated. The results obtained from the regression analysis showed that the best model was MLR, with an R2 value of 0.616, compared with XGBoost, Gradient-Boosted Tree, and Random Forest, with R2 values of 0.552, 0.543, and 0.448, respectively. The t-test showed that the variables that most influenced the LOS, with the exception of pre-operative LOS, were gender, age, anemia, fracture/dislocation, and urinary disorders. Among the classification algorithms, the best result was obtained with Random Forest, with a sensitivity of the longest LOS of over 89%. In terms of the overall accuracy, Random Forest and Gradient-Boosted Tree achieved a value of 71.76% and an error of 28.24%, followed by Decision Tree, with an accuracy of 71.13% and an error of 28.87%, and, finally, Support Vector Machine, with an accuracy of 65.06% and an error of 34.94%. A significant difference in cardiovascular disease, fracture/dislocation, and post-operative LOS variables was shown by the chi-squared test and Mann-Whitney test in the comparison between 2019 (before COVID-19) and 2020 (in full pandemic emergency conditions).Entities:
Keywords: data mining; hip; length of stay
Mesh:
Year: 2022 PMID: 35627755 PMCID: PMC9141454 DOI: 10.3390/ijerph19106219
Source DB: PubMed Journal: Int J Environ Res Public Health ISSN: 1660-4601 Impact factor: 4.614
Figure 1Distribution of the features in the dataset.
Figure 2Homoscedasticity of the data.
Figure 3Q–Q plot.
Pearson correlation.
| LOS | Gender | Age | Pre-Operative LOS | Diabetes | Hypertension | Obesity | Anemia | Vitamin D Deficiency | Tumor | Fracture/Dislocation | Brain Disorders | Urinary Disorders | Cardiovascular Disease | Respiratory Disease | Anticoagulant Therapy | ||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Pearson Correlation | LOS | 1.000 | 0.054 | 0.137 | 0.772 | −0.027 | −0.104 | −0.023 | 0.049 | −0.054 | 0.069 | 0.248 | −0.009 | 0.046 | 0.109 | 0.024 | 0.002 |
| Gender | 0.054 | 1.000 | 0.182 | −0.010 | −0.008 | 0.080 | 0.029 | 0.104 | 0.040 | −0.008 | 0.055 | 0.011 | −0.035 | −0.016 | −0.085 | −0.029 | |
| Age | 0.137 | 0.182 | 1.000 | .088 | 0.060 | 0.189 | −0.018 | 0.126 | 0.115 | −0.005 | 0.095 | 0.119 | 0.077 | 0.218 | 0.054 | 0.064 | |
| Pre-operative LOS | 0.772 | −0.010 | 0.088 | 1.000 | −0.064 | −0.161 | −0.022 | −0.064 | −0.101 | 0.072 | 0.260 | −0.019 | 0.005 | 0.078 | −0.002 | −0.008 | |
| Diabetes | −0.027 | −0.008 | 0.060 | −0.064 | 1.000 | 0.202 | −0.020 | 0.090 | 0.036 | 0.052 | −0.024 | 0.028 | 0.033 | 0.066 | 0.079 | 0.068 | |
| Hypertension | −0.104 | 0.080 | 0.189 | −0.161 | 0.202 | 1.000 | 0.062 | 0.174 | 0.130 | −0.039 | −0.142 | 0.058 | 0.062 | 0.177 | 0.112 | 0.066 | |
| Obesity | −0.023 | 0.029 | −0.018 | −0.022 | −0.020 | 0.062 | 1.000 | 0.007 | −0.011 | −0.006 | −0.031 | −0.019 | 0.028 | 0.004 | −0.014 | 0.031 | |
| Anemia | 0.049 | 0.104 | 0.126 | −0.064 | 0.090 | 0.174 | 0.007 | 1.000 | 0.154 | 0.033 | −0.029 | 0.090 | 0.089 | 0.063 | 0.055 | 0.066 | |
| Vitamin D deficiency | −0.054 | 0.040 | 0.115 | −0.101 | 0.036 | 0.130 | −0.011 | 0.154 | 1.000 | 0.001 | −0.052 | 0.125 | 0.005 | 0.072 | 0.083 | 0.024 | |
| Tumor | 0.069 | −0.008 | −0.005 | 0.072 | 0.052 | −0.039 | −0.006 | 0.033 | 0.001 | 1.000 | 0.017 | 0.004 | 0.024 | 0.042 | 0.105 | −0.018 | |
| Fracture/Dislocation | 0.248 | 0.055 | 0.095 | 0.260 | −0.024 | −0.142 | −0.031 | −0.029 | −0.052 | 0.017 | 1.000 | −0.041 | −0.019 | 0.202 | −0.042 | −0.050 | |
| Brain disorders | −0.009 | 0.011 | 0.119 | −0.019 | 0.028 | 0.058 | −0.019 | 0.090 | 0.125 | 0.004 | −0.041 | 1.000 | −0.018 | 0.040 | 0.038 | 0.014 | |
| Urinary disorders | 0.046 | −0.035 | 0.077 | 0.005 | 0.0033 | 0.062 | 0.028 | 0.089 | 0.005 | 0.024 | −0.019 | −0.018 | 1.000 | 0.067 | 0.008 | 0.027 | |
| Cardiovascular disease | 0.109 | −0.016 | 0.218 | 0.078 | 0.066 | 0.177 | 0.004 | 0.063 | 0.072 | 0.042 | 0.202 | 0.040 | 0.067 | 1.000 | 0.040 | 0.183 | |
| Respiratory disease | 0.024 | −0.085 | 0.054 | −0.002 | 0.079 | 0.112 | −0.014 | 0.055 | 0.083 | 0.105 | −0.042 | 0.038 | 0.008 | 0.040 | 1.000 | 0.025 | |
| Anticoagulant therapy | 0.002 | −0.029 | 0.064 | −0.008 | 0.068 | 0.066 | 0.031 | 0.066 | 0.024 | −0.018 | −0.050 | 0.014 | 0.027 | 0.183 | 0.025 | 1.000 | |
| Sig. (1-tailed) | LOS | 0.003 | 0.000 | 0.000 | 0.089 | 0.000 | 0.120 | 0.007 | 0.003 | 0.000 | 0.000 | 0.326 | 0.011 | 0.000 | 0.110 | 0.465 | |
| Gender | 0.003 | 0.000 | 0.308 | 0.341 | 0.000 | 0.071 | 0.000 | 0.023 | 0.340 | 0.003 | 0.284 | 0.040 | 0.218 | 0.000 | 0.071 | ||
| Age | 0.000 | 0.000 | 0.000 | 0.001 | 0.000 | 0.190 | 0.000 | 0.000 | 0.402 | 0.000 | 0.000 | 0.000 | 0.000 | 0.004 | 0.001 | ||
| Pre-operative LOS | 0.000 | 0.308 | 0.000 | 0.001 | 0.000 | 0.132 | 0.001 | 0.000 | 0.000 | 0.000 | 0.177 | 0.394 | 0.000 | 0.451 | 0.352 | ||
| Diabetes | 0.089 | 0.341 | 0.001 | 0.001 | 0.000 | 0.160 | 0.000 | 0.036 | 0.005 | 0.117 | 0.082 | 0.048 | 0.000 | 0.000 | 0.000 | ||
| Hypertension | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.001 | 0.000 | 0.000 | 0.026 | 0.000 | 0.002 | 0.001 | 0.000 | 0.000 | 0.000 | ||
| Obesity | 0.120 | 0.071 | 0.190 | 0.132 | 0.160 | 0.001 | 0.354 | 0.289 | 0.373 | 0.060 | 0.169 | 0.083 | 0.421 | 0.235 | 0.060 | ||
| Anemia | 0.007 | 0.000 | 0.000 | 0.001 | 0.000 | 0.000 | 0.354 | 0.000 | 0.050 | 0.076 | 0.000 | 0.000 | 0.001 | 0.003 | 0.000 | ||
| Vitamin D deficiency | 0.003 | 0.023 | 0.000 | 0.000 | 0.036 | 0.000 | 0.289 | 0.000 | 0.482 | 0.005 | 0.000 | 0.392 | 0.000 | 0.000 | 0.114 | ||
| Tumor | 0.000 | 0.340 | 0.402 | 0.000 | 0.005 | 0.026 | 0.373 | 0.050 | 0.482 | 0.194 | 0.420 | 0.118 | 0.017 | 0.000 | 0.183 | ||
| Fracture/dislocation | 0.000 | 0.003 | 0.000 | 0.000 | 0.117 | 0.000 | 0.060 | 0.076 | 0.005 | 0.194 | 0.021 | 0.166 | 0.000 | 0.017 | 0.006 | ||
| Brain disorders | 0.326 | 0.284 | 0.000 | 0.177 | 0.082 | 0.002 | 0.169 | 0.000 | 0.000 | 0.420 | 0.021 | 0.189 | 0.022 | 0.028 | 0.236 | ||
| Urinary disorders | 0.011 | 0.040 | 0.000 | 0.394 | 0.048 | 0.001 | 0.083 | 0.000 | 0.392 | 0.118 | 0.166 | 0.189 | 0.000 | 0.352 | 0.090 | ||
| Cardiovascular disease | 0.000 | 0.218 | 0.000 | 0.000 | 0.000 | 0.000 | 0.421 | 0.001 | 0.000 | 0.017 | 0.000 | 0.022 | 0.000 | 0.022 | 0.000 | ||
| Respiratory disease | 0.110 | 0.000 | 0.004 | 0.451 | 0.000 | 0.000 | 0.235 | 0.003 | 0.000 | 0.000 | 0.017 | 0.028 | 0.352 | 0.022 | 0.107 | ||
| Anticoagulant therapy | 0.465 | 0.071 | 0.001 | 0.352 | 0.000 | 0.000 | 0.060 | 0.000 | 0.114 | 0.183 | 0.006 | 0.236 | 0.090 | 0.000 | 0.107 | ||
Multiple linear regression model.
| R | R2 | Adjusted R2 | Std. Error of the Estimate | |
|---|---|---|---|---|
| Model | 0.785 | 0.616 | 0.613 | 3.726 |
Coefficients of MLR model.
| Unstandardized Coefficients | Standardized Coefficients | t | |||
|---|---|---|---|---|---|
| B | Std. Error | Beta | |||
| (Constant) | 4.405 | 0.522 | - | 8.442 | 0.000 |
| Gender | 0.609 | 0.162 | 0.048 | 3.762 |
|
| Age | 0.020 | 0.007 | 0.040 | 2.960 |
|
| Pre-operative LOS | 1.011 | 0.017 | 0.760 | 57.908 |
|
| Diabetes | 0.221 | 0.257 | 0.011 | 0.862 | 0.389 |
| Hypertension | −0.166 | 0.178 | −0.013 | −0.933 | 0.351 |
| Obesity | −0.624 | 1.250 | −0.006 | −0.499 | 0.618 |
| Anemia | 1.130 | 0.173 | 0.084 | 6.537 |
|
| Vitamin D deficiency | 0.127 | 0.430 | 0.004 | 0.295 | 0.768 |
| Tumor | 0.328 | 0.705 | 0.006 | 0.465 | 0.642 |
| Fracture/dislocation | 0.593 | 0.196 | 0.040 | 3.020 |
|
| Brain disorders | −0.159 | 0.261 | −0.008 | −0.610 | 0.542 |
| Urinary disorders | 1.115 | 0.433 | 0.032 | 2.572 |
|
| Cardiovascular disease | 0.348 | 0.176 | 0.027 | 1.983 |
|
| Respiratory disease | 0.632 | 0.335 | 0.024 | 1.888 | 0.059 |
| Anticoagulant therapy | −0.116 | 0.470 | −0.003 | −0.248 | 0.804 |
Results of regression algorithms.
| LR | RF | GBT | XGBoost | |
|---|---|---|---|---|
| R2 | 0.552 | 0.448 | 0.543 | 0.552 |
| Root mean squared error | 3.843 | 4.497 | 3.883 | 3.843 |
Figure 4Linear Regression.
Figure 5XGBoost.
Performance metrics of all selected algorithms.
| Performance Metrics | Class | DT | GBT | RF | SVM |
|---|---|---|---|---|---|
| Accuracy (%) | Overall | 71.13 | 71.76 | 71.76 | 65.06 |
| Error (%) | Overall | 28.87 | 28.24 | 28.24 | 34.94 |
| Precision (%) | 1 | 65.35 | 69.49 | 55.04 | 63.46 |
| 2 | 61.58 | 60.93 | 80.68 | 61.29 | |
| 3 | 89.19 | 89.66 | 75.14 | 67.69 | |
| Sensitivity (%) | 1 | 64.34 | 63.57 | 76.34 | 76.74 |
| 2 | 71.02 | 74.43 | 59.17 | 32.39 | |
| 3 | 76.30 | 75.14 | 89.66 | 89.60 | |
| Specificity (%) | 1 | 87.39 | 89.68 | 84.94 | 83.67 |
| 2 | 74.17 | 72.19 | 85.71 | 88.08 | |
| 3 | 94.75 | 95.08 | 87.09 | 75.74 | |
| F-measure (%) | 1 | 64.84 | 66.40 | 63.96 | 69.47 |
| 2 | 65.96 | 67.01 | 68.27 | 42.38 | |
| 3 | 82.24 | 81.76 | 81.76 | 77.11 |
Random Forest confusion matrix.
| Real/Predicted | 1 | 2 | 3 |
|---|---|---|---|
| 1 | 71 | 20 | 2 |
| 2 | 57 | 142 | 41 |
| 3 | 1 | 14 | 130 |
Figure 6Global importance Feature.
Analysis of COVID-19 impact.
| Variable | 2019 | 2020 | |
|---|---|---|---|
| Age | |||
| Mean | 77.76 | 78.22 | 0.800 |
| Gender | |||
| Male | 88 | 59 | 0.918 |
| Female | 184 | 126 | |
| Pre-operative LOS | |||
| Mean | 3.05 | 3.14 | 0.066 |
| Post-operative LOS | |||
| Mean | 7.70 | 7.09 |
|
| Diabetes | |||
| No | 233 | 155 | 0.582 |
| Yes | 39 | 30 | |
| Hypertension | |||
| No | 159 | 101 | 0.413 |
| Yes | 113 | 84 | |
| Anemia | |||
| No | 168 | 117 | 0.749 |
| Yes | 104 | 68 | |
| Obesity | |||
| No | 268 | 185 | 0.098 |
| Yes | 4 | 0 | |
| Vitamin D deficiency | |||
| No | 225 | 154 | 0.884 |
| Yes | 47 | 31 | |
| Tumor | |||
| No | 264 | 180 | 0.880 |
| Yes | 8 | 5 | |
| Fracture/dislocation | |||
| No | 262 | 142 |
|
| Yes | 10 | 43 | |
| Brain disorders | |||
| No | 218 | 155 | 0.325 |
| Yes | 54 | 30 | |
| Urinary disorders | |||
| No | 261 | 177 | 0.883 |
| Yes | 11 | 8 | |
| Cardiovascular disease | |||
| No | 192 | 101 |
|
| Yes | 80 | 84 | |
| Anticoagulant therapy | |||
| No | 257 | 178 | 0.396 |
| Yes | 15 | 7 | |
| Respiratory disease | |||
| No | 243 | 174 | 0.080 |
| Yes | 29 | 11 | |
| LOS | |||
| Mean | 10.75 | 10.22 | 0.240 |