| Literature DB >> 35447732 |
Carlo Ricciardi1, Alfonso Maria Ponsiglione1, Arianna Scala2, Anna Borrelli3, Mario Misasi4, Gaetano Romano4, Giuseppe Russo5, Maria Triassi2,6, Giovanni Improta2,6.
Abstract
Fractures of the femur are a frequent problem in elderly people, and it has been demonstrated that treating them with a diagnostic-therapeutic-assistance path within 48 h of admission to the hospital reduces complications and shortens the length of the hospital stay (LOS). In this paper, the preoperative data of 1082 patients were used to further extend the previous research and to generate several models that are capable of predicting the overall LOS: First, the LOS, measured in days, was predicted through a regression analysis; then, it was grouped by weeks and was predicted with a classification analysis. The KNIME analytics platform was applied to divide the dataset for a hold-out cross-validation, perform a multiple linear regression and implement machine learning algorithms. The best coefficient of determination (R2) was achieved by the support vector machine (R2 = 0.617), while the mean absolute error was similar for all the algorithms, ranging between 2.00 and 2.11 days. With regard to the classification analysis, all the algorithms surpassed 80% accuracy, and the most accurate algorithm was the radial basis function network, at 83.5%. The use of these techniques could be a valuable support tool for doctors to better manage orthopaedic departments and all their resources, which would reduce both waste and costs in the context of healthcare.Entities:
Keywords: clinical pathway; machine learning; multiple linear regression; orthopaedic
Year: 2022 PMID: 35447732 PMCID: PMC9029792 DOI: 10.3390/bioengineering9040172
Source DB: PubMed Journal: Bioengineering (Basel) ISSN: 2306-5354
Figure 1Workflow of the study.
Descriptive statistics of the dataset.
| Variables | Categories |
| LOS (Days) | |
|---|---|---|---|---|
| Age (years) | <75 | 225 | 11.46 ± 5.527 | 0.399 |
| 75–90 | 720 | 11.21 ± 5.071 | ||
| >90 | 137 | 10.79 ± 4.538 | ||
| Allergies | Yes | 138 | 11.79 ± 5.609 | 0.153 |
| No | 944 | 11.12 ± 5.025 | ||
| Cardiovasculardiseases | Yes | 897 | 11.36 ± 4.987 | 0.002 |
| No | 185 | 10.48 ± 5.601 | ||
| Diabetes | Yes | 268 | 11.56 ± 5.657 | 0.291 |
| No | 814 | 11.40 ± 5.162 | ||
| ASA score | I-II | 94 | 9.21 ± 3.970 | <0.001 |
| III-IV | 988 | 11.39 ± 5.242 | ||
| DTAP | No | 534 | 13.21 ± 5.126 | <0.001 |
| Yes | 548 | 9.25 ± 4.259 |
Evaluation metrics for the regression analysis of LOS, measured in days.
| Multiple | Random | MLP | RBF | SVM | |
|---|---|---|---|---|---|
| R2 | 0.610 | 0.507 | 0.584 | 0.616 | 0.610 |
| Mean absolute error | 3.987 | 2.45 | 2.109 | 2.077 | 2.000 |
| Mean squared error | 11.624 | 11.949 | 10.075 | 9.302 | 9.268 |
Coefficients of the multiple linear regression model.
| Variables | Regression | t | |
|---|---|---|---|
| Intercept | 3.42 | 2.24 | 0.02 |
| Age | −0.01 | −0.85 | 0.39 |
| ASA score | 1.22 | −0.76 | 0.45 |
| Diabetes | −0.21 | −0.75 | 0.45 |
| Cardiovascular diseases | −0.25 | 3.83 | 0.001 |
| Allergies | 0.04 | 0.11 | 0.91 |
| Preoperative LOS | 1.03 | 27.77 | <0.001 |
| DTAP | 0.35 | 1.23 | 0.22 |
The accuracies and the best confusion matrix for the classification analysis of LOS, measured in weeks.
| RF | MLP | RBF Network | SVM | |
|---|---|---|---|---|
| Accuracy (%) | 81.1 | 82.3 | 83.5 | 81.1 |
| The best confusion matrix (RBF network) | ||||
| Real/Predicted | 1 | 2 | 3 | 4 |
| 1 | 154 | 12 | 0 | 0 |
| 2 | 29 | 108 | 1 | 0 |
| 3 | 1 | 5 | 17 | 0 |
| 4 | 1 | 2 | 4 | 0 |
Figure 2Histogram describing the feature importance.