| Literature DB >> 35459787 |
Christos Kokkotis1,2, Serafeim Moustakidis3, Themistoklis Tsatalas4, Charis Ntakolia5,6, Georgios Chalatsis7, Stylianos Konstadakos8, Michael E Hantes7, Giannis Giakas4, Dimitrios Tsaopoulos9.
Abstract
Anterior cruciate ligament (ACL) deficient and reconstructed knees display altered biomechanics during gait. Identifying significant gait changes is important for understanding normal and ACL function and is typically performed by statistical approaches. This paper focuses on the development of an explainable machine learning (ML) empowered methodology to: (i) identify important gait kinematic, kinetic parameters and quantify their contribution in the diagnosis of ACL injury and (ii) investigate the differences in sagittal plane kinematics and kinetics of the gait cycle between ACL deficient, ACL reconstructed and healthy individuals. For this aim, an extensive experimental setup was designed in which three-dimensional ground reaction forces and sagittal plane kinematic as well as kinetic parameters were collected from 151 subjects. The effectiveness of the proposed methodology was evaluated using a comparative analysis with eight well-known classifiers. Support Vector Machines were proved to be the best performing model (accuracy of 94.95%) on a group of 21 selected biomechanical parameters. Neural Networks accomplished the second best performance (92.89%). A state-of-the-art explainability analysis based on SHapley Additive exPlanations (SHAP) and conventional statistical analysis were then employed to quantify the contribution of the input biomechanical parameters in the diagnosis of ACL injury. Features, that would have been neglected by the traditional statistical analysis, were identified as contributing parameters having significant impact on the ML model's output for ACL injury during gait.Entities:
Mesh:
Year: 2022 PMID: 35459787 PMCID: PMC9026057 DOI: 10.1038/s41598-022-10666-2
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Figure 1Learning curves with testing accuracy scores for different ML models trained on feature subsets of increasing dimensionality in the 3-class problem (refering to both ACL deficient and ACL reconstructed patients).
Best testing accuracies (%) achieved for ML models in 3-class problem along with precision, recall, f1-score and the optimum number of features.
| Models | Accuracy | Classes | Precision | Recall | F1-Score | Num. of features |
|---|---|---|---|---|---|---|
| XGBoost | 74.75 | CON | 70.80 | 95.24 | 81.22 | 19 |
| ACLD | 81.48 | 44.00 | 57.14 | |||
| ACLR | 79.31 | 71.88 | 75.41 | |||
| Random Forest | 86.36 | CON | 80.00 | 95.24 | 86.96 | 23 |
| ACLD | 90.00 | 72.00 | 80.00 | |||
| ACLR | 94.83 | 85.94 | 90.16 | |||
| Decision Trees | 73.74 | CON | 76.67 | 82.14 | 79.31 | 21 |
| ACLD | 72.09 | 62.00 | 66.67 | |||
| ACLR | 70.77 | 71.88 | 71.32 | |||
| Naive Bayes | 57.58 | CON | 65.69 | 79.76 | 72.04 | 14 |
| ACLD | 40.91 | 54.00 | 46.55 | |||
| ACLR | 66.67 | 31.25 | 42.55 | |||
| SVM | 94.95 | CON | 95.35 | 97.62 | 96.47 | 21 |
| ACLD | 92.16 | 94.00 | 93.07 | |||
| ACLR | 96.72 | 92.19 | 94.40 | |||
| KNN | 90.40 | CON | 85.26 | 96.43 | 90.50 | 19 |
| ACLD | 95.12 | 78.00 | 85.71 | |||
| ACLR | 95.16 | 92.19 | 93.65 | |||
| Logistic Regression | 68.18 | CON | 70.64 | 91.67 | 79.79 | 20 |
| ACLD | 57.50 | 46.00 | 51.11 | |||
| ACLR | 71.43 | 54.69 | 61.95 | |||
| NN | 92.89 | CON | 96.30 | 92.86 | 94.55 | 20 |
| ACLD | 90.57 | 96.00 | 93.20 | |||
| ACLR | 90.48 | 90.48 | 90.48 |
Figure 2(a) Average feature impact magnitude for all instances in the 3-class problem; (b) Features’ impact on SVM model output for local problem 1. This figure shows the average impact magnitude for all instances in the task of differentiating the control group vs pre-surgery group; (c) Average feature impact magnitude for all instances in the local problem 2 (control versus ACLR); (d) Average feature impact magnitude for all instances for local problem 3 (pre-surgery group versus post-surgery group).
Statistical comparison at the global level.
| Features | Statistical comparison | CON | ACLD | ACLR |
|---|---|---|---|---|
| Mean (std) | Mean (std) | Mean (std) | ||
| K2 | P = 0.004 | 8.59 ± 3.81 | 8.35 ± 4.89 | 9.72 ± 4.70 |
| H4 | P = 0.007 | 37.93 ± 4.93 | 36.26 ± 6.42 | 37.03 ± 5.53 |
| A3 | P = 0.000 | 13.81 ± 6.52 | 15.91 ± 7.43 | 16.88 ± 7.25 |
| GRF4 | P = 0.000 | 19.61 ± 4.30 | 16.94 ± 5.17 | 16.76 ± 4.84 |
| GRF7 | P = 0.000 | 5.19 ± 1.65 | 5.81 ± 2.03 | 6.18 ± 2.92 |
| K1 | P = 0.001 | 21.62 ± 5.88 | 19.73 ± 6.56 | 19.88 ± 6.27 |
| A4 | P = 0.011 | 0.18 ± 0.08 | 0.19 ± 0.08 | 0.20 ± 0.09 |
| GRF6 | P = 0.004 | 5.69 ± 1.43 | 6.01 ± 2.21 | 6.36 ± 2.97 |
Statistical analysis at the local level for ACL diagnosis and rehabilitation.
| Features* | CON vs ACLD | CON vs ACLR |
|---|---|---|
| H4 | P = 0.002 | P = 0.057 |
| K7 | P = 0.000 | P = 0.001 |
| GRF3 | P = 0.000 | P = 0.090 |
| H1 | P = 0.288 | P = 0.792 |
| H2 | P = 0.723 | P = 0.326 |
| GRF6 | P = 0.061 | P = 0.001 |
| GRF4 | P = 0.000 | P = 0.000 |
| GRF5 | P = 0.721 | P = 0.147 |
Evaluated parameters of the gait cycle for vertical and horizontal GRFs and sagittal plane kinematics and kinetics.
| Variables | Description |
|---|---|
| GRF1 | Local minimum vertical GRF during support (% BW) |
| GRF2 | First vertical GRF peak (% BW) |
| GRF3 | Second vertical GRF peak (% BW) |
| GRF4 | Anterior (propulsive) GRF peak (% BW) |
| GRF5 | Posterior (braking) GRF peak (% BW) |
| GRF6 | First medial GRF peak (% BW) |
| GRF7 | Second medial GRF peak (% BW) |
| H1 | Hip flexion angle at initial contact (°) |
| H2 | Maximum hip flexion angle during stance phase (°) |
| H3 | Maximum hip extension angle during stance phase (°) |
| H4 | Maximum hip flexion angle during swing phase (°) |
| H5 | Maximum hip extension moment during stance phase (Nm/kg) |
| H6 | Maximum hip flexion moment during stance phase (Nm/kg) |
| H7 | Maximum hip extension moment during swing phase (Nm/kg) |
| K1 | Peak knee flexion angle during stance phase (°) |
| K2 | Minimum knee flexion angle during stance phase (°) |
| K3 | Knee flexion angle at foot off (°) |
| K4 | Maximum knee flexion angle during swing phase (°) |
| K5 | Knee flexion angle at first maximum knee extension moment during stance phase (°) |
| K6 | Knee flexion angle at first vertical ground rection force peak (°) |
| K7 | First maximum knee extension moment during stance phase (Nm/kg) |
| A1 | Ankle angle at initial contact (°) |
| A2 | Maximum dorsi-flexion angle during stance phase (°) |
| A3 | Maximum plantar-flexion angle over the entire gait cycle (°) |
| A4 | Maximum dorsiflexion moment during stance phase (Nm/kg) |
| A5 | Maximum plantarflexion moment during stance phase (Nm/kg) |
Subjects’ characteristics.
| Characteristics | ACLD | ACLR | CON |
|---|---|---|---|
| Gender | 31 males and 13 females | 40 males and 14 females | 34 males and 19 females |
| Height | 175.3 ± 0.86 cm | 177.6 ± 0.80 cm | 174.1 ± 0.98 cm |
| Weight | 77.38 ± 14.91 kg | 76.37 ± 14.35 kg | 72.23 ± 15.81 kg |
Figure 3Three dimensional GRFs (a), sagittal plane kinematic (b) and kinetic (c) variables of interest during walking. Ankle dorsiflexion, knee flexion, hip flexion, anterior and medial GRFs, as well as internal ankle plantar flexion, knee extension and hip extension moments were all defined as positive.