| Literature DB >> 36236533 |
Ali Al-Ramini1, Mahdi Hassan2,3, Farahnaz Fallahtafti2,3, Mohammad Ali Takallou4, Hafizur Rahman2, Basheer Qolomany5, Iraklis I Pipinos3,6, Fadi Alsaleem4, Sara A Myers2,3.
Abstract
Peripheral artery disease (PAD) manifests from atherosclerosis, which limits blood flow to the legs and causes changes in muscle structure and function, and in gait performance. PAD is underdiagnosed, which delays treatment and worsens clinical outcomes. To overcome this challenge, the purpose of this study is to develop machine learning (ML) models that distinguish individuals with and without PAD. This is the first step to using ML to identify those with PAD risk early. We built ML models based on previously acquired overground walking biomechanics data from patients with PAD and healthy controls. Gait signatures were characterized using ankle, knee, and hip joint angles, torques, and powers, as well as ground reaction forces (GRF). ML was able to classify those with and without PAD using Neural Networks or Random Forest algorithms with 89% accuracy (0.64 Matthew's Correlation Coefficient) using all laboratory-based gait variables. Moreover, models using only GRF variables provided up to 87% accuracy (0.64 Matthew's Correlation Coefficient). These results indicate that ML models can classify those with and without PAD using gait signatures with acceptable performance. Results also show that an ML gait signature model that uses GRF features delivers the most informative data for PAD classification.Entities:
Keywords: deep learning; gait analysis; machine learning; peripheral artery disease; vascular disease
Mesh:
Year: 2022 PMID: 36236533 PMCID: PMC9572112 DOI: 10.3390/s22197432
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.847
Figure 1A flowchart briefly describing the utilized data and the methods applied.
Figure A1Gait features extracted from raw biomechanics data. (a) Ground reaction forces (GRF) raw signals and the peak points extracted, (b) Ankle, (c) Hip, and (d) Knee raw signals with the peak points extracted.
Biomechanics Data and Gait Features Sources and Descriptions.
| Gait Feature Source | Raw Signal | Gait Signature Extracted | Definition and Explanation |
|---|---|---|---|
|
| GRF x-axis | GRF is recorded on overground force plates, where the center of pressure is expressed in a standard cartesian coordinate system (x, y, z). The ground reaction force is exerted by the ground on a body in contact with it and is composed of three components: vertical, anterior-posterior, and mediolateral. These forces can be combined with the limb orientation data to calculate ankle, knee, and hip joint torques and powers. The rotating effect of the force located at a distance from the joint axis is quantified using joint torques, while the joint power quantifies the power output of individual joints during walking. | |
| GRF y-axis | |||
| GRF z-axis | |||
|
| Ankle Joint Angle | The ankle is plantar flexed at heel strike in the range of 5–6 degrees, moves to 10–12 degrees of dorsiflexion, and then back to plantarflexion (15–20 degrees) at toe-off. | |
| Ankle Torque | During loading, the ankle has a dorsiflexor torque as the foot is lowered to the ground. Next, a plantarflexion torque occurs through midstance to control the weight transfer over the ankle as the body moves over the foot. Finally, at late stance, the plantarflexion torque continues as the plantar flexors advance the foot into the swing. | ||
| Ankle Power | At loading response, power is absorbed by the dorsiflexors as the foot is lowered to the ground. Power absorption continues by the plantar flexors as the body moves over the foot. Finally, power is generated by the plantar flexors to drive the leg into the swing. | ||
|
| Hip Joint Angle | Peak hip flexion usually occurs at heel contact and is approximately 35–50 degrees. After heel contact, hip flexion reduces throughout support until toe-off. | |
| Hip Torque | A net hip extensor torque during the initial loading phase of support continues through midstance into late stance. | ||
| Hip Power | At heel contact, there is power generation of the hip extensors. In late stance, there is new power absorption by the hip extensors to decelerate the hip flexors, followed by power generation of the hip flexors to propel the leg into the swing. | ||
|
| Knee Joint Angle | The ankle is plantar flexed at heel strike in the range of 5–6 degrees and moves to 10–12 degrees of dorsiflexion and then back to plantarflexion (15–20 degrees) at toe-off. | |
| Knee Torque | The loading response at the knee involves an extensor torque of the knee, which transfers to a flexor torque after the knee angle moves into extension towards toe-off. | ||
| Knee Power | There is knee flexion controlled by the extensors (power absorption) at heel contact moving into midstance, where there is a knee extensor torque controlled by the extensors (power generation). In late stance, there is a knee extensor torque controlled by the extensors (power generation). |
Figure 2Feature variance comparison of gait features between patients with PAD and healthy controls. (a) GRF gait features, (b) Ankle gait features, (c) Hip gait features, (d) Knee gait features. The green asterisks indicate a significant difference in variance based on Levene’s homogeneity of variance.
Figure 3A comparison between gait feature sources in terms of (a) Average ANOVA F-statistic and (b) Average Information Gain. GRF features have higher F-statistic and information gain than the ankle, knee, and hip gait parameters.
Figure 4A correlation study of all numeric gait features regardless of PAD status (healthy controls or patients with PAD). The color line indicates the correlation coefficient between features from “Dark Red: −1” to “Navy: 1”. A correlation coefficient of “−1” between two variables implies a perfect negative relationship, and a correlation coefficient of “1” between two variables implies a perfect positive relationship. If the correlation between two variables is 0, there is no linear relationship.
Figure 5Flowchart showing the 3-step ML analysis method to identify PAD and extract the most valuable gait signatures for PAD diagnosis.
The hyperparameters for each algorithm.
| Algorithm | List of Hyperparameters |
|---|---|
| Neural Networks |
Activation function Optimizer Kernel Initializer Learning rate Regularization Batch size Number of epochs |
| Random Forest |
Number of trees Maximum number of features Maximum depth of layers Criteria |
| SVM |
Kernel Gamma Penalty parameter |
| Logistic Regression |
Regularization |
Figure 6Performance metric results that measure the ability of each ML model algorithm to distinguish PAD.
A summary of the models’ performances on the testing data. It evaluates the applied model scores utilizing every group based on four performance metrics. The shaded columns highlight the comparison between all gait features models (Group 1) and GRF features (Group 6).
| Metric | Model Type | Group Category | |||||
|---|---|---|---|---|---|---|---|
| All | Ankle | Hip | Knee | GRF | Ankle, Hip, Knee | ||
| Group 1 | Group 2 | Group 3 | Group 4 | Group 5 | Group 6 | ||
| Accuracy | Neural Networks | 0.89 | 0.79 | 0.78 | 0.81 | 0.82 | 0.84 |
| Random Forest | 0.89 | 0.69 | 0.73 | 0.75 | 0.87 | 0.83 | |
| Discriminant Power | Neural Networks | 1.94 | 0.95 | 0.82 | 0.90 | 1.87 | 1.33 |
| Random Forest | 1.94 | 0.64 | 0.29 | 0.71 | 2.09 | 1.19 | |
| Geometric Mean | Neural Networks | 0.83 | 0.65 | 0.61 | 0.54 | 0.84 | 0.84 |
| Random Forest | 0.83 | 0.63 | 0.46 | 0.60 | 0.87 | 0.63 | |
| Matthew’s Correlation Coefficient | Neural Networks | 0.64 | 0.33 | 0.27 | 0.27 | 0.57 | 0.44 |
| Random Forest | 0.64 | 0.22 | 0.09 | 0.24 | 0.64 | 0.39 | |
| Best model type | Neural Networks, Random Forest | Neural Networks | Neural Networks | Neural Networks | Random Forest | Neural Networks | |
| ML Performance Metrics Description: Accuracy: the number of correct predictions divided by the total number of examples. Range: (0 to 1), an accuracy value of “1” means the model predicts perfectly with no errors. Discriminant Power: measures the ability of the classifier to distinguish between minority (healthy controls) and majority (Patients with PAD) cases. A higher Discriminant Power value translates to better model performance. Geometric Mean measures the balance of the classification performance in the majority and minority cases. The higher the geometric mean, the better the model performance. Matthew’s Correlation Coefficient provides a good score only if the model performs well in all four confusion matrix categories. Range: (−1 to 1), with “1” as the perfect model, “−1” as the worst model, and 0 no better than a random naïve model. | |||||||
Figure 7A measurement of the ability of a few laboratory-based gait features to distinguish PAD using Matthew’s Correlation Coefficient. Generally, the GRF-based models (Groups 1 and 5) performed better than joint data models (Groups 2, 3, 4, and 6) and provided comparable results to using all gait features.
Figure 8A comparison between GRF-based signals in identifying PAD using Matthew’s Correlation Coefficient as a performance metric. For both Neural Network and Random Forest model approaches, including all GRF components led to better PAD classification compared with any single GRF component (x = anteroposterior, y = mediolateral, and z = vertical).