| Literature DB >> 35391750 |
Rana Zia Ur Rehman1, Yu Guan2, Jian Qing Shi3,4, Lisa Alcock1, Alison J Yarnall1,5, Lynn Rochester1,5, Silvia Del Din1.
Abstract
Parkinson's disease (PD) is a common neurodegenerative disease. PD misdiagnosis can occur in early stages. Gait impairment in PD is typical and is linked with an increased fall risk and poorer quality of life. Applying machine learning (ML) models to real-world gait has the potential to be more sensitive to classify PD compared to laboratory data. Real-world gait yields multiple walking bouts (WBs), and selecting the optimal method to aggregate the data (e.g., different WB durations) is essential as this may influence classification performance. The objective of this study was to investigate the impact of environment (laboratory vs. real world) and data aggregation on ML performance for optimizing sensitivity of PD classification. Gait assessment was performed on 47 people with PD (age: 68 ± 9 years) and 52 controls [Healthy controls (HCs), age: 70 ± 7 years]. In the laboratory, participants walked at their normal pace for 2 min, while in the real world, participants were assessed over 7 days. In both environments, 14 gait characteristics were evaluated from one tri-axial accelerometer attached to the lower back. The ability of individual gait characteristics to differentiate PD from HC was evaluated using the Area Under the Curve (AUC). ML models (i.e., support vector machine, random forest, and ensemble models) applied to real-world gait showed better classification performance compared to laboratory data. Real-world gait characteristics aggregated over longer WBs (WB 30-60 s, WB > 60 s, WB > 120 s) resulted in superior discriminative performance (PD vs. HC) compared to laboratory gait characteristics (0.51 ≤ AUC ≤ 0.77). Real-world gait speed showed the highest AUC of 0.77. Overall, random forest trained on 14 gait characteristics aggregated over WBs > 60 s gave better performance (F1 score = 77.20 ± 5.51%) as compared to laboratory results (F1 Score = 68.75 ± 12.80%). Findings from this study suggest that the choice of environment and data aggregation are important to achieve maximum discrimination performance and have direct impact on ML performance for PD classification. This study highlights the importance of a harmonized approach to data analysis in order to drive future implementation and clinical use. Clinical Trial Registration: [09/H0906/82].Entities:
Keywords: Parkinson’s disease; accelerometer; gait; gait aggregation; laboratory; machine learning; real-world; wearables
Year: 2022 PMID: 35391750 PMCID: PMC8981298 DOI: 10.3389/fnagi.2022.808518
Source DB: PubMed Journal: Front Aging Neurosci ISSN: 1663-4365 Impact factor: 5.750
FIGURE 1Overall workflow from gait assessment to classification: (A) Gait assessment protocol, (B) WB detection and gait characterization, (C) 14 gait characteristics (Var, variability; Asy, asymmetry), (D) Classification modeling.
Demographics and clinical measures of the Parkinson’s disease (PD) and healthy controls (HC) group.
| Characteristics | HC ( | PD ( |
|
| M/F ( | 28/24 | 32/15 | 0.083 |
| Age (years) | 70.39 ± 6.88 | 68.36 ± 8.98 | 0.216 |
| Height (m) | 1.69 ± 0.08 | 1.70 ± 0.08 | 0.542 |
| Mass (kg) | 81.13 ± 15.15 | 80.27 ± 15.67 | 0.786 |
| BMI (kg/m2) | 28.29 ± 4.23 | 27.62 ± 4.62 | 0.455 |
| MoCA | 27.61 ± 2.39 | 26.28 ± 3.60 |
|
| ABCs (0–100%) | 91.02 ± 11.65 | 80.88 ± 16.18 |
|
| Medication (LEDD, mg/day) | 415.08 ± 212.61 | ||
| Time from Clinical Diagnosis (months) | 26.42 ± 5.48 | ||
| Hoehn and Yahr ( | HY I–7 (15%) | ||
| HY II–38 (81%) | |||
| HY III–2 (4%) | |||
| MDS-UPDRS III | 31.53 ± 9.79 | ||
| (HY I–16.6 ± 4.73) | |||
| (HY II–33.28 ± 8.81) | |||
| (HY III–35.5 ± 0.71) |
M, Males; F, Females; BMI, Body Mass Index; MoCA, Montreal Cognitive Assessment; ABCs, Activities Specific Balance Confidence scale; LEDD, Levodopa Equivalent Daily Dose; MDS-UPDRS, Movement Disorder Society Unified Parkinson’s Disease Rating Scale. Bold values mean a significant difference between PD and HC.
FIGURE 2Distribution of all detected walking bouts (WBs) in Parkinson’s disease (PD) and healthy control (HC) groups in the real-world assessment of 7 days. WB are categorized intro 14 thresholds based on time in seconds followed by their average of 7 days.
FIGURE 3Distribution of step velocity (m/s) into different WB thresholds and average of 7 days.
Mixed ANOVA results: main effects and interactions between gait assessment environmental conditions (lab vs. real-world) and groups (PD vs. HC) for each gait characteristic.
| Gait characteristics | Between participant factor: Group (HC, PD) | Within participant factor: Gait data (Lab, real-world, and all WB) | Interaction: Group × Gait | ||||||
|
|
|
| |||||||
|
|
| η2 |
|
| η2 |
|
| η2 | |
| Step Velocity (m/s) | 9.46 | 0.003 | 0.089 | 73.36 | <0.001 | 0.431 | 5.47 | 0.001 | 0.053 |
| Step Length (m) | 9.95 | 0.002 | 0.093 | 138.22 | <0.001 | 0.588 | 2.56 | 0.058 | 0.026 |
| Swing Time Variability (s) | 6.83 | 0.01 | 0.066 | 158.35 | <0.001 | 0.62 | 2.25 | 0.085 | 0.23 |
| Step Time (s) | 5.45 | 0.022 | 0.053 | 42.58 | <0.001 | 0.305 | 4.83 | 0.001 | 0.047 |
| Swing Time (s) | 6.85 | 0.01 | 0.066 | 66.63 | <0.001 | 0.407 | 3.67 | 0.007 | 0.036 |
| Stance Time (s) | 4.64 | 0.034 | 0.046 | 37.07 | <0.001 | 0.276 | 4.54 | 0.002 | 0.045 |
| Step Velocity Variability (m/s) | 0.22 | 0.638 | 0.002 | 128.51 | <0.001 | 0.57 | 1.49 | 0.218 | 0.015 |
| Step Length Variability (m) | 1.58 | 0.211 | 0.016 | 58.31 | <0.001 | 0.375 | 2.11 | 0.097 | 0.021 |
| Step Time Variability (s) | 5.15 | 0.026 | 0.5 | 173.13 | <0.001 | 0.641 | 2.81 | 0.04 | 0.028 |
| Stance Time Variability (s) | 5.79 | 0.018 | 0.056 | 170.86 | <0.001 | 0.638 | 3.14 | 0.023 | 0.031 |
| Step Time Asymmetry (s) | 4.44 | 0.038 | 0.044 | 1018.75 | <0.001 | 0.913 | 1.11 | 0.342 | 0.011 |
| Swing Time Asymmetry (s) | 7.79 | 0.006 | 0.074 | 1117.42 | <0.001 | 0.92 | 2.64 | 0.047 | 0.026 |
| Stance Time Asymmetry (s) | 5.05 | 0.027 | 0.05 | 986.81 | <0.001 | 0.911 | 2.04 | 0.113 | 0.021 |
| Step Length Asymmetry (m) | 0.18 | 0.671 | 0.002 | 911.29 | <0.001 | 0.904 | 2.29 | 0.051 | 0.023 |
Partial eta squared (η
FIGURE 4Comparison of effect of environment (lab vs. real-world) and WB duration (threshold) on discrimination between PD and HC participants (Dark highlighted color means lower p values).
FIGURE 5Association between lab-based gait characteristics with real-world gait. Pearson’s correlation r values for both PD and HC.
FIGURE 6Discrimination of PD from HC based on each individual gait characteristic with area under the receiver operating characteristics curve (AUC).
FIGURE 7Accuracy of trained classifiers based on the lab and real-world assessment of gait grouped by WB durations (▲ means average testing accuracy across all tests on independent dataset).
Evaluation metrics (accuracy, sensitivity, and specificity) of the trained classifiers on the lab and real-world test data under various walking bout (WB) durations.
| Lab/WB durations | Random forest | Support vector machine | Ensemble classifier | ||||||
| Accuracy | Sensitivity | Specificity | Accuracy | Sensitivity | Specificity | Accuracy | Sensitivity | Specificity | |
| Lab (2 min) | 64.67 ± 15.02 | 81.62 ± 11.98 | 50.27 ± 19.03 | 51.33 ± 5.58 | 88.87 ± 5.56 | 18.12 ± 11.65 | 60.67 ± 5.96 | 88.79 ± 3.62 | 35.77 ± 10.71 |
| Real-world All WB | 60.67 ± 10.65 | 79.55 ± 15.94 | 43.89 ± 12.14 | 52.67 ± 7.23 | 82.95 ± 17.90 | 26.71 ± 5.33 | 53.33 ± 7.07 | 80.02 ± 18.85 | 30.72 ± 7.91 |
| WB < 10 s | 58.67 ± 9.60 | 78.09 ± 9.99 | 41.99 ± 17.87 | 58.67 ± 6.06 | 87.94 ± 9.20 | 32.91 ± 11.84 | 56.00 ± 2.79 | 83.75 ± 12.04 | 31.72 ± 7.58 |
| 10 < WB ≤ 30 s | 64.67 ± 10.44 | 76.59 ± 15.01 | 54.30 ± 10.25 | 52.67 ± 5.96 | 79.41 ± 9.52 | 29.04 ± 6.68 | 52.00 ± 6.06 | 71.30 ± 14.78 | 35.43 ± 1.48 |
| 30 < WB ≤ 60 s | 56.67 ± 11.30 | 79.98 ± 15.36 | 36.23 ± 12.96 | 62.00 ± 10.95 | 79.69 ± 11.79 | 46.37 ± 16.47 | |||
| 60 < WB ≤ 120 s | 68.00 ± 8.03 | 78.97 ± 10.76 | 58.06 ± 8.63 | 60.00 ± 6.24 | 73.87 ± 10.63 | 47.88 ± 8.92 | 60.67 ± 5.96 | 75.22 ± 8.75 | 47.81 ± 9.31 |
| WB > 60 s | |||||||||
| WB > 120 s | 56.00 ± 2.79 | 82.19 ± 7.92 | 32.99 ± 8.49 | 62.00 ± 3.80 | 84.87 ± 8.15 | 41.78 ± 8.04 | |||
Bold values mean higher accuracy for PD classification.