| Literature DB >> 32978506 |
Soon Bin Kwon1, Yunseo Ku2, Hyuk-Soo Han3, Myung Chul Lee3, Hee Chan Kim1,4,5, Du Hyun Ro6.
Abstract
Knee osteoarthritis (KOA) is characterized by pain and decreased gait function. We aimed to find KOA-related gait features based on patient reported outcome measures (PROMs) and develop regression models using machine learning algorithms to estimate KOA severity. The study included 375 volunteers with variable KOA grades. The severity of KOA was determined using the Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC). WOMAC scores were used to classify disease severity into three groups. A total of 1087 features were extracted from the gait data. An ANOVA and student's t-test were performed and only features that were significant were selected for inclusion in the machine learning algorithm. Three WOMAC subscales (physical function, pain and stiffness) were further divided into three classes. An ANOVA was performed to determine which selected features were significantly related to the subscales. Both linear regression models and a random forest regression was used to estimate patient the WOMAC scores. Forty-three features were selected based on ANOVA and student's t-test results. The following number of features were selected from each joint: 12 from hip, 1 feature from pelvic, 17 features from knee, 9 features from ankle, 1 feature from foot, and 3 features from spatiotemporal parameters. A significance level of < 0.0001 and < 0.00003 was set for the ANOVA and t-test, respectively. The physical function, pain, and stiffness subscales were related to 41, 10, and 16 features, respectively. Linear regression models showed a correlation of 0.723 and the machine learning algorithm showed a correlation of 0.741. The severity of KOA was predicted by gait analysis features, which were incorporated to develop an objective estimation model for KOA severity. The identified features may serve as a tool to guide rehabilitation and progress assessments. In addition, the estimation model presented here suggests an approach for clinical application of gait analysis data for KOA evaluation.Entities:
Mesh:
Year: 2020 PMID: 32978506 PMCID: PMC7519044 DOI: 10.1038/s41598-020-72941-4
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Subject characteristics.
| Feature | Mild (n = 140) | Moderate (n = 182) | Severe (n = 53) | p-value |
|---|---|---|---|---|
| Age | 62.6 (9.1) | 63.7 (10.2) | 63.3 (10.2) | 0.101 |
| WOMAC | 18.9 (11.9) | 48.5 (6.8) | 71.7 (10.3) | < 0.001 |
| Physical Function | 13.8 (8.9) | 35.4 (5.6) | 52.8 (7.8) | < 0.001 |
| Pain | 3.4 (3.2) | 9.1 (2.3) | 13.2 (3.7) | < 0.001 |
| Stiffness | 1.7 (1.5) | 4 (1.7) | 5.7 (1.7) | < 0.001 |
Figure 1Mean values of representative gait parameters for each symptomatic severity of KOA where features were extracted from the (a) ankle power, (b) hip adduction angle, (c) knee flexion angle, and (d) knee varus angle. The shaded area represents standard deviation.
Mean and standard deviation of selected features significantly different for WOMAC severity groups.
| Joint | Parameter | Feature | Mild | Moderate | Severe |
|---|---|---|---|---|---|
| Hip | Rotation moment | Standard deviation | 2.67 (0.63) | 2.39 (0.52) | 2.36 (0.58) |
| Flexion angle | Lower bound of Autocorrelation | − 0.44 (0.0013) | − 0.43 (0.0019) | − 0.43 (0.0022) | |
| Bandwidth frequency bounds | 0.0025 (0.00016) | 0.0024 (0.00022) | 0.024 (0.00021) | ||
| Adduction angle | − | − | − | ||
| Standard deviation of absolute value | 298.18 (262.45) | 425.53 (321.97) | 501.01 (315.01) | ||
| Power | Minimum value during mid-stance | − 1.36 (2.06) | − 0.91 (1.89) | 0.02 (2.01) | |
| Maximum value during terminal stance | 4.72 (3.86) | 3.13 (3.51) | 3.37 (2.84) | ||
| Area under the curve | 121.33 (107.22) | 104.2 (106.16) | 179.68 (115.38) | ||
| Maximum − minimum | 11.56 (4.92) | 9.32 (4.97) | 10.68 (4.16) | ||
| Distance between stance and swing phase using dynamic time wrapping | 125.88 (64.02) | 96.33 (62.12) | 88.94 (49.67) | ||
| Maximum value during mid-swing | 6.74 (3.16) | 5.19 (3.28) | 5.99 (2.7) | ||
| Minimum value during terminal swing | − 0.37 (0.5) | − 0.45 (0.55) | − 0.78 (0.81) | ||
| Pelvic | Obliquity angle | Minimum value during terminal stance to pre-swing | 3.65 (0.83) | 3.19 (0.9) | 3.32 (0.79) |
| Knee | Extension moment | Kurtosis | 2.23 (0.52) | 1.97 (0.45) | 2.11 (0.48) |
| Peak2RMS | 2.14 (0.3) | 1.98 (0.28) | 2.02 (0.28) | ||
| Flexion angle | |||||
| Standard deviation | 16.37 (3.27) | 14.53 (4.06) | 13.93 (4.72) | ||
| Maximum − minimum | 51.4 (8.82) | 46.29 (11.48) | 44.3 (13.34) | ||
| Area under the curve of power spectral density | 276.99 (108.54) | 225.94 (110) | 214.6 (115.55) | ||
| Power | Maximum value during terminal swing | − 7.47 (3.65) | − 5.8 (3.36) | − 6.25 (3.58) | |
| Varus Angle | Maximum value during mid-stance | 5.36 (4.55) | 8.81 (6.06) | 9.34 (6.35) | |
| Maximum Value during Terminal Stance | 5.09 (4.4) | 8.46 (6.01) | 8.95 (6.53) | ||
| Area under the curve | 334.36 (420.53) | 641.28 (530.54) | 710.69 (595.35) | ||
| Root mean square (RMS) | 5.18 (3.03) | 7.77 (3.9) | 8.64 (4.28) | ||
| Peak2RMS | 1.73 (0.49) | 1.42 (0.34) | 1.45 (0.39) | ||
| Area under the curve of power spectrum | 0.94 (1.26) | 2.19 (2.31) | 2.65 (2.38) | ||
| Maximum Value during Terminal swing | 5.39 (4.29) | 7.88 (5.03) | 8.5 (5.47) | ||
| Minimum value during loading response | 3.12 (4.14) | 6.18 (5.52) | 6.4 (5.71) | ||
| Ankle | Plantarflexion moment | Minimum value during loading response | − 0.61 (0.56) | − 0.34 (0.44) | − 0.31 (0.48) |
| Maximum value during initial Swing | − 0.3 (0.11) | − 0.23 (0.14) | − 0.26 (0.11) | ||
| Maximum − minimum | 11.38 (3.68) | 9.76 (3.36) | 10.28 (2.81) | ||
| Power | Kurtosis | 6.73 (1.14) | 6.01 (1.48) | 6.12 (1.27) | |
| Maximum − minimum | 20.98 (9.02) | 16.62 (8.73) | 19.15 (6.99) | ||
| Lower bound of autocorrelation | − 0.43 (0.0031) | − 0.43 (0.0030) | − 0.43 (0.0032) | ||
| Occupied bandwidth | 0.89 (0.25) | 1.09 (0.35) | 0.98 (0.27) | ||
| Foot | Progression angle | Average of absolute value | − 0.44 (0.0013) | − 0.44 (0.0020) | − 0.44 (0.0018) |
| Spatiotemporal | Total speed | 85.41 (18.25) | 75.43 (21.82) | 81.87 (17.44) | |
| Duration of single limb support phase | 35.5 (2.72) | 33.58 (4.38) | 35.34 (3.14) | ||
| Timing of initial double limb support | 14.61 (3.05) | 16.37 (4.38) | 14.42 (2.96) |
The bolded rows are four features that showed most significant difference among each groups.
Figure 2Regression result for WOMAC results using (a) linear regression (b) the random forest algorithm and identified key features.