| Literature DB >> 26805847 |
Andrea Mannini1, Diana Trojaniello2, Andrea Cereatti3, Angelo M Sabatini4.
Abstract
Machine learning methods have been widely used for gait assessment through the estimation of spatio-temporal parameters. As a further step, the objective of this work is to propose and validate a general probabilistic modeling approach for the classification of different pathological gaits. Specifically, the presented methodology was tested on gait data recorded on two pathological populations (Huntington's disease and post-stroke subjects) and healthy elderly controls using data from inertial measurement units placed at shank and waist. By extracting features from group-specific Hidden Markov Models (HMMs) and signal information in time and frequency domain, a Support Vector Machines classifier (SVM) was designed and validated. The 90.5% of subjects was assigned to the right group after leave-one-subject-out cross validation and majority voting. The long-term goal we point to is the gait assessment in everyday life to early detect gait alterations.Entities:
Keywords: Huntington’s disease; elderly; gait classification; hemiparetic; hidden Markov model; inertial sensors; wearable sensors
Mesh:
Year: 2016 PMID: 26805847 PMCID: PMC4732167 DOI: 10.3390/s16010134
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1General block scheme of the algorithm for gait classification.
Figure 2Feature set definition for classification. The information from the subject being tested was not included in the training set.
Features for classification.
| H1. Log-likelihood, EL model (limited to a 2-s window) | ||
| H2. Log-likelihood, PS model | ||
| H3. Log-likelihood, HD model | ||
| H4. Difference between log-likelihoods given EL and PS models (for all available data) | ||
| H5. Difference between log-likelihoods given EL and HD models | ||
| H6. Difference between log-likelihoods given PS and HD models | ||
| T1. Mean value | Evaluated for each channel (84 features) | |
| T2. Standard deviation | ||
| T3. Variance | ||
| T4. Maximum | ||
| T5. Minimum | ||
| T6. Range | ||
| F1. Power at first dominant frequency (P1) | ||
| F2. Power at second dominant frequency | ||
| F3. First dominant frequency | ||
| F4. Second dominant frequency | ||
| F5. Total power (PT) | ||
| F6. P1/PT | ||
Figure 3Block scheme of the SVM classifier validation. The LOSO approach was followed: data from N-1 subjects were used for training and the obtained classifier was tested on the features from the remaining subject.
Dataset details.
| Group | Number of Passages | Number of Strides Per Passage | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Min | Max | Mean | SD | Total | Min | Max | Mean | SD | Total | |
| EL | 6 | 15 | 12.4 | 3.1 | 124 | 2 | 5 | 3.5 | 0.7 | 862 |
| PS | 2 | 11 | 6.3 | 3.1 | 95 | 3 | 24 | 6.0 | 3.6 | 1144 |
| HD | 4 | 16 | 10.1 | 3.4 | 171 | 2 | 21 | 4.3 | 2.6 | 1467 |
| Overall | 2 | 16 | 9.3 | 4.0 | 390 | 2 | 24 | 4.5 | 2.7 | 3473 |
Confusion matrices. Results are reported for two different strategies: (1) solving the classification problem using Hidden Markov Models (HMM) based information only; (2) solving the problem using a Support Vector Machine (SVM) classifier. In the latter case, the contributions of HMM-based features only (2A), of time and frequency domain features only (2B) and of the full features set (2C) are presented separately. Confusion matrices refer to the classification of single mat passages: each entry in the matrix corresponds to a passage on the mat of one foot. The amount of passages is then doubled with respect to data in Table 2. Correct classifications are in bold.
| Classification Output | ||||
|---|---|---|---|---|
| EL | PS | HD | ||
| 0 (0%) | 72 (29%) | |||
| 0 (0%) | 79 (83.2%) | |||
| 0 (0%) | 44 (46.3%) | |||
| 60 (17.5%) | 5 (1.5%) | |||
| Overall accuracy 66.7% of mat passages | ||||
| 50 (20.2%) | 30 (12.1%) | |||
| 11 (11.6%) | 15 (15.8%) | |||
| 0 (0%) | 2 (2.1%) | |||
| 55 (16.1%) | 59 (17.3%) | |||
| Overall accuracy 71.5% of mat passages | ||||
| 0 (0%) | 66 (26.6%) | |||
| 3 (3.2%) | 35 (36.8%) | |||
| 1 (1.1%) | 16 (16.8%) | |||
| 63 (18.4%) | 37 (10.8%) | |||
| Overall accuracy 71.7% of mat passages | ||||
| 48 (19.4%) | 41 (16.5%) | |||
| 0 (0%) | 24 (25.3%) | |||
| 0 (0%) | 7 (7.4%) | |||
| 39 (11.4%) | 49 (14.3%) | |||
| Overall accuracy 73.3% of mat passages | ||||
Confusion matrices obtained after majority voting (MV). Results are reported for two different strategies: (1) solving the classification problem using HMM based information only; (2) solving the problem using a SVM classifier. In the latter case, the contributions of HMM-based features only (2A), of time and frequency domain features only (2B) and of the full features set (2C) are evaluated separately. Each entry in the matrix corresponds to a subject. Post stroke (PS) subjects were reported twice, separating the two sides contributions. Correct classifications are in bold.
| Classification Output | ||||
|---|---|---|---|---|
| EL | PS | HD | ||
| 0 (0%) | 3 (30%) | |||
| 0 (0%) | 10 (66.7%) | |||
| 0 (0%) | 5 (33.3%) | |||
| 1 (5.9%) | 0 (0%) | |||
| Overall accuracy 76.2% of subjects | ||||
| 0 (0%) | 1 (10%) | |||
| 1 (6.7%) | 1 (6.7%) | |||
| 0 (0%) | 0 (0%) | |||
| 1 (5.9%) | 4 (23.5%) | |||
| Overall accuracy 85.7% of subjects | ||||
| 0 (0%) | 2 (20%) | |||
| 0 (0%) | 2 (13.3%) | |||
| 0 (0%) | 1 (6.7%) | |||
| 3 (17.6%) | 0 (0%) | |||
| Overall accuracy 83.3% of subjects | ||||
| 1 (10%) | 0 (0%) | |||
| 0 (0%) | 2 (13.3%) | |||
| 0 (0%) | 0 (0%) | |||
| 0 (0%) | 2 (11.8%) | |||
| Overall accuracy 90.5% of subjects | ||||
Figure 4Output of the classifier after MV in relation to clinical scales: (a) PS subjects (impaired side only), in relation to the FAC scale. No data were misclassified from the PS class to the EL class. Two subjects are misclassified from the PS class to the HD class; (b) HD subjects in relation to the HDRS’ scale. No data were misclassified from the HD class to the EL class. Two subjects were misclassified from the HD class to the PS class.