| Literature DB >> 32098362 |
Sebastian Scheurer1, Salvatore Tedesco2, Kenneth N Brown1, Brendan O'Flynn1,2,3.
Abstract
Human activity recognition (HAR) has become an increasingly popular application of machine learning across a range of domains. Typically the HAR task that a machine learning algorithm is trained for requires separating multiple activities such as walking, running, sitting, and falling from each other. Despite a large body of work on multi-class HAR, and the well-known fact that the performance on a multi-class problem can be significantly affected by how it is decomposed into a set of binary problems, there has been little research into how the choice of multi-class decomposition method affects the performance of HAR systems. This paper presents the first empirical comparison of multi-class decomposition methods in a HAR context by estimating the performance of five machine learning algorithms when used in their multi-class formulation, with four popular multi-class decomposition methods, five expert hierarchies-nested dichotomies constructed from domain knowledge-or an ensemble of expert hierarchies on a 17-class HAR data-set which consists of features extracted from tri-axial accelerometer and gyroscope signals. We further compare performance on two binary classification problems, each based on the topmost dichotomy of an expert hierarchy. The results show that expert hierarchies can indeed compete with one-vs-all, both on the original multi-class problem and on a more general binary classification problem, such as that induced by an expert hierarchy's topmost dichotomy. Finally, we show that an ensemble of expert hierarchies performs better than one-vs-all and comparably to one-vs-one, despite being of lower time and space complexity, on the multi-class problem, and outperforms all other multi-class decomposition methods on the two dichotomous problems.Entities:
Keywords: ensembles of nested dichotomies; error-correcting output codes; hierarchical classification; human activity recognition; inertial sensors; machine learning; multi-class classification; wearable sensors
Year: 2020 PMID: 32098362 PMCID: PMC7070332 DOI: 10.3390/s20041208
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
The 17 activities and the proportion (%) with which they are represented in the data-set.
| Activity | % |
|---|---|
| All 4 s | 5.5 |
| Crouch | 4.2 |
| Lie | 5.7 |
| Sit | 9.3 |
| Stand | 8.2 |
| Fall | 1.7 |
| Jump Up | 1.8 |
| Jump Down | 1.8 |
| Crawl Hands & Knees | 5.8 |
| Military Crawl | 4.8 |
| Duck Walk | 3.9 |
| Walk Horizontally | 5.7 |
| Walk Down | 8.0 |
| Walk Up | 9.8 |
| Run Horizontally | 12.4 |
| Run Down | 5.7 |
| Run Up | 5.6 |
Figure A1Expert hierarchy 1 (EH1).
Figure A2Expert hierarchy 2 (EH2).
Figure A3Expert hierarchy 3 (EH3).
Figure A4Expert hierarchy 4 (EH4).
Figure A5Expert hierarchy 5 (EH5).
Mean % (± SE) on the multi-class human activity recognition (HAR) problem for each machine learning algorithm and multi-class decomposition method. The first six multi-class decomposition methods are given in order of decreasing average score, the expert hierarchies ordered alphabetically.
| GBT | SVM | DT | kNN | GLM | SVM-MCL | Avg. | |
|---|---|---|---|---|---|---|---|
| OVO | 95.37 ± 0.29 | 90.88 ± 0.26 | 90.56 ± 0.20 | 83.37 ± 0.24 | 87.35 ± 0.47 | 87.10 ± 0.36 | 89.11 ± 1.68 |
| END | 95.30 ± 0.27 | 91.08 ± 0.31 | 92.19 ± 0.33 | 85.88 ± 0.34 | 81.37 ± 0.47 | 82.34 ± 0.51 | 88.03 ± 2.32 |
| OVA | 95.24 ± 0.25 | 90.67 ± 0.33 | 89.41 ± 0.45 | 85.45 ± 0.24 | 83.43 ± 0.44 | 83.88 ± 0.53 | 88.01 ± 1.88 |
| EEH | 95.31 ± 0.22 | 90.29 ± 0.34 | 87.06 ± 0.34 | 85.43 ± 0.32 | 84.89 ± 0.32 | 84.85 ± 0.33 | 87.97 ± 1.69 |
| MCL | 95.85 ± 0.20 | - | 84.21 ± 0.49 | 85.45 ± 0.24 | 85.67 ± 0.50 | 80.72 ± 0.47 | 86.38 ± 2.53 |
| ECOC | 93.91 ± 0.21 | 90.33 ± 0.32 | 95.84 ± 0.20 | 85.74 ± 0.27 | 73.93 ± 0.29 | 71.56 ± 0.56 | 85.22 ± 4.20 |
| Avg. | 95.16 ± 0.27 | 90.65 ± 0.15 | 89.88 ± 1.65 | 85.22 ± 0.38 | 82.77 ± 1.95 | 81.74 ± 2.22 | 87.45 ± 0.57 |
| EH1 | 94.95 ± 0.21 | 89.42 ± 0.39 | 85.29 ± 0.19 | 85.36 ± 0.33 | 83.37 ± 0.34 | 83.51 ± 0.30 | 86.98 ± 1.83 |
| EH2 | 94.87 ± 0.24 | 89.64 ± 0.30 | 85.29 ± 0.39 | 85.54 ± 0.28 | 83.51 ± 0.37 | 83.40 ± 0.34 | 87.04 ± 1.82 |
| EH3 | 94.81 ± 0.21 | 89.94 ± 0.41 | 85.68 ± 0.35 | 85.55 ± 0.26 | 83.62 ± 0.37 | 83.65 ± 0.30 | 87.21 ± 1.79 |
| EH4 | 94.76 ± 0.24 | 89.84 ± 0.38 | 85.45 ± 0.28 | 85.43 ± 0.30 | 82.99 ± 0.50 | 83.10 ± 0.39 | 86.93 ± 1.87 |
| EH5 | 94.69 ± 0.31 | 89.65 ± 0.25 | 84.93 ± 0.32 | 85.39 ± 0.25 | 81.11 ± 0.37 | 80.89 ± 0.36 | 86.11 ± 2.16 |
| Avg. | 94.82 ± 0.04 | 89.70 ± 0.09 | 85.33 ± 0.12 | 85.45 ± 0.04 | 82.92 ± 0.46 | 82.91 ± 0.51 | 86.85 ± 0.19 |
Figure 199% confidence intervals (C.I.) for the effect of the multi-class decomposition method (MDM) on the Kappa statistic for the full 17-class problem.
Mean % (± SE) for the topmost dichotomy of expert hierarchy (EH1) (Stationary vs. obile). The first six multi-class decomposition methods are given in order of decreasing average score, the expert hierarchies in alphabetical order.
| GBT | SVM | DT | kNN | GLM | SVM-MCL | Avg. | |
|---|---|---|---|---|---|---|---|
| EEH | 99.85 ± 0.06 | 99.77 ± 0.09 | 99.67 ± 0.08 | 99.14 ± 0.16 | 99.72 ± 0.10 | 99.70 ± 0.07 | 99.64 ± 0.10 |
| OVO | 99.85 ± 0.06 | 99.70 ± 0.07 | 99.80 ± 0.06 | 99.19 ± 0.12 | 99.52 ± 0.10 | 99.62 ± 0.08 | 99.61 ± 0.10 |
| END | 99.77 ± 0.09 | 99.65 ± 0.10 | 99.59 ± 0.08 | 99.11 ± 0.15 | 99.14 ± 0.11 | 99.06 ± 0.13 | 99.39 ± 0.13 |
| MCL | 99.75 ± 0.08 | - | 99.32 ± 0.11 | 99.09 ± 0.17 | 99.52 ± 0.14 | 98.43 ± 0.25 | 99.22 ± 0.23 |
| ECOC | 99.77 ± 0.07 | 99.42 ± 0.11 | 99.80 ± 0.06 | 98.99 ± 0.16 | 96.11 ± 0.31 | 93.87 ± 0.87 | 97.99 ± 1.00 |
| OVA | 99.80 ± 0.08 | 99.27 ± 0.20 | 93.02 ± 0.44 | 99.09 ± 0.17 | 98.78 ± 0.13 | 97.80 ± 0.38 | 97.96 ± 1.02 |
| Avg. | 99.80 ± 0.02 | 99.56 ± 0.09 | 98.53 ± 1.11 | 99.10 ± 0.03 | 98.80 ± 0.55 | 98.08 ± 0.89 | 98.97 ± 0.32 |
| EH1 | 99.82 ± 0.07 | 99.72 ± 0.10 | 99.39 ± 0.09 | 99.09 ± 0.17 | 99.60 ± 0.08 | 99.57 ± 0.07 | 99.53 ± 0.11 |
| EH2 | 99.77 ± 0.06 | 99.39 ± 0.17 | 98.66 ± 0.14 | 99.06 ± 0.15 | 98.78 ± 0.15 | 98.66 ± 0.15 | 99.05 ± 0.18 |
| EH3 | 99.82 ± 0.07 | 99.70 ± 0.08 | 99.52 ± 0.10 | 99.06 ± 0.17 | 99.62 ± 0.10 | 99.47 ± 0.12 | 99.53 ± 0.11 |
| EH4 | 99.82 ± 0.07 | 99.72 ± 0.10 | 99.26 ± 0.16 | 99.11 ± 0.16 | 99.49 ± 0.12 | 99.49 ± 0.12 | 99.48 ± 0.11 |
| EH5 | 99.80 ± 0.07 | 99.52 ± 0.12 | 99.14 ± 0.15 | 99.09 ± 0.17 | 99.06 ± 0.13 | 98.91 ± 0.16 | 99.25 ± 0.14 |
| Avg. | 99.81 ± 0.01 | 99.61 ± 0.07 | 99.19 ± 0.15 | 99.08 ± 0.01 | 99.31 ± 0.17 | 99.22 ± 0.18 | 99.37 ± 0.09 |
Mean % (± SE) for the topmost dichotomy of EH4 (Possible Emergency vs. non-Emergency). The first six multi-class decomposition methods are given in order of decreasing average score, the five expert hierarchies in alphabetical order.
| GBT | SVM | DT | kNN | GLM | SVM-MCL | Avg. | |
|---|---|---|---|---|---|---|---|
| EEH | 94.11 ± 0.61 | 92.46 ± 0.47 | 87.99 ± 0.66 | 86.86 ± 0.79 | 86.71 ± 0.82 | 87.59 ± 0.70 | 89.29 ± 1.30 |
| OVO | 94.93 ± 0.48 | 90.90 ± 0.73 | 89.69 ± 0.63 | 82.41 ± 1.02 | 89.32 ± 0.87 | 88.41 ± 0.93 | 89.28 ± 1.66 |
| END | 94.34 ± 0.51 | 92.68 ± 0.54 | 90.66 ± 0.61 | 86.97 ± 0.89 | 83.44 ± 1.57 | 84.79 ± 1.30 | 88.81 ± 1.80 |
| MCL | 94.74 ± 0.57 | - | 81.38 ± 1.22 | 87.35 ± 0.72 | 88.18 ± 0.57 | 80.87 ± 0.86 | 86.50 ± 2.54 |
| ECOC | 94.83 ± 0.30 | 91.19 ± 0.57 | 92.69 ± 0.39 | 85.65 ± 0.59 | 77.26 ± 0.90 | 76.34 ± 1.41 | 86.33 ± 3.26 |
| OVA | 94.36 ± 0.50 | 92.91 ± 0.56 | 59.74 ± 1.34 | 87.35 ± 0.72 | 86.20 ± 1.12 | 84.89 ± 1.05 | 84.24 ± 5.14 |
| Avg. | 94.55 ± 0.13 | 92.03 ± 0.41 | 83.69 ± 5.04 | 86.10 ± 0.78 | 85.19 ± 1.78 | 83.81 ± 1.84 | 87.41 ± 0.84 |
| EH1 | 94.78 ± 0.51 | 91.30 ± 0.69 | 82.30 ± 0.61 | 86.95 ± 0.67 | 84.35 ± 1.24 | 84.39 ± 1.08 | 87.34 ± 1.95 |
| EH2 | 94.06 ± 0.54 | 91.00 ± 0.55 | 84.02 ± 0.70 | 87.70 ± 0.67 | 84.62 ± 0.83 | 83.93 ± 0.88 | 87.55 ± 1.72 |
| EH3 | 93.16 ± 0.68 | 92.39 ± 0.61 | 83.54 ± 0.79 | 87.54 ± 0.74 | 85.52 ± 0.84 | 86.17 ± 0.69 | 88.05 ± 1.59 |
| EH4 | 91.75 ± 0.69 | 90.53 ± 0.44 | 81.74 ± 0.59 | 87.35 ± 0.72 | 79.98 ± 1.36 | 81.15 ± 1.23 | 85.42 ± 2.09 |
| EH5 | 93.65 ± 0.58 | 90.82 ± 0.55 | 81.81 ± 0.94 | 87.48 ± 0.84 | 82.43 ± 1.03 | 82.97 ± 0.83 | 86.53 ± 2.01 |
| Avg. | 93.48 ± 0.51 | 91.21 ± 0.32 | 82.68 ± 0.46 | 87.40 ± 0.13 | 83.38 ± 0.99 | 83.72 ± 0.83 | 86.98 ± 0.46 |
Figure 299% confidence intervals (C.I.) for the effect of the multi-class decomposition method (MDM) on the Kappa statistic for the topmost dichotomy of EH1 (left) and EH4 (right).
Estimated logistic regression coefficients with p < 0.1 for the multi-class problem.
| Coefficient | 0.5% |
| 99.5% |
|
|---|---|---|---|---|
| (Intercept) | 2.87 | 2.99 | 3.13 | <2.0 × 10−32 |
| SVM | −0.88 | −0.72 | −0.56 | 1.0 × 10−31 |
| DT | −1.02 | −0.86 | −0.7 | <2.0 × 10−32 |
| kNN | −1.38 | −1.22 | −1.08 | <2.0 × 10−32 |
| GLM | −1.53 | −1.38 | −1.23 | <2.0 × 10−32 |
| SVM-MCL | −1.5 | −1.35 | −1.2 | <2.0 × 10−32 |
| ECOC | −0.43 | −0.26 | −0.09 | 9.6 × 10−5 |
| ECOC ∧ SVM | 0.0 | 0.22 | 0.44 | 8.6 × 10−3 |
| ECOC ∧ DT | 1.03 | 1.26 | 1.5 | <2.0 × 10−32 |
| ECOC ∧ kNN | 0.08 | 0.28 | 0.49 | 3.5 × 10−4 |
| ECOC ∧ GLM | −0.51 | −0.31 | −0.12 | 3.8 × 10−5 |
| ECOC ∧ SVM-MCL | −0.66 | −0.47 | −0.27 | 9.3 × 10−10 |
| EEH ∧ DT | −0.46 | 0.24 | −0.02 | 4.3 × 10−3 |
| EH ∧ DT | −0.46 | −0.28 | −0.11 | 1.9 × 10−5 |
| END ∧ DT | 0.09 | 0.32 | 0.55 | 3.0 × 10−4 |
| END ∧ GLM | −0.36 | −0.15 | 0.05 | 5.5 × 10−2 |
| MCL | −0.04 | 0.15 | 0.33 | 4.6 × 10−2 |
| MCL ∧ DT | −0.83 | −0.61 | −0.38 | 2.4 × 10−12 |
| MCL ∧ kNN | −0.36 | −0.15 | 0.07 | 8.5 × 10−2 |
| OVO ∧ kNN | −0.4 | −0.19 | 0.02 | 2.2 × 10−2 |
| OVO ∧ GLM | 0.07 | 0.29 | 0.5 | 5.4 × 10−4 |
| OVO ∧ SVM-MCL | 0.02 | 0.23 | 0.44 | 5.2 × 10−3 |
Estimated logistic regression coefficients with p < 0.1 for the binary problem induced by the topmost dichotomy of EH1.
| Coefficient | 0.5% |
| 99.5% |
|
|---|---|---|---|---|
| (Intercept) | 5.65 | 6.2 | 6.87 | <2.0 × 10 |
| SVM | −2.03 | −1.29 | −0.64 | 1.2 × 10 |
| DT | −4.29 | −3.61 | −3.05 | <2.0 × 10 |
| kNN | −2.24 | −1.51 | −0.88 | 6.2 × 10 |
| GLM | −2.52 | −1.8 | −1.19 | 1.3 × 10 |
| SVM-MCL | −3.1 | −2.4 | −1.82 | 1.6 × 10 |
| ECOC ∧ DT | 2.71 | 3.73 | 4.8 | 1.9 × 10 |
| ECOC ∧ GLM | −1.95 | −1.07 | −0.17 | 1.8 × 10 |
| ECOC ∧ SVM-MCL | −1.81 | −0.95 | −0.07 | 4.6 × 10 |
| EEH ∧ SVM | −0.26 | 0.89 | 2.03 | 4.4 × 10 |
| EEH ∧ DT | 1.77 | 2.83 | 3.88 | 3.0 × 10 |
| EEH ∧ GLM | 0.09 | 1.2 | 2.29 | 4.7 × 10 |
| EEH ∧ SVM-MCL | 0.62 | 1.71 | 2.78 | 3.6 × 10 |
| EH ∧ SVM | −0.15 | 0.58 | 1.39 | 4.9 × 10 |
| EH ∧ DT | 1.52 | 2.17 | 2.91 | 4.7 × 10 |
| EH ∧ GLM | −0.17 | 0.52 | 1.3 | 6.4 × 10 |
| EH ∧ SVM-MCL | 0.33 | 1.0 | 1.75 | 2.7 × 10 |
| END ∧ SVM | −0.15 | 0.85 | 1.87 | 2.9 × 10 |
| END ∧ DT | 2.09 | 3.03 | 3.99 | 1.3 × 10 |
| END ∧ SVM-MCL | 0.08 | 0.99 | 1.9 | 4.8 × 10 |
| MCL ∧ DT | 1.73 | 2.61 | 3.52 | 2.8 × 10 |
| MCL ∧ GLM | 0.23 | 1.16 | 2.12 | 1.4 × 10 |
| OVO ∧ DT | 2.2 | 3.32 | 4.45 | 1.4 × 10 |
| OVO ∧ SVM-MCL | 0.42 | 1.49 | 2.53 | 2.4 × 10 |
Estimated logistic regression coefficients with p < 0.1 for the binary problem induced by the topmost dichotomy of EH4.
| Coefficient | 0.5% |
| 99.5% |
|
|---|---|---|---|---|
| (Intercept) | 2.7 | 2.82 | 2.94 | <2.0 × 10 |
| SVM | −0.4 | −0.25 | −0.09 | 7.0 × 10 |
| DT | −2.56 | −2.42 | −2.29 | 8.9 × 10 |
| kNN | −1.03 | −0.88 | −0.74 | <2.0 × 10 |
| GLM | −1.13 | −0.99 | −0.84 | <2.0 × 10 |
| SVM-MCL | −1.23 | −1.09 | −0.95 | <2.0 × 10 |
| ECOC ∧ SVM | −0.55 | −0.33 | −0.1 | 1.6 × 10 |
| ECOC ∧ DT | 1.85 | 2.05 | 2.26 | <2.0 × 10 |
| ECOC ∧ kNN | −0.44 | −0.24 | −0.03 | 2.9 × 10 |
| ECOC ∧ GLM | −0.9 | −0.7 | −0.5 | 1.3 × 10 |
| ECOC ∧ SVM-MCL | −0.84 | −0.65 | −0.45 | 4.1 × 10 |
| EEH ∧ DT | 1.45 | 1.64 | 1.84 | <2.0 × 10 |
| EEH ∧ SVM-MCL | 0.07 | 0.27 | 0.47 | 4.3 × 10 |
| EH | −0.28 | −0.15 | −0.03 | 2.0 × 10 |
| EH ∧ DT | 1.18 | 1.32 | 1.47 | <2.0 × 10 |
| EH ∧ kNN | 0.0 | 0.16 | 0.32 | 9.0 × 10 |
| END ∧ DT | 1.68 | 1.88 | 2.08 | <2.0 × 10 |
| END ∧ GLM | −0.41 | −0.21 | −0.01 | 6.0 × 10 |
| MCL ∧ DT | 0.81 | 1.01 | 1.2 | <2.0 × 10 |
| OVO | −0.06 | 0.11 | 0.29 | 8.9 × 10 |
| OVO ∧ SVM | −0.61 | −0.38 | −0.16 | 8.9 × 10 |
| OVO ∧ DT | 1.45 | 1.66 | 1.86 | <2.0 × 10 |
| OVO ∧ kNN | −0.7 | −0.5 | −0.3 | 2.2 × 10 |
| OVO ∧ GLM | −0.03 | 0.18 | 0.39 | 2.7 × 10 |
| OVO ∧ SVM-MCL | −0.01 | 0.19 | 0.4 | 1.6 × 10 |