| Literature DB >> 29728658 |
Sayantan Ghosh1,2, Tim Fleiner3,4, Eleftheria Giannouli3, Uwe Jaekel5, Sabato Mellone6, Peter Häussermann4, Wiebren Zijlstra3.
Abstract
Long term monitoring of locomotor behaviour in humans using body-worn sensors can provide insight into the dynamical structure of locomotion, which can be used for quantitative, predictive and classification analyses in a biomedical context. A frequently used approach to study daily life locomotor behaviour in different population groups involves categorisation of locomotion into various states as a basis for subsequent analyses of differences in locomotor behaviour. In this work, we use such a categorisation to develop two feature sets, namely state probability and transition rates between states, and use supervised classification techniques to demonstrate differences in locomotor behaviour. We use this to study the influence of various states in differentiating between older adults with and without dementia. We further assess the contribution of each state and transition and identify the states most influential in maximising the classification accuracy between the two groups. The methods developed here are general and can be applied to areas dealing with categorical time series.Entities:
Mesh:
Year: 2018 PMID: 29728658 PMCID: PMC5935746 DOI: 10.1038/s41598-018-25523-4
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Summary statistics of the probability of physical activity for the control and dementia groups.
| Parameters | State probabilities for locomotor behaviour | ||||||
|---|---|---|---|---|---|---|---|
| Lying (Sup) | Sitting Sedentary (SiSe) | Standing Sedentary (StSe) | Postural Transition (PoTr) | Sitting Active (SiAc) | Standing Active (StAc) | Walking (Gait) | |
| Mean¶ | 18.8 | 35.5 | 21.4 | 2.59 | 2.46 | 3.41 | 15.9 |
| 14.1 | 51.2 | 18.4 | 1.21 | 2.25 | 3.41 | 9.42 | |
| Standard Deviation¶ | 23.5 | 15.2 | 10.6 | 1.84 | 1.58 | 1.76 | 8.25 |
| 15.3 | 19.3 | 13.2 | 0.732 | 1.43 | 1.72 | 8.49 | |
| Median¶ | 11.6 | 39.2 | 20.8 | 2.17 | 2.06 | 3.20 | 15.3 |
| 10.4 | 56.1 | 16.3 | 1.03 | 2.06 | 3.35 | 7.45 | |
| 25 | 0.846 | 24.1 | 14.1 | 1.46 | 1.38 | 2.01 | 10.2 |
| 2.16 | 39.1 | 9.74 | 0.741 | 1.18 | 1.97 | 4.41 | |
| 75 | 24.0 | 47.3 | 26.6 | 2.96 | 3.25 | 4.32 | 20.9 |
| 19.0 | 64.9 | 20.7 | 1.52 | 2.78 | 4.45 | 12.8 | |
| Skewness† | 1.91* | −4.90 × 10−1 | 1.14 | 1.94* | 9.48 × 10−1 | 4.43 × 10−1 | 4.36 × 10−1 |
| 1.62* | −7.78 × 10−1 | 2.10* | 1.10 | 1.43* | 2.44 × 10−1 | 3.50* | |
| Kurtosis† | 3.46 | −2.04 × 10−1 | 3.66 | 4.83 | 3.61 × 10−1 | −3.90 × 10−1 | 3.79 × 10−1 |
| 2.32 | −2.00 × 10−1 | 5.51 | 1.43 | 2.46 | −7.31 × 10−1 | 1.83 × 10+1 | |
| SF test§ ( | 1.96 × 10−9* | 5.46 × 10−2 | 3.98 × 10−4* | 1.34 × 10−7* | 4.13 × 10−4* | 4.75 × 10−2 | 2.46 × 10−1 |
| 4.53 × 10−8* | 9.23 × 10−4* | 3.13 × 10−8* | 6.63 × 10−4 | 1.45 × 10−5* | 2.00 × 10−1 | 2.20 × 10−10* | |
| MannU‡ ( | 2.38 × 10−1 | 1.22 × 10−7* | 4.15 × 10−3 | 6.90 × 10−10* | 2.59 × 10−1 | 4.74 × 10−1 | 1.19 × 10−7* |
The first four moments (mean, standard deviation, skewness, and kurtosis), the three quartiles (first, median and third), and tests for normality (Shapiro-Francia), and the Mann-Whitney U test for similarity of distribution are shown. The cells with asterisks show significant behaviour at the p < 0.001. Refer to the table notes and the text for further discussion. The top and bottom rows for each parameter represent the statistics for control and dementia subjects respectively.
¶These rows shows the value of the state probabilities (π × 100) for clearer interpretation.
†The significant skewness and kurtosis are marked with asterisks, following the discussion in Cramer[43].
§The p–values for the Shapiro-Francia test are shown. The p–values marked with asterisks show significant difference between the two groups at 99.9% confidence level (p < 0.001).
‡The p–values for the Mann-Whitney U test are shown. The p–values marked with asterisks (*) show significant difference between the two groups at 99.9% confidence level (p < 0.001).
Figure 1Summary statistics of features. The summary statistics of the two feature sets SP and TR are shown in this figure. Panels (a and c) show the box plots for SP and TR respectively for the two groups (control in dark, and dementia in white). The states and transitions at which the two groups differ significantly, calculated through the Mann-Whitney U-test (p < 0.001), have been highlighted using black stars. The panel (b) displays an empirically constructed transition matrix representative of the control group subjects. The dark pixels represent higher transition rates, the shade lowering with decreasing transition rate. The null transitions are shown in white. The y-axis of the transition matrix represents the numerical coding of the seven states for clarity in interpreting the transition matrix elements (Sup corresponds to state 1, and Gait to state 7). In the panel (c), diagonal elements and the null transition elements have been dropped to preserve visual clarity. Also, all quantities have been plotted on a logarithmic (base-10) scale to highlight the distributional variations amongst the groups.
Figure 2Learning performance. The classification (a) accuracy, (b) area-under-receiver-operating-characteristic-curve (AUCROC), (c) precision, and (d) recall scores (mean of 10-fold CV) for the different supervised learning methods applied to the SP (white hatched) and the TR (gray) feature sets. The errorbars represent the standard deviation of the cross validation. We observe that the classification accuracy is significantly better for the TR feature sets, expect for in the cases of Näive Bayes’, and quadratic discriminant analysis. The dark bars represent the TR feature set, while the hatched bars represent the SP features.
Figure 3Feature importance. The importance of the features (physical activity states) calculated through the Gini impurity coefficient (I) is shown in decreasing order of their importance. The panels (a–c) represent AdaBoost, Decision Tree and Random Forests respectively for the state probability features. The panel (d) represents the feature ranking for the transition rate matrix method.