| Literature DB >> 34540191 |
Ghayth AlMahadin1, Ahmad Lotfi1, Marie Mc Carthy2, Philip Breedon1.
Abstract
Tremor is a common symptom of Parkinson's disease (PD). Currently, tremor is evaluated clinically based on MDS-UPDRS Rating Scale, which is inaccurate, subjective, and unreliable. Precise assessment of tremor severity is the key to effective treatment to alleviate the symptom. Therefore, several objective methods have been proposed for measuring and quantifying PD tremor from data collected while patients performing scripted and unscripted tasks. However, up to now, the literature appears to focus on suggesting tremor severity classification methods without discrimination tasks effect on classification and tremor severity measurement. In this study, a novel approach to identify a recommended system is used to measure tremor severity, including the influence of tasks performed during data collection on classification performance. The recommended system comprises recommended tasks, classifier, classifier hyperparameters, and resampling technique. The proposed approach is based on the above-average rule of five advanced metrics results of four subdatasets, six resampling techniques, six classifiers besides signal processing, and features extraction techniques. The results of this study indicate that tasks that do not involve direct wrist movements are better than tasks that involve direct wrist movements for tremor severity measurements. Furthermore, resampling techniques improve classification performance significantly. The findings of this study suggest that a recommended system consists of support vector machine (SVM) classifier combined with BorderlineSMOTE oversampling technique and data collection while performing set of recommended tasks, which are sitting, stairs up and down, walking straight, walking while counting, and standing.Entities:
Mesh:
Year: 2021 PMID: 34540191 PMCID: PMC8448616 DOI: 10.1155/2021/9624386
Source DB: PubMed Journal: J Healthc Eng ISSN: 2040-2295 Impact factor: 2.682
Figure 1Proposed framework for tremor severity classification.
Figure 2Tremor datasets.
Imbalanced classes (severities) distribution.
| Tremor severity | GENEActiv | Pebble | Total | ||
|---|---|---|---|---|---|
| (Class) | Day 1 | Day 4 | Day 1 | Day 4 | ( |
| 0 | 18843 | 16860 | 19389 | 17215 | 72307 |
| 1 | 5845 | 6534 | 4491 | 4421 | 21291 |
| 2 | 2185 | 2921 | 1357 | 1112 | 7575 |
| 3 | 845 | 676 | 117 | 103 | 1741 |
| 4 | 43 | 53 | 11 | 59 | 166 |
Extracted features and their descriptions.
| Feature | Domain | Formula |
|---|---|---|
| Above mean | T and F | | |
| Below mean | T and F | | |
| Autocorrelation | T and F |
|
| Complexity-invariant distance (CID) | T and F |
|
| Sample entropy | T and F | log |
| Kurtosis | T and F |
|
| Skewness | T and F |
|
| Standard deviation | T and F |
|
| Max | T and F |
|
| Mean | T and F | 1/ |
| Median | T and F |
|
| Sum of absolute differences (SAD) | T and F | ∑ |
| Energy | T and F | ∑ |
| Peaks | T | | |
| Amplitude of peak PSD | F |
|
| Median frequency | F | |
| Frequency dispersion | F | |
| Fundamental frequency | F |
|
| Frequency difference | F | |
| Spectral centroid amplitude (SCA) | F | ∑ |
| Maximum weighted PSD | F |
|
W+: window subset contains elements above the mean; W−: window subset contains elements below the mean; W: window length (number of samples); a: the acceleration at time t; l: the lag. : window's samples mean; s: window's samples standard deviation; A(r): the probability that two vectors of m points within a one window would match; A(r): the probability that two vectors of m+1 points within one window would match; W(: window length is odd; W(: window length is even; i: an element position (index) in the window {W}; n: number of neighbours; a(: the acceleration at a time (n+m+k); W: the selected window; e−: the primitive Nth root of unity; fdis: the dispersion frequency in the selected window; f: frequency bin; f: the lowest frequency in the selected window; f: the highest frequency in the selected window; fstep: the range between the median frequency and the lower bound of dispersion frequency, which is equal to the range between median frequency and the higher bound of dispersion frequency, that is, 2fstep is the range between lower and higher bound of of dispersion frequency; PSDfund: the PSD at fundamental frequency.
Classifiers' hyperparameters search spaces.
| Classifier | Hyperparameters search spaces |
|---|---|
| ANN-MLP | batch_size: [32, 64, 512] |
| Epochs: [200, 300] | |
| Neurons: Integer (60, 100) | |
| Optimizer: [SGD, RMSprop, Adam, Adadelta, Adagrad, Adamax, Nadam] | |
| Activation: [relu, tanh, selu, elu, exponential] | |
|
| |
| KNN | n_neighbors: Integer (1, 20) |
| Weights: [Distance, uniform] | |
| Algorithm: [Brute, ball_tree, kd_tree] | |
| Metric: [Minkowski, euclidean, manhattan] | |
| leaf_size: Integer (1, 20) | |
|
| |
| RF | n_estimators: Integer(10, 250) |
| max_features: Integer(1, 102) | |
| max_depth: Integer(5, 100) | |
| min_samples_split: Integer(2, 20) | |
| min_samples_leaf: Integer(1, 20) | |
| Criterion: [gini, entropy] | |
|
| |
| DT | max_features: Integer(1, 102) |
| max_depth: Integer(5, 100) | |
| min_samples_split: Integer(2, 20) | |
| min_samples_leaf: Integer(1, 20) | |
| Criterion: [gini, entropy] | |
|
| |
| LR | Penalty: [l2, none] |
| C: [1 | |
| Solver: [Newton-cg, lbfgs, sag, saga] | |
| max_iter: Integer(1, 1000) | |
|
| |
| SVM | C: [1, 2, 3, 4, 5, 6, 7, 8, 9, 10] |
| Gamma: [0.1, 0.01, 0.001] | |
| Degree: (1, 5) | |
| Kernel: [Linear, poly, rbf, sigmoid] | |
Algorithm 1Recommended tasks algorithm.
Figure 3Recommended classifiers and resampling techniques.
Task highest accuracy of all classifiers and values above average counts.
| Accuracy | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| Without resampling | With resampling | Count above average | |||||||
| G-1 (%) | G-4 (%) | P-1 (%) | P-4 (%) | G-1 (%) | G-4 (%) | P-1 (%) | P-4 (%) | ||
| drawg | 66 | 55 | 88 | 95 | 93 | 91 | 95 | 99 | 3 |
| drnkg | 66 | 58 | 72 | 79 | 93 | 93 | 96 | 97 | 0 |
| fldng | 71 | 63 | 75 | 80 | 94 | 91 | 95 | 96 | 0 |
| ftnl | 77 | 76 | 65 | 62 | 97 | 96 | 95 | 96 | 3 |
| ftnr | 53 | 68 | 76 | 86 | 90 | 98 | 97 | 99 | 3 |
| ntblt | 71 | 63 | 71 | 75 | 95 | 94 | 95 | 96 | 0 |
| orgpa | 66 | 75 | 67 | 77 | 96 | 98 | 96 | 97 | 2 |
| raml | 77 | 79 | 68 | 59 | 96 | 97 | 98 | 94 | 4 |
| ramr | 68 | 59 | 82 | 85 | 96 | 91 | 98 | 99 | 4 |
| typng | 77 | 71 | 75 | 67 | 96 | 93 | 97 | 96 | 1 |
| sittg | 78 | 75 | 87 | 93 | 100 | 98 | 98 | 99 | 8 |
| stndg | 72 | 65 | 77 | 76 | 100 | 98 | 99 | 97 | 3 |
| strsd | 94 | 81 | 89 | 90 | 100 | 100 | 100 | 100 | 8 |
| strsu | 80 | 86 | 90 | 100 | 100 | 100 | 100 | 100 | 8 |
| ststd | 86 | 79 | 88 | 81 | 100 | 99 | 99 | 100 | 7 |
| wlkgc | 76 | 74 | 90 | 83 | 98 | 96 | 99 | 98 | 7 |
| wlkgp | 72 | 73 | 88 | 84 | 96 | 97 | 98 | 98 | 6 |
| wlkgs | 80 | 79 | 90 | 88 | 99 | 98 | 100 | 99 | 8 |
| Average | 74 | 71 | 80 | 81 | 97 | 96 | 98 | 98 | |
G-1: GENEActiv-Day 1; G-4: GENEActiv-Day 4; P-1: Pebble-Day 1; P-4: Pebble-Day 4.
Tasks of above-average count for all metrics.
| Task | Count above average | ||||||
|---|---|---|---|---|---|---|---|
| Accuracy | AUC | F1-score | IBA | Total | |||
| Recommended tasks | strsd | 8 | 8 | 8 | 8 | 8 | 40 |
| sittg | 8 | 7 | 8 | 8 | 8 | 39 | |
| strsu | 8 | 8 | 8 | 6 | 6 | 36 | |
| wlkgs | 8 | 8 | 8 | 6 | 6 | 36 | |
| wlkgc | 7 | 8 | 7 | 5 | 5 | 32 | |
| ststd | 7 | 7 | 7 | 5 | 4 | 30 | |
|
| |||||||
| Neutral tasks | ftnr | 3 | 6 | 4 | 6 | 5 | 24 |
| raml | 4 | 6 | 3 | 6 | 5 | 24 | |
| wlkgp | 6 | 7 | 6 | 2 | 3 | 24 | |
| ramr | 4 | 5 | 4 | 5 | 5 | 23 | |
| stndg | 3 | 7 | 3 | 5 | 5 | 23 | |
| ftnl | 3 | 4 | 3 | 4 | 4 | 18 | |
|
| |||||||
| Not recommended task | orgpa | 2 | 6 | 2 | 2 | 2 | 14 |
| drawg | 3 | 2 | 3 | 2 | 2 | 12 | |
| typng | 1 | 5 | 1 | 1 | 1 | 9 | |
| fldng | 0 | 4 | 0 | 2 | 2 | 8 | |
| drnkg | 0 | 3 | 0 | 1 | 1 | 5 | |
| ntblt | 0 | 1 | 0 | 0 | 0 | 1 | |
Figure 4Recommended classifiers and resampling techniques results (strsd).
Potential recommended systems.
| System | Task | Classifier | Resample technique | Validation score (%) | Hyperparameters | Mean fit time |
|---|---|---|---|---|---|---|
| System 1 | strsd | SVM | ADASYN | 100.00 | 2.549183011 | |
| System 2 | sittg | SVM | ADASYN | 99.47 | 5.469041586 | |
| System 3 | wlkgs | SVM | ADASYN | 98.34 | 4.719249964 | |
| System 4 | strsu | SVM | SMOTETomek | 100.00 | 0.045000315 | |
| System 5 | wlkgc | SVM | SMOTEENN | 98.46 | 1.642106652 | |
| System 6 | ststd | SVM | BorderlineSMOTE | 99.14 | 6.840166569 |
Potential systems performance.
| System | Classifier | Resample technique | Accuracy (%) | IBA (%) | AUC (%) | ||
|---|---|---|---|---|---|---|---|
| System 1 | SVM | ADASYN | 97 | 97 | 96 | 98 | 99 |
| System 2 | SVM | ADASYN | 97 | 97 | 96 | 98 | 99 |
| System 3 | SVM | ADASYN | 97 | 97 | 96 | 98 | 100 |
| System 4 | SVM | SMOTETomek | 96 | 96 | 94 | 97 | 99 |
| System 5 | SVM | SMOTEENN | 93 | 93 | 90 | 95 | 99 |
| System 6 | SVM | BorderlineSMOTE | 98 | 98 | 97 | 98 | 100 |
Top four systems tremor severity predictions.
| Sample data | Actual severity | Predicted severity | |||
|---|---|---|---|---|---|
| System 1 | System 2 | System 3 | System 6 | ||
| Sample_01 | 0 | 0 | 0 | 0 | 0 |
| Sample_02 | 1 | 1 | 1 | 1 | 1 |
| Sample_03 | 2 | 2 | 2 | 2 | 2 |
| Sample_04 | 3 | 3 | 3 | 3 | 3 |
| Sample_05 | 4 | 4 | 4 | 4 | 4 |
| Sample_06 | 0 | 0 | 0 | 0 | 0 |
| Sample_07 | 1 | 1 | 1 | 1 | 1 |
| Sample_08 | 2 | 2 | 2 | 2 | 2 |
| Sample_09 | 3 | 3 | 3 | 3 | 3 |
| Sample_10 | 4 | 4 | 4 | 4 | 4 |
| Sample_11 | 0 | 0 | 0 | 0 | 0 |
| Sample_12 | 1 | 1 | 1 | 1 | 1 |
| Sample_13 | 2 | 2 | 2 | 2 | 2 |
| Sample_14 | 3 | 3 | 3 | 3 | 3 |
| Sample_15 | 4 | 4 | 4 | 4 | 4 |
| Sample_16 | 0 | 0 | 0 | 0 | 0 |
| Sample_17 | 1 | 1 | 1 | 1 | 1 |
| Sample_18 | 2 | 2 | 2 | 2 | 2 |
| Sample_19 | 3 |
| 3 |
| 3 |
| Sample_20 | 4 | 4 | 4 | 4 | 4 |
The misclassified samples are in bold.
Figure 5Recommended system (system 6) confusion matrix and ROC curve.