| Literature DB >> 32312287 |
Nicholas Shawen1,2, Megan K O'Brien1,3, Sanjeev Venkatesan1,4, Luca Lonini1,3, Tanya Simuni5, Jamie L Hamilton6, Roozbeh Ghaffari7, John A Rogers7, Arun Jayaraman8,9,10.
Abstract
BACKGROUND: Parkinson's disease (PD) is a progressive neurological disease, with characteristic motor symptoms such as tremor and bradykinesia. There is a growing interest to continuously monitor these and other symptoms through body-worn sensor technology. However, limited battery life and memory capacity hinder the potential for continuous, long-term monitoring with these devices. There is little information available on the relative value of adding sensors, increasing sampling rate, or computing complex signal features, all of which may improve accuracy of symptom detection at the expense of computational resources. Here we build on a previous study to investigate the relationship between data measurement characteristics and accuracy when using wearable sensor data to classify tremor and bradykinesia in patients with PD.Entities:
Keywords: Bradykinesia; Daily activities; Machine learning; Parkinson’s disease; Soft wearables; Symptom detection; Tremor; Wearable sensors
Year: 2020 PMID: 32312287 PMCID: PMC7168958 DOI: 10.1186/s12984-020-00684-4
Source DB: PubMed Journal: J Neuroeng Rehabil ISSN: 1743-0003 Impact factor: 4.262
Participant demographics and clinical summary
| Participant Characteristics | Values |
|---|---|
| Sex (female/male) | 4 / 9 |
| Age (years) | 62.1 ± 10.7 |
| Time since diagnosis (years) | 6.4 ± 4.5 |
| Fluctuator (yes/no) | 5 / 8 |
| Side predominantly affected at first assessment (right/left/bilateral) | 2 / 8 / 3 |
| MDS part III score, day 1 OFF medication | 28.8 ± 10.2 |
| MDS part III score, day 1 ON medication | 17.9 ± 6.8 |
| MDS part III score, day 2 ON medication | 19.6 ± 6.0 |
Relevant demographic characteristics of study participants included in this analysis. Values are presented as Mean ± Standard Deviation where applicable. Total participants (N) was 13.
Fig. 1Wearable device placement. Participants wore a flexible BioStampRC sensor recording accelerometer and gyroscope data on the dorsal aspect of the hand, secured with adhesive dressing. They also wore an Apple Watch recording accelerometer data on the wrist
Number of data clips scored for tremor and bradykinesia used in the supervised machine learning models
| Score | No. Clips with Tremor Score (%) | No. Clips with Bradykinesia Score (%) | ||
|---|---|---|---|---|
| 0 | 12,143 (73.8%) | 10,485 (73.1%) | 5487 (45.0%) | 4845 (44.8%) |
| > 0 | 4302 (26.2%) | 3854 (26.9%) | 6697 (55.0%) | 5979 (55.2%) |
| 1 | 2684 (16.3%) | 2346 (16.4%) | 4764 (39.1%) | 4213 (38.9%) |
| 2 | 1274 (7.7%) | 1233 (8.6%) | 1835 (15.1%) | 1676 (15.5%) |
| 3 | 344 (2.0%) | 275 (1.9%) | 98 (0.8%) | 90 (0.8%) |
| 4 | 0 (0.0%) | 0 (0.0%) | 0 (0.0%) | 0 (0.0%) |
Number of 5-s clips for tremor and bradykinesia symptoms, by score and device type. Fewer clips are available with bradykinesia scores because not all tasks involved enough movement for the clinician to assess bradykinesia
Feature categorization for supervised machine learning models
| Feature category | Abbreviation | Features | No. Tri-axial features | No. Magnitude features |
|---|---|---|---|---|
| Time | T | Root mean square, range, mean, variance, skew, kurtosis | 18 | 6 |
| Frequency | F | Dominant frequency, Relative magnitude, Moments of power spectral density (mean, standard deviation, skew, kurtosis) | 18 | 6 |
| Entropy | E | Sample entropy | 3 | 1 |
| Correlation | C | Cross-correlation peak (XY,XZ,YZ), Cross-correlation lag (XY,XZ,YZ) | 6 | 0 |
| Derivative | D | Moments of the signal derivative (mean, standard deviation, skew, kurtosis) | 12 | 4 |
Features extracted from both accelerometer and gyroscope data signals and used as inputs for symptom models. Features are shown split into the categories used during the analysis of feature types
Fig. 2Effect of sensor set. ROC curves for (a) Binary and (b) Multiclass models of tremor and bradykinesia with average AUROC. Accelerometer data is sufficient to classify tremor, whereas the combination of gyroscope and accelerometer data improves detection of bradykinesia
Effect of sensor set on model performance
| Sensor Set | Tremor | Bradykinesia | ||
|---|---|---|---|---|
| Combo | 0.78 (0.70–0.86) | 0.76 (0.68–0.83) | 0.67 (0.61–0.74) | 0.65 (0.59–0.71) |
| Accel | 0.77 (0.67–0.87) | 0.74 (0.65–0.82) | 0.63 (0.57–0.70) | 0.63 (0.57–0.68) |
| Gyro | 0.79 (0.74–0.85) | 0.77 (0.72–0.82) | 0.68 (0.61–0.75) | 0.64 (0.59–0.70) |
| Watch | 0.79 (0.69–0.89) | 0.77 (0.68–0.86) | 0.63 (0.56–0.69) | 0.61 (0.56–0.66) |
Average and 95% confidence intervals of model performance (AUROC) to classify PD symptoms using different sensor sets
Effect of sampling rate on model performance
| Sampling Rate (Hz) | Tremor | Bradykinesia | ||
|---|---|---|---|---|
| 62.5 | 0.77 (0.67, 0.87) | 0.74 (0.65, 0.82) | 0.67 (0.61, 0.74) | 0.65 (0.59, 0.71) |
| 50 | 0.77 (0.67, 0.87) | 0.74 (0.66, 0.83) | 0.68 (0.61, 0.74) | 0.65 (0.59, 0.70) |
| 40 | 0.77 (0.67, 0.87) | 0.75 (0.66, 0.84) | 0.68 (0.61, 0.74) | 0.65 (0.59, 0.70) |
| 30 | ||||
| 20 | 0.75 (0.65, 0.85)* | 0.73 (0.64, 0.81) | 0.67 (0.61, 0.74) | 0.65 (0.59, 0.70) |
| 10 | 0.73 (0.64, 0.82)* | 0.70 (0.62, 0.78)* | 0.68 (0.61, 0.74) | 0.64 (0.58, 0.70) |
| 7.5 | 0.72 (0.63, 0.81)* | 0.69 (0.61, 0.77)* | 0.69 (0.62, 0.75) | 0.65 (0.60, 0.70) |
| 5 | 0.70 (0.62, 0.79)* | 0.70 (0.62, 0.78)* | 0.67 (0.60, 0.74) | 0.65 (0.60, 0.71) |
Average and 95% confidence intervals of model performance (AUROC) to classify PD symptoms using different sampling rates for the Accel (tremor) or Combo (bradykinesia) sensor types. Asterisk (*) indicates significant difference from performance at the original sampling rate. Bolded results indicate the sampling rate selected for subsequent analyses
Fig. 3Effect of sampling rate. Model performance (AUROC) for (a) Binary and (b) Multiclass models of tremor and bradykinesia, using the previously determined sensor set for each symptom. Shaded regions depict a 95% confidence interval on the average AUROC centered at the original sampling rate. Decreasing sampling rate reduces ability to classify tremor beyond 20–30 Hz, with only slight impact on classifying bradykinesia. A 30-Hz sampling rate is sufficient to classify both symptoms using the BioStampRC sensor (Sensor) or smart watch (Watch)
Fig. 4Feature set performance and computation time. Combinations of feature categories for the symptom models, ordered by total computation time. Model performance (AUROC) is computed for each model type and feature category combination. Generally, there is a trade-off between feature complexity and model performance, as more comprehensive feature sets improve AUROC but take longer to compute
Computation time and model performance for select feature sets
| C | 4.41 | 0.68 | < 0.001* | C | 4.41 | 0.61 | 0.001* |
| D | 6.47 | 0.73 | < 0.001* | D | 6.47 | 0.64 | 0.006* |
| T | 8.15 | 0.73 | 0.001* | CD | 7.55 | 0.65 | 0.005* |
| DT | 11.29 | 0.74 | 0.018† | T | 8.15 | 0.67 | 0.099 |
| DF | 18.42 | 0.75 | 0.022† | DT | 11.29 | 0.68 | 0.174 |
| FT | 20.09 | 0.75 | 0.004* | ET | 63.02 | 0.68 | 0.222 |
| CDFT | 24.31 | 0.76 | 0.040† | ||||
| CE | 59.29 | 0.77 | 0.126 | ||||
| EF | 70.15 | 0.77 | 0.155 | ||||
| EFT | 74.98 | 0.77 | 0.071 | ||||
| C | 4.41 | 0.67 | 0.001* | C | 4.41 | 0.59 | < 0.001* |
| D | 6.47 | 0.71 | < 0.001* | D | 6.47 | 0.62 | 0.010* |
| DT | 11.29 | 0.72 | 0.001* | CD | 7.55 | 0.62 | 0.001* |
| F | 15.28 | 0.73 | 0.014† | T | 8.15 | 0.64 | 0.011* |
| CF | 16.35 | 0.73 | 0.015† | DT | 11.29 | 0.65 | 0.152 |
| DF | 18.42 | 0.74 | 0.037† | FTE | 74.98 | 0.65 | 0.189 |
| EF | 70.16 | 0.75 | 0.234 | – | |||
| DEF | 73.30 | 0.75 | 0.226 | ||||
| – | |||||||
Total computation time and average AUROC for each combination of feature categories (includes tri-axial and magnitude features). Only features combinations showing improved performance with increasing computation time were included. T = Time, F = Frequency, C = Correlation, D = Derivative, E = Entropy. Asterisk (*) indicates significant difference after Holm-Bonferroni correction (α = 0.05) from the best performing feature set, marked in italic. Dagger (†) indicates additional significant differences when not controlling the family-wise error rate
Fig. 5Effect of magnitude-only features. Relationship between feature computation time and model performance (AUROC) for (a) Binary and (b) Multiclass models of tremor and bradykinesia. Shaded regions depict a 95% confidence interval of the mean AUROC. Features derived from the signal magnitude alone can achieve similar performance to combined tri-axial and magnitude features at a fraction of the computation time