| Literature DB >> 34065245 |
Jeremy Watts1, Anahita Khojandi1, Rama Vasudevan2, Fatta B Nahab3, Ritesh A Ramdhani4.
Abstract
Parkinson's disease medication treatment planning is generally based on subjective data obtained through clinical, physician-patient interactions. The Personal KinetiGraph™ (PKG) and similar wearable sensors have shown promise in enabling objective, continuous remote health monitoring for Parkinson's patients. In this proof-of-concept study, we propose to use objective sensor data from the PKG and apply machine learning to cluster patients based on levodopa regimens and response. The resulting clusters are then used to enhance treatment planning by providing improved initial treatment estimates to supplement a physician's initial assessment. We apply k-means clustering to a dataset of within-subject Parkinson's medication changes-clinically assessed by the MDS-Unified Parkinson's Disease Rating Scale-III (MDS-UPDRS-III) and the PKG sensor for movement staging. A random forest classification model was then used to predict patients' cluster allocation based on their respective demographic information, MDS-UPDRS-III scores, and PKG time-series data. Clinically relevant clusters were partitioned by levodopa dose, medication administration frequency, and total levodopa equivalent daily dose-with the PKG providing similar symptomatic assessments to physician MDS-UPDRS-III scores. A random forest classifier trained on demographic information, MDS-UPDRS-III scores, and PKG time-series data was able to accurately classify subjects of the two most demographically similar clusters with an accuracy of 86.9%, an F1 score of 90.7%, and an AUC of 0.871. A model that relied solely on demographic information and PKG time-series data provided the next best performance with an accuracy of 83.8%, an F1 score of 88.5%, and an AUC of 0.831, hence further enabling fully remote assessments. These computational methods demonstrate the feasibility of using sensor-based data to cluster patients based on their medication responses with further potential to assist with medication recommendations.Entities:
Keywords: PKG; Parkinson’s disease; clustering; decision support tool; levodopa; machine learning; regimen; remote assessment; wearable sensors
Mesh:
Substances:
Year: 2021 PMID: 34065245 PMCID: PMC8160757 DOI: 10.3390/s21103553
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Sample PKG sensor time-series data [23], displaying an individual’s change in dyskinesia and bradykinesia scores in response to medication with the median, 25th, and 75th percentile, compared to a non-PD control group averaged over six days. (A) The dyskinesia time-series, (B) the bradykinesia time-series, and (C) the patient’s self-reported acknowledgment of medication administration.
Figure 2The study design is divided into two parts as labeled by the blocks “Clustering” and “Prediction”. In the clustering block, (A) we begin with a diverse cohort of PD patients; (B) each patient is assessed by MDS-UPDRS-III scores and PKG summary dyskinesia and bradykinesia scores; the physician icon is greyed out as in the future for some remote contexts this may be accomplished using only the PKG time-series data, but we currently collect both for validation purposes; (C) we identify similar medication regimen clusters through k-means clustering. These clusters are used in the prediction block; (D) we optimize medication regimens and perform statistical analysis on demographic similarities for each group—to create a decision support tool to provide enhanced initial regimen estimates; (E) machine learning methods, specifically random forest, are applied to predict an unknown patient’s optimized regimen cluster based on physician assessment and/or wearable sensor measurements depending on the context; (F) the new patient’s data will be incorporated to improve the accuracy of the decision support tool.
Figure A1WCSS as a function of the number of clusters when clustering patients based on their best symptom control medication regimens between the two visits according to PKG’s summary dyskinesia and bradykinesia scores. The number of clusters is set to four, as a number towards the bottom of the “elbow” is considered best, but further increasing the number of clusters has been shown to reduce the model’s robustness.
Features included in the random forest analysis based on preliminary experiments.
| Calculated Features | Description |
|---|---|
|
| |
| Dyskinesia ar coefficient k 10 coeff 2 | Unconditional maximum likelihood of an autoregressive AR(k) process—coeff 2—k10 |
| Dyskinesia fft coefficient coeff 39 attr “angle” | One-dimensional discrete fast Fourier transform—coeff 39—angle |
| Dyskinesia spkt welch density coeff 5 | Cross power spectral density—coeff 5 |
|
| |
| Bradykinesia agg autocorrelation f agg “mean” maxlag 40 | Aggregation function fagg—mean maxlag—40 |
| Bradykinesia agg linear trend f agg “mean” chunk len 50 att “rvalue” | Linear least-squares regression aggregated over chunks versus the sequence from 0 up to the number of chunks minus one—mean—chunk length 50—attribute rvalue |
| bradykinesia agg linear trend f agg “min” chunk len 50 attr “slope” | Linear least-squares regression aggregated over chunks versus the sequence from 0 up to the number of chunks minus one—mean—chunk length 50—attribute slope |
| Bradykinesia fft coefficient coeff 23 attr “abs” | One-dimensional discrete fast Fourier transform—coeff 23—abs |
| Bradykinesia fft coefficient coeff 94 attr “angle” | One-dimensional discrete fast Fourier transform—coeff 94—angle |
| Bradykinesia fft coefficient coeff 76 attr “imag” | One-dimensional discrete fast Fourier transform—coeff 76—imag |
| Bradykinesia index mass quantile q 0.9 | Relative index i where q% of the mass of the time series x lie left of I-quantile 0.9 |
Patient demographics and clinical characteristics.
| Mean ± SD | |
|---|---|
| Gender (female/male) | 9/17 |
| Age in years | 71.19 ± 9.70 |
| Age at diagnosis (years) | 65.77 ± 10.37 |
| Visit 1 MDS-UPDRS-III | 28.89 ± 14.07 |
| Visit 2 MDS-UPDRS-III | 24.12 ± 13.50 |
| Visit 1 H&Y stage | 1.77 ± 0.71 |
| Visit 2 H&Y stage | 1.85 ± 0.78 |
| Visit 1 levodopa equivalent dose (mg) | 498.94 ± 309.88 |
| Visit 2 levodopa equivalent dose (mg) | 637.40 ± 322.37 |
| Time between clinical visits (days) | 65.62 ± 26.46 |
Figure 3(a,b) demonstrates the clusters for the motor function changes between visit 1 to visit 2 based on the MDS-UPDRS-III scores and the PKG’s summary dyskinesia and bradykinesia scores, respectively. (c,d) highlight the subjects’ best symptom control recorded using MDS-UPDRS-III scores and PKG’s summary dyskinesia and bradykinesia scores, respectively. The large shapes denote each cluster’s centroid, and the exterior marker of each point corresponds to the cluster centroid shape. The capital letters (A–D) are used to refer to each cluster. Each point’s interior maker in (a,b) represents the state of MDS-UPDRS-III and PKG change for each patient, where a black interior-point denotes the patient stayed the same between visits, a green point denotes that the patient’s visit 2 scores were better than their visit 1 scores. A red point denotes that the patient’s visit 2 scores were worse than their visit 1 scores. Note that due to the three-dimensional projection of the plot, the distance between points may appear skewed. See Supplemental Information for a 3-D animation showcasing the clusters’ position in space.
Demographic and clinical information for clusters depicted in Figure 3a–d.
| Cluster A | Cluster B | Cluster C | Cluster D | |
|---|---|---|---|---|
| Visit 2 MDS-UPDRS-III & PKG Scores | ||||
| Study age (years) | 74.31 ± 9.99 | 67.96 ± 9.70 | 68.15 ± 3.96 | 68.29 ± 5.68 |
| Age at diagnosis (years) | 69.69 ± 9.22 | 65.41 ± 9.08 | 60.48 ± 10.32 | 54.95 ± 4.15 † |
| Years of PD | 4.62 ± 3.54 | 2.55 ± 1.78 | 7.67 ± 6.65 | 13.33 ± 2.05 † |
| Gender (female/male)) | 5/8 | 2/5 | 1/2 | 1/2 |
| Number of participants | 13 | 7 | 3 | 3 |
| Best MDS-UPDRS-III | ||||
| Study age (years) | 75.62 ± 7.84 | 65.59 ± 10.86 | 63.54 ± 0.00 | 67.40 ± 3.74 †† |
| Age at diagnosis (years) | 70.26 ± 7.67 | 62.90 ± 10.35 | 46.54 ± 0.00 | 59.90 ± 8.17 |
| Years of PD | 5.36 ± 4.55 | 2.69 ± 1.77 | 17.00 ± 0.00 | 7.50 ± 4.61 |
| Gender (female/male) | 5/9 | 2/5 | 0/1 | 2/4 |
| Number of participants | 14 | 7 | 1 | 4 |
| Best PKG Scores | ||||
| Study age (years) | 72.92 ± 10.21 | 67.66 ± 7.54 | 63.54 ± 0.00 | 70.93 ± 5.24 |
| Age at diagnosis (years) | 68.69 ± 9.47 | 63.49 ± 8.78 | 46.54 ± 0.00 | 57.43 ± 2.74 |
| Years of PD | 4.23 ± 3.31 | 4.17 ± 4.14 | 17.00 ± 0.00 | 13.50 ± 2.50 † |
| Gender (female/male) | 6/11 | 3/3 | 0/1 | 0/2 |
| Number of participants | 17 | 6 | 1 | 2 |
† Pairwise p-value < 0.05 when compared with both Cluster A and B. †† Pairwise p-value < 0.05 when compared with only Cluster A. Unless denoted by † or ††, each pairwise cluster comparison yields no statistical difference (p-value > 0.05).
Dosage and medication types for clusters depicted in Figure 3c,d.
| Cluster A | Cluster B | Cluster C | Cluster D | |
|---|---|---|---|---|
| Best MDS-UPDRS-III | ||||
| LEDD | 387 ± 151 | 643 ± 127 | 1380 ± 0 | 1157 ± 183 |
| Carbidopa/levodopa IR | 279 ± 159 | 629 ± 150 | 1050 ± 0 | 925 ± 299 |
| Carbidopa/levodopa CR | 21 ± 80 | -- | 200 ± 0 | 300 ± 476 |
| Ropinirole | -- | 1 ± 4 | 4 ± 0 | -- |
| Selegiline | -- | -- | 10 ± 0 | 1 ± 1 |
| Rasagiline | 0.2 ± 0.4 | -- | -- | -- |
| Rytary | 194 ± 547 | -- | -- | -- |
| Best PKG Scores | ||||
| LEDD | 381 ± 105 | 942 ± 233 | 1380 ± 0 | 1131 ± 206 |
| Carbidopa/levodopa IR | 335 ± 147 | 917 ± 183 | 1050 ± 0 | 250 ± 354 |
| Carbidopa/levodopa CR | 18 ± 73 | 33 ± 82 | 200 ± 0 | 500 ± 707 |
| Ropinirole | -- | -- | 4 ± 0 | -- |
| Selegiline | 0.6 ± 2.4 | -- | 10 ± 0 | 1 ± 2 |
| Rasagiline | 0.1 ± 0.3 | -- | -- | 0.3 ± 0.4 |
| Rytary | 45 ± 184 | -- | -- | 1170 ± 1655 |
Medications are aggregate results for each cluster and do not represent a single subject’s regimen.
Random forest classifier performance identifying subjects in clusters A and B, as stratified through k-means clustering under the best PKG score. Each classifier was trained with a combination of features: Demographic information, demographic information and MDS-UPDRS-III scores (Visit 1), demographic information and PKG time-series data (Visit 1), and demographic information, visit MDS-UPDRS-III scores (Visit 1), and PKG time-series data (Visit 1).
| Demographics Alone | Demographics and MDS-UPDRS-III | Demographics and PKG | Demographics, MDS-UPDRS-III and PKG | |
|---|---|---|---|---|
| Sensitivity | 61.3 ± 1.0% | 65.2 ± 0.8% | 84.5 ± 0.7% | 86.5 ± 0.5% |
| Specificity | 62.3 ± 1.5% | 66.0 ± 0.7% | 81.7 ± 2.2% | 87.7 ± 1.6% |
| Accuracy | 61.6 ± 0.8% | 65.4 ± 0.6% | 83.8 ± 0.7% | 86.9 ± 0.6% |
| PPV | 82.2 ± 0.6% | 84.4 ± 0.3% | 93.1 ± 0.8% | 95.3 ± 0.6% |
| F1 Score | 70.1 ± 0.8% | 73.5 ± 0.6% | 88.5 ± 0.5% | 90.7 ± 0.4% |
| AUC | 0.618 ± 0.008 | 0.656 ± 0.005 | 0.831 ± 0.011 | 0.871 ± 0.008 |