| Literature DB >> 23882193 |
Abstract
The accelerometer data from mobile smart phones provide stochastic trajectories that change over time. This rate of change is unique to each person and can be well-characterized by the continuous two-parameter family of Gamma probability distributions. Accordingly, on the Gamma plane each participant can be uniquely localized by the shape and the scale parameters of the Gamma probability distribution. The scatter of such points contains information that can unambiguously separate the normal controls (NC) from those patients with Parkinson's disease (PD) that are at a later stage of the disease. In general normal aging seems conducive of more predictable patterns of variation in the accelerometer data. Yet this trend breaks down in PD where the statistical signatures seem to be a more relevant predictor of the stage of the disease. Those patients at a later stage of the disease have more random and noisier patterns than those in the earlier stages, whose statistics resemble those of the older NC. Overall the peak rates of change of the stochastic trajectories of the accelerometer are a good predictor of the stage of PD and of the age of a "normally" aging individual.Entities:
Keywords: Parkinson disease; accelerometers; prediction; severity of illness; stochastic; trajectories
Year: 2013 PMID: 23882193 PMCID: PMC3713394 DOI: 10.3389/fnint.2013.00050
Source DB: PubMed Journal: Front Integr Neurosci ISSN: 1662-5145
Figure 1Schematics to explain the first step of the methods using data from one patient. (A) Examples of frequency histograms from the accelerometer data taken from 5 consecutive entries across sessions from 2 day readings from a single patient (PD patient Cherry, entries 1–3 are the last from 1 day, entries 4–5 are the first from the next day, for example). For the 5 entries shown in (A), the left hand subplot contains histograms of the mean acceleration. The right hand subplot contains histogram of the max acceleration deviation relative to the mean acceleration. The number of points per entry that went into each histogram is specified in each case. The curves are from probability density functions from the estimated parameters of the continuous two-parameter Gamma probability distribution family where a is the shape and b is the scale parameter, and Γ is the Gamma function. These were fit to the frequency histograms in (A). (B) The estimated a-shape and b-scale parameters are plotted on the Gamma plane with 95% confidence intervals for each one of the estimated sets of values in (A) using the same color code as in (A). (C) Sample segment of the stochastic trajectory using the 5 measurements with the arrows indicating the flow in the order in which these 5 measurements were obtained.
Figure 2Schematics to illustrate the second step in the methods using data from one patient and one control. (A) Shows an example of a stochastic trajectory of a PD participant on the Gamma plane for 1 day. The green dot marks the start of the trajectory. (B) The change in position of the point (unit less) along the stochastic trajectory was obtained over time. This is the change in the Euclidean distance in the Gamma plane between two successive points. Each point in the trajectory on the Gamma plane is estimated using at least 100 point measurements (one data entry) of many in a recording session, across several sessions within a given day. (C) Sample histograms of the speed spikes >1 (in shifts/unit time) for 1 day plotted for a NC and a PD participant. The histograms and bin size estimation for the parameters of interest were obtained using MATLAB routines developed in-house based on well-established algorithms for optimal estimation with W = 3.49σN−1/3 (Izenman, 1991), where W is the bin width, σ the standard deviation of the distribution (we used estimated standard deviation ) and N the number of samples. Arrows show the exponential decay in each case. (D) Example of estimated (shape, scale) point for each of the NC and PD cases in (C) are plotted on the Gamma plane with 95% confidence intervals.
Stochastic parameters' estimation, ranges and goodness of fit values.
| Range of values max deviation | ||||||
| 54 Daisy | 1.0147 | 22.8752 | 42 Dafo | 1.0153 | 29.0774 | |
| 55 Cherry | 1.0252 | 6.5693 | 55 Rose | 1.0033 | 17.0063 | |
| 69 Orchid | 1.0079 | 83.6943 | 57 Orange | 1.0126 | 12.2356 | |
| 46 Crocus | 1.0012 | 36.2181 | 67 Sunf | 1.0005 | 6.5434 | |
| 55 Maple | 1.0032 | 52.6980 | 77 Apple | 1.0185 | 7.4658 | |
| 57 Flox | 1.0207 | 44.3045 | 77 Sweetp | 1.0014 | 6.8278 | |
| 55 Violet | 1.0043 | 35.8459 | ||||
| 80 Peony | 1.0014 | 27.7137 | ||||
| 65 Iris | 1.1204 | 36.5351 | ||||
| Gamma MLE estimates with 95% confidence intervals | ||||||
| 2.1892 | 1.6624 [1.929 1.442 2.483 1.915] | 1.3953 | 3.1758 [1.185 2.612 1.642 3.861] | |||
| 2.8527 | 0.7797 [1.893 0.498 4.298 1.220] | 2.1209 | 1.9402 [1.443 1.257 3.117 2.995] | |||
| 0.7476 | 13.4048 [0.643 0.886 0.869 14.51] | 2.9601 | 0.9281 [2.318 0.711 3.779 1.211] | |||
| 1.3966 | 4.6465 [1.199 3.870 1.626 5.578] | 3.2197 | 0.6442 [2.109 0.407 4.914 1.018] | |||
| 0.8482 | 8.5078 [0.742 7.114 0.969 9.18] | 3.8684 | 0.5716 [2.795 0.404 5.354 0.808] | |||
| 0.9854 | 6.6008 [0.823 5.231 1.181 7.329] | 3.5192 | 0.6395 [2.203 0.386 5.623 1.058] | |||
| 0.8260 | 6.0915 [0.675 4.643 1.011 7.993] | |||||
| 1.0905 | 3.9342 [0.885 3.026 1.343 5.114] | |||||
| 1.0631 | 6.8824 [0.684 3.941 1.651 9.02] | |||||
| Power relation cohort | ||||||
| General model Power1: | ||||||
| Coefficients (with 95% confidence bounds): | ||||||
| Goodness of fit: | ||||||
| SSE: 20.44 | ||||||
| Adjusted | ||||||
| RMSE: 1.254 | ||||||
| Power relation split | General model Power1: | General model Power1: | ||||
| Coefficients (with 95% confidence bounds): | Coefficients (with 95% confidence bounds): | |||||
| Goodness of fit: | Goodness of fit: | |||||
| SSE: 20.17 | SSE: 0.1382 | |||||
| Adjusted | Adjusted | |||||
| RMSE: 1.698 | RMSE: 0.1859 | |||||
| Fano factor | ||||||
| Daysi | 1.6624 | Dafo | 3.1758 | |||
| Cherry | 0.7797 | Rose | 1.9402 | |||
| Orchid | 13.4048 | Orange | 0.9281 | |||
| Crocus | 4.6465 | Sunfl | 0.6442 | |||
| Maple | 8.5078 | Apple | 0.5716 | |||
| Flox | 6.6008 | Sweetp | 0.6395 | |||
| Violet | 6.0915 | |||||
| Peony | 3.9342 | |||||
| Iris | 6.8824 | |||||
| PD Fano factor vs. Distance of PD point to NC centroid | ||||||
| Linear model Poly1: | ||||||
| Coefficients (with 95% confidence bounds): | ||||||
| Goodness of fit: | ||||||
| SSE: 1.135 | ||||||
| Adjusted | ||||||
| RMSE: 0.4027 | ||||||
| Age-shape correlation | ||||||
| Linear model Poly1: | Linear model Poly1: | |||||
| Coefficients (with 95% confidence bounds): | Coefficients (with 95% confidence bounds): | |||||
| Goodness of fit: | Goodness of fit: | |||||
| SSE: 732.1 | SSE: 83.58 | |||||
| Adjusted | Adjusted | |||||
| RMSE: 10.23 | RMSE: 4.571 | |||||
Figure 3The stochastic signatures of the peak rate of change scalar reveal differences between PD and NC. (A) Rate of change data from the full cohort is well-described by a power relation. (B) Different slopes for PD and NC within the cohort. The oldest NC participants in the cohort are—stochastically speaking—close to the patients with PD at the earliest stages. PD patients with more years of diagnosis show the most statistically random patterns well-characterized by the Exponential distribution, to the left of the Gamma plane. (C) K-means algorithm with k = 2 forms two clusters whereby one NC is in the “PD” cluster and 2 PD are in the “NC” cluster (all 3 marked by arrows). (D) The Fano Factor obtained from each PD patient's empirically estimated probability distribution is plotted as a function of the distance from each PD patient to the “NC” cluster centroid. This reveals a strong positive correlation between noise and distance from the “NC” centroid. The closer to the “NC” cluster centroid the patient is the lower the FF (lower variance to mean ratio). The farther from the “NC” cluster centroid the patient is the higher the Fano Factor (i.e., the wider the dispersion, indicating noisier, and less reliable statistics). (E) Strong positive correlation between age and shape parameter indicating more predictive patterns of behavior with normal aging. (F) Weak negative correlation between the random patterns and the UPDRS scores in PD participants.