| Literature DB >> 33288807 |
Alexandros Papadopoulos1, Dimitrios Iakovakis2, Lisa Klingelhoefer3, Sevasti Bostantjopoulou4, K Ray Chaudhuri5, Konstantinos Kyritsis6, Stelios Hadjidimitriou2, Vasileios Charisis2, Leontios J Hadjileontiadis2,7, Anastasios Delopoulos6.
Abstract
Parkinson's Disease (PD) is the second most common neurodegenerative disorder, affecting more than 1% of the population above 60 years old with both motor and non-motor symptoms of escalating severity as it progresses. Since it cannot be cured, treatment options focus on the improvement of PD symptoms. In fact, evidence suggests that early PD intervention has the potential to slow down symptom progression and improve the general quality of life in the long term. However, the initial motor symptoms are usually very subtle and, as a result, patients seek medical assistance only when their condition has substantially deteriorated; thus, missing the opportunity for an improved clinical outcome. This situation highlights the need for accessible tools that can screen for early motor PD symptoms and alert individuals to act accordingly. Here we show that PD and its motor symptoms can unobtrusively be detected from the combination of accelerometer and touchscreen typing data that are passively captured during natural user-smartphone interaction. To this end, we introduce a deep learning framework that analyses such data to simultaneously predict tremor, fine-motor impairment and PD. In a validation dataset from 22 clinically-assessed subjects (8 Healthy Controls (HC)/14 PD patients with a total data contribution of 18.305 accelerometer and 2.922 typing sessions), the proposed approach achieved 0.86/0.93 sensitivity/specificity for the binary classification task of HC versus PD. Additional validation on data from 157 subjects (131 HC/26 PD with a total contribution of 76.528 accelerometer and 18.069 typing sessions) with self-reported health status (HC or PD), resulted in area under curve of 0.87, with sensitivity/specificity of 0.92/0.69 and 0.60/0.92 at the operating points of highest sensitivity or specificity, respectively. Our findings suggest that the proposed method can be used as a stepping stone towards the development of an accessible PD screening tool that will passively monitor the subject-smartphone interaction for signs of PD and which could be used to reduce the critical gap between disease onset and start of treatment.Entities:
Mesh:
Year: 2020 PMID: 33288807 PMCID: PMC7721908 DOI: 10.1038/s41598-020-78418-8
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1High-level overview of the deep learning system. Both input streams, represented by the Postural Accelerations (PA) and Typing Dynamics (TD) bags, consist of multiple data recordings ( and , respectively) that are transformed independently using a feature extraction module. The resulting M-dimensional features are used to produce a fixed-length embedding (also M-dimensional) using an attention-pooling module. The embeddings for each modality are fused together via a sum operation to produce the subject embedding, which is then used as input to a multi-label classifier module that outputs the probability of tremor, fine-motor impairment (FMI) and PD. Initially, the feature extractors and attention modules of the two modalities are separately pre-trained against the respective symptom ground-truth (tremor or FMI), using a temporary single-output classifier module. After pre-training, the initial classifier modules are discarded, the feature extraction and attention modules are frozen, the embedding vectors are joined and a new multi-output final classifier module is introduced and is fine-tuned using a multi-label logistic loss function (Eq. 4). C denotes the number of accelerometer channels, W the length in samples of each segment in the PA bag and the number of bins for the hold and flight time histograms. Values for all parameters are given in Methods. Figure was drawn using Inkscape v1.0 https://inkscape.org/.
Demographics and symptom information for the PD patient (PD) and Healthy Control (HC) populations of the two datasets (SData and GData) used in our experiments. Reported symptom scores include UPDRS items 16 (tremor self-report), 20 (hand rest tremor), 21 (hand action tremor), 22 (hand rigidity), 23 (finger tapping) and 31 (body bradykinesia), as well as the sum of all UPDRS part-III items. Scores for UPDRS 20–23 refer to the sum of both hand scores. UPDRS scores are not available for the GData set, as the GData subjects did not undergo a medical examination. The age corrected GData column contains the demographics of a typical random GData subset, sampled in such a way that the mean age of the HC population lies within ± 2.5 years of the mean age of the PD population (note: the maximum value of 12 in the total UPDRS-III score in the Healthy Control population is caused by a subject that exhibits arthrosis and other problems unrelated to PD.) LEDD stands for Levodopa Equivalent Daily Dose.
| Count | SData set | GData set | GData set (age corrected) | |||
|---|---|---|---|---|---|---|
| HC | PD | HC | PD | HC | PD | |
| 8 | 14 | 131 | 26 | 75 | 26 | |
| Age | 50.5 (9.0) | 60.7 (9.8) | 54.5 (10.0) | 62.5 (8.9) | 60.0 (9.5) | 62.5 (8.9) |
| Gender (female/male) | 4/4 | 3/11 | 55/76 | 12/14 | 35/40 | 12/14 |
| Years diagnosed (mean, std) | – | 7.5 (3.6) | ||||
| Years diagnosed [min, max] | – | [1, 20] | Not available | |||
| PD medication intake (yes/no) | 0/8 | 13/1 | ||||
| LEDD in mg (mean, std) | 0 (0) | 510.7 (379.9) | ||||
| LEDD in mg [min , max] | [0, 0] | [0, 1291] | ||||
| UPDRS 16 mean (std) | 0.1 (0.3) | 1.3 (1.1) | ||||
| UPDRS 16 [min, max] | [0, 1] | [0, 4] | ||||
| UPDRS 20 mean (std) | 0.0 (0.0) | 1.4 (1.4) | ||||
| UPDRS 20 [min,max] | [0, 0] | [0, 5] | ||||
| UPDRS 21 mean (std) | 0.0 (0.0) | 1.2 (1.9) | ||||
| UPDRS 21 [min, max] | [0, 0] | [0, 7] | ||||
| UPDRS 22 mean (std) | 0.2 (0.4) | 2.2 (1.8) | Not available | |||
| UPDRS 22 [min, max] | [0, 1] | [0, 7] | ||||
| UPDRS 23 mean (std) | 0.2 (0.6) | 2.2 (1.5) | ||||
| UPDRS 23 [min, max] | [0, 2] | [5, 5] | ||||
| UPDRS 31 mean (std) | 0.2 (0.4) | 1.2 (0.4) | ||||
| UPDRS 31 [min, max] | [0, 1] | [1, 2] | ||||
| Total UPDRS-III (mean, std) | 1.7 (2.5) | 21.1 (15.0) | ||||
| Total UPDRS-III [min, max] | [0, 12] | [3, 62] | ||||
Classification performance for the LOSO experiment on the SData set. For each predicted label (tremor, FMI and PD) we report the average sensitivity, specificity and precision across 10 independent experimental trials. The first two rows present the predictive performance of standalone symptom classifiers. Such models arise during the initial pre-training step, when the separate branches that comprise the total model are separately trained against a single symptom label (the pre-training procedure is described in detail in Section Training details). The rest of the table, presents the performance of the fused multi-label classifier.
| Classifier | Target | Sensitivity | Specificity | Precision |
|---|---|---|---|---|
| Tremor | Tremor | 0.827 | 0.909 | 0.901 |
| FMI | FMI | 0.866 | 0.800 | 0.902 |
| Fused multi-label | Tremor | 0.854 | 0.936 | 0.930 |
| FMI | 0.866 | 0.842 | 0.922 | |
| PD | 0.928 | 0.862 | 0.921 |
Figure 2ROC curve obtained by using an ensemble of 10 models trained on the SData set (), to infer PD on the GData set (). The corresponding AUC is 0.868 with a confidence interval of 0.773–0.948. Figure was drawn using Inkscape v1.0 https://inkscape.org/.
Experimental results of an ensemble of 10 fused multi-label models trained on the SData set () and evaluated on the GData set () for the detection of PD. The AUC in the age corrected GData column refers to the mean AUC across 10 randomly generated, age-matched GData subsets (as such, we do not report performance metrics values at the two operating points for this experiment).
| Operating point | Metric | GData | GData (age corrected) |
|---|---|---|---|
| High sensitivity | Sensitivity | 0.920 | |
| Specificity | 0.689 | ||
| High specificity | Sensitivity | 0.600 | |
| Specificity | 0.917 | ||
| AUC | 0.868 | 0.834 |
Figure 3Visualization of bag instances that receive the highest and lowest attention weights by the proposed model. Figures were drawn using Inkscape v1.0 https://inkscape.org/. (a) View of the tri-axial (x-axis: blue, y-axis:orange, z-axis: green) accelerometer segments corresponding to the two highest (top row) and two lowest (bottom row) attention weights assigned by the model, for a PD subject that exhibits tremor. As we can see, the model assigns larger attention weights ak to the acceleration segments that contain sinusoidal components in the PD tremor frequency band of 3–8Hz (top row) and low weights to segments that lack such patterns (bottom left) or whose frequency content is outside the target frequency band (bottom right). (b) View of the hold and flight time histograms (smoothed for visualization purposes via a kernel-density estimator) corresponding to the two highest (top row) and two lowest (bottom row) attention weights for a PD subject that exhibits FMI. Notice how the model assigns large attention weights to typing sessions in which the hold and flight time distributions are multi-modal and shifted towards regions of slower typing, in contrast to the sessions that receive little attention and in which the distributions are unimodal and located at regions of faster typing.
Clinically-assessed subjects that contributed typing data. All values (except count) refer to the population mean, with the standard deviation given inside parentheses.
| Healthy controls | PD patients | |
|---|---|---|
| Count | 9 | 16 |
| Age | 49.6 (8.8) | 60.5 (9.3) |
| Years diagnosed | - | 6.8 (3.8) |
| UPDRS 22 (hand rigidity) | 0.2 (0.4) | 2.2 (1.7) |
| UPDRS 23 (alternate finger tapping) | 0.2 (0.6) | 2.0 (1.5) |
| UPDRS 31 (body bradykinesia) | 0.2 (0.4) | 1.1 (0.5) |
| Total UPDRS-III score | 1.7 (2.4) | 20.3 (14.2) |
Clinically-assessed subjects that contributed acceleration data. All values (except count) refer to the population mean, with the standard deviation given inside parentheses.
| Healthy controls | PD patients | |
|---|---|---|
| Count | 29 | 48 |
| Age in years | 57.3 (11.5) | 63.3 (9.1) |
| Years diagnosed | - | 7.5 (4.1) |
| UPDRS 16 (self-report tremor) | 0.1 (0.3) | 1.1 (0.8) |
| UPDRS 20 (hand rest tremor) | 0.0 (0.2) | 1.1 (1.2) |
| UPDRS 21 (hand action tremor) | 0.0 (0.0) | 1.0 (1.3) |
| Total UPDRS-III score | 1.4 (2.7) | 18.9 (10.7) |
The network architectures used for the feature extraction modules , . k denotes kernel size, f the number of filters in the convolutional layers and M the final embedding dimension. denotes the unit interval [0, 1].
| Feature extractor | ||
|---|---|---|
| Input | ||
| 3-DOF acceleration | HT, FT hists | |
| Layer 1 | Conv1D | Dense |
| LReLU ( | LReLU ( | |
| MaxPool | Dropout | |
| Layer 2 | Conv1D | Dense |
| LReLU ( | LReLU ( | |
| MaxPool | Dropout | |
| Layer 3 | Conv1D | Dense |
| LReLU ( | ||
| MaxPool | ||
| Layer 4 | Conv1D | |
| LReLU ( | ||
| MaxPool | ||
| Layer 5 | Flatten | |
| Dense | ||
| Output |
Architecture of the final classifier . M denotes the output dimension of the feature extraction modules, , .
| Final classifier | |
|---|---|
| Input | |
| Layer 1 | Dense |
| LReLU ( | |
| Dropout | |
| Layer 2 | Dense |
| LReLU ( | |
| Dropout | |
| Layer 3 | Dense |
| Sigmoid | |
| Output |