| Literature DB >> 35722168 |
Zoltan Galaz1, Peter Drotar2, Jiri Mekyska1, Matej Gazda2, Jan Mucha1, Vojtech Zvoncak1, Zdenek Smekal1, Marcos Faundez-Zanuy3, Reinel Castrillon4,5, Juan Rafael Orozco-Arroyave4,6, Steven Rapcsak7, Tamas Kincses8, Lubos Brabenec9, Irena Rektorova9,10.
Abstract
Parkinson's disease dysgraphia (PDYS), one of the earliest signs of Parkinson's disease (PD), has been researched as a promising biomarker of PD and as the target of a noninvasive and inexpensive approach to monitoring the progress of the disease. However, although several approaches to supportive PDYS diagnosis have been proposed (mainly based on handcrafted features (HF) extracted from online handwriting or the utilization of deep neural networks), it remains unclear which approach provides the highest discrimination power and how these approaches can be transferred between different datasets and languages. This study aims to compare classification performance based on two types of features: features automatically extracted by a pretrained convolutional neural network (CNN) and HF designed by human experts. Both approaches are evaluated on a multilingual dataset collected from 143 PD patients and 151 healthy controls in the Czech Republic, United States, Colombia, and Hungary. The subjects performed the spiral drawing task (SDT; a language-independent task) and the sentence writing task (SWT; a language-dependent task). Models based on logistic regression and gradient boosting were trained in several scenarios, specifically single language (SL), leave one language out (LOLO), and all languages combined (ALC). We found that the HF slightly outperformed the CNN-extracted features in all considered evaluation scenarios for the SWT. In detail, the following balanced accuracy (BACC) scores were achieved: SL-0.65 (HF), 0.58 (CNN); LOLO-0.65 (HF), 0.57 (CNN); and ALC-0.69 (HF), 0.66 (CNN). However, in the case of the SDT, features extracted by a CNN provided competitive results: SL-0.66 (HF), 0.62 (CNN); LOLO-0.56 (HF), 0.54 (CNN); and ALC-0.60 (HF), 0.60 (CNN). In summary, regarding the SWT, the HF outperformed the CNN-extracted features over 6% (mean BACC of 0.66 for HF, and 0.60 for CNN). In the case of the SDT, both feature sets provided almost identical classification performance (mean BACC of 0.60 for HF, and 0.58 for CNN).Entities:
Keywords: Parkinson's disease dysgraphia; deep learning; feature extraction; handwriting analysis; machine learning
Year: 2022 PMID: 35722168 PMCID: PMC9198652 DOI: 10.3389/fninf.2022.877139
Source DB: PubMed Journal: Front Neuroinform ISSN: 1662-5196 Impact factor: 3.739
Overview of the related works.
|
|
|
|
|
|
|
|---|---|---|---|---|---|
|
| |||||
| Impedovo et al. ( | 29 PD, 36 HC | PaHaW–all | Kinematic, enthropy | GNB | ACC = 72.0% |
| Mucha et al. ( | 33 PD, 36 HC | PaHaW–all | FD-based kinematic | XGBoost | ACC = 97.1% |
| EER = 23.6% (PD dur) | |||||
| EER = 12.5% (UPDRS V) | |||||
| Rios-Urrego et al. ( | 39 PD, 70 HC | Archimedean spiral | Kinematic, geometric | KNN | ACC = 83.3% (spiral) |
| Short sentence | Spectral, non-linear | SVM | ACC = 75.0% (sentence) | ||
| Jerkovic et al. ( | 33 PD, 10 HC | Various sentences | Kinematic | cLDA | ACC = 86.0% |
| Impedovo ( | 37 PD, 38 HC | PaHaW–all | DFT, SLM, MBD | SVM | ACC = 94.0% |
| Aouraghe et al. ( | 40 PD, 40 HC | Segment of text | DTWT, FFT | KNN | ACC = 85.7% (full text) |
| Butter/adaptive filter | decision tree | ACC = 78.6% (first line) | |||
| Vásquez-Correa et al. ( | 44 PD, 40 HC | 14 drawings/writings | Original signal | 1D CNN | ACC = 67.0% |
|
| |||||
| Moetesum et al. ( | 37 PD, 38 HC | PaHaW–all | AlexNet CNN | SVM | ACC = 83.0% |
| Gazda et al. ( | 64 PD, 71 HC | Archimedean spiral | Pre-trained CNN and transfer | ACC = 92.7% (NewHandPD) | |
| 2 dataset | learning (ImageNet → PD dataset) | ACC = 85.8% (PaHaW) | |||
| Kamran et al. ( | PaHaW | Several drawings | ACC = 62.5% (PaHaW) | ||
| HandPD | AlexNet, GoogLeNet, VGG16 | ACC = 91.4% (HandPD) | |||
| NewHandPD | VGG16, ResNet50, ResNet101 | ACC = 98.4% (NewHandPD) | |||
| PD Drawings | ACC = 90.0% (PD Drawings) | ||||
| Diaz et al. ( | 37 PD, 38 HC | PaHaW–all | VGG | SVM | ACC = 86.0% |
PD, Parkinson's disease; HC, healthy control; PaHaW, Parkinson's disease handwriting database (Drotar et al., .
Figure 1Selected samples from the multilingual dataset (blue line – on-surface movement; red line – in-air movement). (A) Spiral drawing (PD patient); (B) Spiral drawing (HC); (C) English sentence (PD patient) “The weather turned nice”; (D) Hungarian sentence (PD patient) “A vonat hirtelen megállt”; (E) Czech sentence (HC) “Tramvaj dnes už nepojede”.
Demographic characteristics.
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|
|
| |||||||
| PaHaW | CZ | 18 | 15 | 69.21 ± 11.10 | 17 | 19 | 62.50 ± 11.70 |
| CoBeN | CZ | 6 | 13 | 66.48 ± 7.77 | 30 | 10 | 67.04 ± 6.07 |
| CoBeN | US | 3 | 6 | 68.56 ± 4.07 | 9 | 3 | 72.50 ± 8.37 |
| CoBeN | HU | 2 | 7 | 66.00 ± 9.96 | 7 | 5 | 64.92 ± 5.30 |
| HWUDEA | CO | 41 | 28 | 64.42 ± 11.85 | 22 | 27 | 62.69 ± 11.34 |
|
| |||||||
| PaHaW | CZ | 19 | 18 | 69.32 ± 10.97 | 18 | 20 | 62.42 ± 11.39 |
| CoBeN | CZ | 6 | 13 | 66.48 ± 7.77 | 30 | 9 | 67.21 ± 6.05 |
| CoBeN | US | 3 | 6 | 68.56 ± 4.07 | 9 | 3 | 72.50 ± 8.37 |
| CoBeN | HU | 2 | 6 | 65.88 ± 10.64 | 7 | 5 | 64.92 ± 5.30 |
| HWUDEA | CO | 13 | 4 | 63.88 ± 7.61 | 5 | 5 | 70.20 ± 10.67 |
Clinical characteristics of the PD patients.
|
|
|
|
|
|
|
|---|---|---|---|---|---|
| PaHaW | CZ | 8.38 ± 4.80 | 1,432.19 ± 704.78 | – | 2.27 ± 0.85 |
| CoBeN | CZ | 4.00 ± 4.15 | 568.33 ± 508.03 | 7.00 ± 1.41 | – |
| CoBeN | US | – | 333.12 ± 240.40 | – | – |
| CoBeN | HU | – | – | – | – |
| HWUDEA | CO | 10.56 ± 11.16 | – | 36.78 ± 19.63 | 2.38 ± 0.61 |
LED, L-dopa equivalent daily dose (Lee et al., .
Classification performance in the single-language scenario.
|
|
|
|
|
|
|
|---|---|---|---|---|---|
|
| |||||
| CZ | Handcrafted | 0.59 ± 0.08 | 0.590.07 | 0.82 ± 0.12 | 0.36 ± 0.14 |
| CNN | 0.64 ± 0.03 | 0.65 ± 0.05 | 0.65 ± 0.09 | 0.65 ± 0.06 | |
| CO | Handcrafted | 0.59 ± 0.12 | 0.72 ± 0.07 | 0.81 ± 0.09 | 0.37 ± 0.23 |
| CNN | 0.61 ± 0.02 | 0.62 ± 0.02 | 0.62 ± 0.03 | 0.62 ± 0.02 | |
| HU | Handcrafted | 0.64 ± 0.17 | 0.61 ± 0.20 | 0.72 ± 0.29 | 0.57 ± 0.34 |
| CNN | 0.48 ± 0.03 | 0.52 ± 0.13 | 0.52 ± 0.16 | 0.52 ± 0.12 | |
| US | Handcrafted | 0.82 ± 0.18 | 0.77 ± 0.28 | 0.84 ± 0.31 | 0.81 ± 0.23 |
| CNN | 0.77 ± 0.02 | 0.77 ± 0.07 | 0.77 ± 0.11 | 0.77 ± 0.08 | |
|
| |||||
| CZ | Handcrafted | 0.66 ± 0.08 | 0.62 ± 0.08 | 0.64 ± 0.10 | 0.69 ± 0.12 |
| CNN | 0.65 ± 0.04 | 0.66 ± 0.04 | 0.66 ± 0.04 | 0.66 ± 0.05 | |
| CO | Handcrafted | 0.56 ± 0.18 | 0.72 ± 0.19 | 0.83 ± 0.22 | 0.28 ± 0.29 |
| CNN | 0.50 ± 0.08 | 0.54 ± 0.07 | 0.54 ± 0.08 | 0.54 ± 0.09 | |
| HU | Handcrafted | 0.75 ± 0.18 | 0.65 ± 0.30 | 0.82 ± 0.34 | 0.59 ± 0.34 |
| CNN | 0.50 ± 0.06 | 0.48 ± 0.08 | 0.48 ± 0.10 | 0.48 ± 0.08 | |
| US | Handcrafted | 0.65 ± 0.20 | 0.54 ± 0.28 | 0.58 ± 0.34 | 0.73 ± 0.32 |
| CNN | 0.70 ± 0.04 | 0.70 ± 0.06 | 0.70 ± 0.08 | 0.70 ± 0.05 | |
BACC, balanced accuracy; F1, F1 score; SEN, sensitivity; SPE, specificity.
Figure 2Importance of the features used in the models in the single-language scenario (spiral drawing task).
Figure 3Importance of the features used in the models in the single-language scenario (sentence writing task).
Figure 4Relevance maps for ten Archimedean spirals (two random samples from each dataset are depicted).
Figure 5Relevance maps for four sentences (one random sample is depicted for each language).
Classification performance in the leave-one-language-out scenario.
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|
|
| ||||||
| CO+HU+US | CZ | Handcrafted | 0.54 | 0.51 | 0.62 | 0.46 |
| CNN | 0.45 | 0.41 | 0.48 | 0.42 | ||
| CZ+HU+US | CO | Handcrafted | 0.50 | 0.74 | 1.00 | 0.00 |
| CNN | 0.63 | 0.62 | 0.54 | 0.71 | ||
| CZ+CO+US | HU | Handcrafted | 0.56 | 0.47 | 0.44 | 0.67 |
| CNN | 0.71 | 0.67 | 0.67 | 0.75 | ||
| CZ+CO+HU | US | Handcrafted | 0.65 | 0.67 | 0.88 | 0.41 |
| CNN | 0.38 | 0.32 | 0.33 | 0.42 | ||
|
| ||||||
| CO+HU+US | CZ | Handcrafted | 0.63 | 0.68 | 0.78 | 0.48 |
| CNN | 0.54 | 0.58 | 0.80 | 0.29 | ||
| CZ+HU+US | CO | Handcrafted | 0.59 | 0.30 | 0.18 | 1.00 |
| CNN | 0.51 | 0.72 | 0.82 | 0.20 | ||
| CZ+CO+US | HU | Handcrafted | 0.67 | 0.64 | 0.59 | 0.75 |
| CNN | 0.60 | 0.46 | 0.38 | 0.83 | ||
| CZ+CO+HU | US | Handcrafted | 0.71 | 0.67 | 0.59 | 0.83 |
| CNN | 0.63 | 0.46 | 0.33 | 0.92 | ||
TRAIN, training dataset; TEST, test dataset; BACC, balanced accuracy; F1, F1 score; SEN, sensitivity; SPE, specificity.
Classification performance in the scenario with all languages combined.
|
|
|
|
|
|
|
|---|---|---|---|---|---|
| Spiral | Handcrafted | 0.60 ± 0.06 | 0.63 ± 0.06 | 0.73 ± 0.10 | 0.48 ± 0.07 |
| CNN | 0.60 ± 0.01 | 0.61 ± 0.02 | 0.61 ± 0.04 | 0.61 ± 0.04 | |
| Sentence | Handcrafted | 0.69 ± 0.05 | 0.65 ± 0.07 | 0.61 ± 0.09 | 0.78 ± 0.07 |
| CNN | 0.66 ± 0.01 | 0.67 ± 0.01 | 0.67 ± 0.03 | 0.67 ± 0.03 |
BACC, balanced accuracy; F1, F1 score; SEN, sensitivity; SPE, specificity.