| Literature DB >> 32647196 |
Oguz Akbilgic1,2, Rishikesan Kamaleswaran3, Akram Mohammed4, G Webster Ross5, Kamal Masaki6, Helen Petrovitch7, Caroline M Tanner8,9, Robert L Davis4, Samuel M Goldman10,11.
Abstract
Autonomic nervous system involvement precedes the motor features of Parkinson's disease (PD). Our goal was to develop a proof-of-concept model for identifying subjects at high risk of developing PD by analysis of cardiac electrical activity. We used standard 10-s electrocardiogram (ECG) recordings of 60 subjects from the Honolulu Asia Aging Study including 10 with prevalent PD, 25 with prodromal PD, and 25 controls who never developed PD. Various methods were implemented to extract features from ECGs including simple heart rate variability (HRV) metrics, commonly used signal processing methods, and a Probabilistic Symbolic Pattern Recognition (PSPR) method. Extracted features were analyzed via stepwise logistic regression to distinguish between prodromal cases and controls. Stepwise logistic regression selected four features from PSPR as predictors of PD. The final regression model built on the entire dataset provided an area under receiver operating characteristics curve (AUC) with 95% confidence interval of 0.90 [0.80, 0.99]. The five-fold cross-validation process produced an average AUC of 0.835 [0.831, 0.839]. We conclude that cardiac electrical activity provides important information about the likelihood of future PD not captured by classical HRV metrics. Machine learning applied to ECGs may help identify subjects at high risk of having prodromal PD.Entities:
Mesh:
Year: 2020 PMID: 32647196 PMCID: PMC7347531 DOI: 10.1038/s41598-020-68241-6
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Subject characteristics.
| Control (n = 25) | Prodromal PD (n = 25) | Prevalent PD (n = 10) | |
|---|---|---|---|
Age at ECG Mean (SD), range | 78.0 (3.7), 72–88 | 77.6 (4.9), 72–88 | 79.9 (4.0), 72–85 |
Age at PD diagnosis Mean (SD), range | – | 81.9 (4.8), 74–91 | 74.5 (5.3), 62–80 |
Years from ECG until PD Mean (SD), range | – | 4.3 (2.4), 1–8 | − 5.4 (2.5), − 2 to − 10 |
Years follow up in controls until death (all controls are deceased) Mean (SD), range | 12.3 (4.6), 5–20 | – | – |
| Had autopsy | 10/25 (40%) | 6/25 (24%) | 5/10 (50%) |
Mean HR characteristics with 95% confidence intervals.
| HR characteristics | Controls (n = 25) | Prodromal PD (n = 25) | Prevalent (n = 10) |
|---|---|---|---|
| Mean | 65.30 [61.93, 68.68] | 64.50 [59.99, 69.02] | 68.24 [61.72, 74.76] |
| Median | 65.16 [61.75, 68.58] | 64.21 [59.39, 69.03] | 68.30 [61.75, 74.85] |
| Standard deviation | 2.65 [1.49, 3.80] | 3.87 [1.07, 6.67] | 1.20 [0.42, 1.98] |
| Kurtosis | 3.20 [2.26, 4.13] | 2.54 [2.21, 2.87] | 2.48 [2.01, 2.95] |
| Skewness | 0.19 [− 0.25, 0.63] | 0.10 [− 0.17, 0.36] | 0.02 [− 0.36, 0.40] |
| Maximum | 69.83 [64.81, 74.86 | 71.57 [63.56, 79.57] | 70.21 [63.32, 77.10] |
| Minimum | 61.00 [57.65, 64.34] | 58.90 [54.32, 63.49] | 66.21 [59.81, 72.83] |
| Range | 8.84 [4.48, 13.19] | 12.66 [3.53, 21.79] | 3.90 [1.28, 6.51] |
| Coefficient of variation | 3.85 [2.37, 5.34] | 5.58 [1.75, 9.41] | 1.74 [0.62, 2.86] |
Only Skewness (KS p > 0.05) among nine HR variables (KS p > 0.05) followed a normal distribution. There were no significant differences in means of Skewness between three groups (ANOVA p = 0.86). Among other eight HR variables, there was no variable significantly differed between three groups (Kruskal–Wallis Test p > 0.05).
Figure 1Comparison of prodromal PD (1) and control ECGs (0) based on PSPR features. Note that PSPR features represent how a given ECG (from prodromal PD subjects or control) differs (dissimilarity or distant) from the ECGs of subjects with prevalent PD. This implies that the dissimilarity between ECGs of prodromal PD and prevalent PD are smaller (more similar) than the dissimilarity between controls and prevalent PD ECGs (less similar).
Figure 2k-folds cross-validation results. The solid black line represents the average cross-validation result, while the dashed red line is the corresponding 95% confidence interval. Increasing ‘k’ indicates a larger training and smaller testing set. For example, when k = 2, a model is trained on 1 × (50/2) = 25 subjects and tested on the remaining 25. When k = 10, a model is trained on 9x(50/10) = 45 and tested on the remaining 5.