| Literature DB >> 34665148 |
Wasifur Rahman1, Sangwu Lee1, Md Saiful Islam1, Victor Nikhil Antony1, Harshil Ratnu1, Mohammad Rafayet Ali1, Abdullah Al Mamun1, Ellen Wagner2, Stella Jensen-Roberts2, Emma Waddell2, Taylor Myers2, Meghan Pawlik2, Julia Soto2, Madeleine Coffey2, Aayush Sarkar2, Ruth Schneider2,3, Christopher Tarolli2,3, Karlo Lizarraga2,3, Jamie Adams2,3, Max A Little4,5, E Ray Dorsey2, Ehsan Hoque1.
Abstract
BACKGROUND: Access to neurological care for Parkinson disease (PD) is a rare privilege for millions of people worldwide, especially in resource-limited countries. In 2013, there were just 1200 neurologists in India for a population of 1.3 billion people; in Africa, the average population per neurologist exceeds 3.3 million people. In contrast, 60,000 people receive a diagnosis of PD every year in the United States alone, and similar patterns of rising PD cases-fueled mostly by environmental pollution and an aging population-can be seen worldwide. The current projection of more than 12 million patients with PD worldwide by 2040 is only part of the picture given that more than 20% of patients with PD remain undiagnosed. Timely diagnosis and frequent assessment are key to ensure timely and appropriate medical intervention, thus improving the quality of life of patients with PD.Entities:
Keywords: Parkinson’s disease; improving access and equity in health care; mobile phone; speech analysis
Mesh:
Year: 2021 PMID: 34665148 PMCID: PMC8564663 DOI: 10.2196/26305
Source DB: PubMed Journal: J Med Internet Res ISSN: 1438-8871 Impact factor: 5.428
Figure 1An outline of our approach for solving the speech task of uttering “The quick brown fox...”.
Figure 2An overview of our data collection, storage, and analysis pipeline.
Figure 3Some screenshots of our subjects while providing the data. All the subjects except B provided data without any supervision. B, D, E, and F have been diagnosed with Parkinson disease. Electronic informed consent was taken from the participants to use their photos for publication.
Figure 4A bar plot showing the age distribution of participants with PD and those without PD in our data set. PD: Parkinson disease.
Demographic composition of our data set (N=726).
| Characteristics | Participants with PDa | Participants without PD | |
| Total, n (%) | 262 (36.1) | 464 (63.9) | |
|
| |||
|
| Female | 101 (38.5) | 300 (64.6) |
|
| Male | 161 (61.4) | 164 (22.5) |
| Age (years), mean (SD) | 65 | 57 | |
|
| |||
|
| United States | 199 (75.9) | 419 (90.3) |
|
| Other | 63 (24) | 45 (9.7) |
| Years since diagnosed, mean (SD) | 7 | N/Ab | |
aPD: Parkinson disease.
bN/A: not applicable.
Names of all the features, code source used for collecting them, and a short descriptiona.
| Feature | Code source | Short description | |||
|
| |||||
|
|
| Little et al [ | Median principal frequency | ||
|
|
| Boersma and Weenink [ | Mean principal frequency | ||
|
|
| Little et al [ | SD in principal frequency | ||
|
| |||||
|
|
| Little et al [ | Perturbation in principal frequency (mean variation) | ||
|
|
| Little et al [ | Perturbation in principal frequency (median variation) | ||
|
| LocalJitter | Boersma and Weenink [ | Average absolute difference between consecutive periods, divided by the average period | ||
|
|
| Boersma and Weenink [ | Average absolute difference between consecutive periods measured in seconds | ||
|
| RapJitter | Boersma and Weenink [ | Average absolute difference between a period and the average of it and its two neighbors | ||
|
| Ppq5Jitter | Boersma and Weenink [ | 5-point period perturbation quotient | ||
|
|
| Boersma and Weenink [ | Difference of differences of periods of principal frequency | ||
|
| |||||
|
| MeanShimmer | Little et al [ | Amplitude perturbation (using mean) | ||
|
|
| Little et al [ | Amplitude perturbation (using median) | ||
|
| LocalShimmer | Boersma and Weenink [ | Average absolute difference between amplitudes of consecutive period | ||
|
| LocaldbShimmer | Boersma and Weenink [ | Average absolute base-10 logarithm of the difference between amplitudes of consecutive period | ||
|
| Apq3Shimmer | Boersma and Weenink [ | 3-point amplitude perturbation quotient | ||
|
| Apq5Shimmer | Boersma and Weenink [ | 5-point amplitude perturbation quotient | ||
|
|
| Boersma and Weenink [ | 11-point amplitude perturbation quotient | ||
|
|
| Boersma and Weenink [ | Shimmer calculated by difference in differences of amplitude | ||
|
| |||||
|
|
| Little et al [ | 13 features of mean MFCC | ||
|
|
| Little et al [ | 13 features of mean variation of MFCC | ||
|
|
| Tsanas et al [ | 4 features capturing relative band power in 4 spectrum ranges | ||
|
| Boersma and Weenink [ | Signal-to-noise ratio | |||
|
| Little et al [ | Pitch estimation uncertainty | |||
|
| Little et al [ | Measure of stochastic self-similarity in turbulent noise | |||
|
| Little et al [ | Measure of inability of maintaining constant pitch | |||
aWe collected the features using the code or methodology described in the corresponding code-source entry.
bThe correlated features were removed, and features in italicized text were used to build the models. Feature names are preceded by the loosely defined umbrella category they belong to.
cMFCC: mel-frequency cepstral coefficient.
dHNR: harmonic-to-noise ratio.
eRPDE: recurrence period density entropy.
fDFA: detrended fluctuation analysis.
gPPE: pitch period entropy.
Performance on the entire data set. The performance of various machine learning algorithms using the standard and embedding features on a data set combining data from both home and laboratory environmentsa.
| Algorithm | Standard features | Embedding features | ||
|
| AUCb | Accuracy | AUC | Accuracy |
| SVMc | 0.751 | 0.735 | 0.738 | 0.692 |
| Random forest | 0.745 | 0.720 | 0.726 | 0.708 |
| LightGBM | 0.753 | 0.720 | 0.737 | 0.693 |
| XGBoost |
|
| 0.722 | 0.689 |
aModels using standard features perform better than the models using embedding features in terms of both binary accuracy and area under the curve. Although the performance of the models is almost similar in terms of area under the curve metric, XGBoost outperforms others by considering both the area under the curve and accuracy metrics simultaneously.
bAUC: area under the curve.
cSVM: support vector machine.
dVariable outperforms all others by taking both area under the curve and accuracy into account.
Figure 5Shapley additive explanations analysis of our best performing models on 3 data sets: (A) main model (ie, entire data set), (B) female model (ie, female only), and (C) age-trimmed model (all subjects are older than 50 years).
Figure 6Changes in fundamental frequency F0 of voice as a function of gender and age (collected from Tsanas et al [13]).
Gender- and age-stratified models. Three separate data sets were constructed: a male data set with male participants, a female data set with female participants, and age-trimmed data set by excluding the participants younger than 50 years. For each of these data sets, a separate model was constructed, and its performance is reported below (N=726).
| Algorithm | Male (n=415) | Female (n=477) | Age-trimmed (male, n=366 and female, n=426) | ||||
|
| AUCa | Accuracy | AUC | Accuracy | AUC | Accuracy | |
| SVMb | 0.795 | 0.717 | 0.659 | 0.763 | 0.755 | 0.723 | |
| Random Forest | 0.758 | 0.702 | 0.699 | 0.788 | 0.739 | 0.713 | |
| LightGBM | 0.725 | 0.665 | 0.717 | 0.768 | 0.749 | 0.712 | |
| XGBoost | 0.762 | 0.717 | 0.682 | 0.771 | 0.742 | 0.704 | |
aAUC: area under the curve.
bSVM: support vector machine.