| Literature DB >> 36010330 |
Son V T Dao1, Zhiqiu Yu2, Ly V Tran1, Phuc N K Phan1, Tri T M Huynh3, Tuan M Le3.
Abstract
Parkinson's Disease (PD) is a brain disorder that causes uncontrollable movements. According to estimation, roughly ten million individuals worldwide have had or are developing PD. This disorder can have severe consequences that affect the patient's daily life. Therefore, several previous works have worked on PD detection. Automatic Parkinson's Disease detection in voice recordings can be an innovation compared to other costly methods of ruling out examinations since the nature of this disease is unpredictable and non-curable. Analyzing the collected vocal records will detect essential patterns, and timely recommendations on appropriate treatments will be extremely helpful. This research proposed a machine learning-based approach for classifying healthy people from people with the disease utilizing Grey Wolf Optimization (GWO) for feature selection, along with Light Gradient Boosted Machine (LGBM) to optimize the model performance. The proposed method shows highly competitive results and has the ability to be developed further and implemented in a real-world setting.Entities:
Keywords: Parkinson’s disease; feature subset selection; grey wolf optimization
Year: 2022 PMID: 36010330 PMCID: PMC9406914 DOI: 10.3390/diagnostics12081980
Source DB: PubMed Journal: Diagnostics (Basel) ISSN: 2075-4418
Figure 1Flowchart of wrapper-based approach.
Figure 2Proposed model applying machine learning algorithms.
Overview of the feature sets.
| Title 1 | Measure | Explanation | # of Feature |
|---|---|---|---|
| Baseline features | Jitter variants | Capture instabilities of the oscillating pattern of the vocal folds & its subset quantify the cycle-to-cycle changes in the fundamental frequency. | 5 |
| Shimmer variants | Capture instabilities of the oscillating pattern of the vocal folds & its subset quantify the cycle-to-cycle changes in the fundamental amplitude. | 6 | |
| Fundamental frequency parameters | the frequency of focal folds vibration. Mean, median, standard deviation, minimum & maximum values were used | 5 | |
| Harmonicity parameters | Due to incomplete vocal fold closure, increased noise components occur in speech pathologies. Harmonics to Noise Ratio and Noise to Harmonics Ratio parameters quantify the ratio of signal information over noise, which were used as features. | 2 | |
| Recurrence Period density Entropy (RPDE) I | Provides information about the ability of the vocal fold oscillations and quantifies the deviation form F0 | 1 | |
| Detrended Fluctuation Analysis (DFA) | Quantifies the stochastic self-similarity of the turbulent noise. | 1 | |
| Pitch Period Entropy (PPE) | Measures the impaired control of fundamental frequency F0 by using a logarithmic scale | 1 | |
| Time-frequency features | Intensity Parameters | Related to the power of speech signal in dB. Mean, minimum and maximum intensity values were used | 3 |
| Formant Frequencies | Amplified by the vocal tract, the first four formants were used as features. | 4 | |
| Bandwidth | The frequency range between the formant frequencies. The first four bandwidths were used as features. | 4 | |
| Mel frequency cepstral coefficients (MFCCs) | MFCCs | Catch the PD effects in the vocal tract separately from the vocal folds | 84 |
| Wavelet Transform based Features | Wavelet Transform (WT) features related to F0 | Quantify the deviation in F0 | 182 |
| Vocal fold features | Glottis Quotient (GQ) | a measure of periodicity in glottis movements that provides information about the opening and closing duration of the glottis | 3 |
| Glottal to Noise Excitation (GNE) | Quantifies the extent of turbulent noise caused by incomplete vocal fold closure in the speech signal. | 6 | |
| Vocal Fold Excitation Ratio (VEER) | Quantities the amount of noise, produced due to the pathological vocal fold vibration using non-linear energy at entropy concepts. | 7 | |
| Empirical Mode Decomposition (EMD) | Decompose a speech signal into elementary signal components by using adaptive basis functions & | ||
| Energy/entropy values obtained from these components are used to quantify noise. | 6 | ||
| Tunable Q-factor Wavelet Transform (TQWT) | TQWT | A more extensive quantification method for fundamental frequency deviation as compared to WT I | 432 |
Figure 3Checking for a missing value.
Figure 4The output of data standardization.
Figure 5The output of data splitting.
Figure 6Hierarchy of wolves in GWO.
Figure 7Updated position of a particle in GWO.
Figure 8Movement of Levy flights.
Results comparison between baseline model and proposed model.
| Classifier | Accuracy | Precision | Recall | F1-Score | AUC | Computational Time | |
|---|---|---|---|---|---|---|---|
| k-NN | Baseline model | 0.862 | 0.857 | 0.979 | 0.914 | 0.75 | 0.443 |
| Proposed model | 0.878 | 0.869 | 0.986 | 0.923 | 0.316 | 0.878 | |
| SVM | Baseline model | 0.841 | 0.840 | 0.972 | 0.901 | 0.71 | 0.560 |
| Proposed model | 0.866 | 0.887 | 0.943 | 0.914 | 0.527 | 0.866 | |
| DT | Baseline model | 0.810 | 0.878 | 0.865 | 0.871 | 0.76 | 0.640 |
| Proposed model | 0.795 | 0.850 | 0.886 | 0.868 | 0.744 | 0.795 | |
| Proposed LGBM | Baseline model | 0.905 | 0.902 | 0.979 | 0.939 | 0.83 | 2.056 |
| Proposed model | 0.894 | 0.895 | 0.972 | 0.932 | 1.926 | 0.894 |
Figure 9Confusion Matrix of the proposed model.