| Literature DB >> 34940256 |
Chien-Te Wu1, Hao-Chuan Huang2, Shiuan Huang2, I-Ming Chen3,4, Shih-Cheng Liao3, Chih-Ken Chen5,6, Chemin Lin5,6, Shwu-Hua Lee6,7, Mu-Hong Chen8,9, Chia-Fen Tsai8,9, Chang-Hsin Weng2, Li-Wei Ko10, Tzyy-Ping Jung11, Yi-Hung Liu12.
Abstract
Major depressive disorder (MDD) is a global healthcare issue and one of the leading causes of disability. Machine learning combined with non-invasive electroencephalography (EEG) has recently been shown to have the potential to diagnose MDD. However, most of these studies analyzed small samples of participants recruited from a single source, raising serious concerns about the generalizability of these results in clinical practice. Thus, it has become critical to re-evaluate the efficacy of various common EEG features for MDD detection across large and diverse datasets. To address this issue, we collected resting-state EEG data from 400 participants across four medical centers and tested classification performance of four common EEG features: band power (BP), coherence, Higuchi's fractal dimension, and Katz's fractal dimension. Then, a sequential backward selection (SBS) method was used to determine the optimal subset. To overcome the large data variability due to an increased data size and multi-site EEG recordings, we introduced the conformal kernel (CK) transformation to further improve the MDD as compared with the healthy control (HC) classification performance of support vector machine (SVM). The results show that (1) coherence features account for 98% of the optimal feature subset; (2) the CK-SVM outperforms other classifiers such as K-nearest neighbors (K-NN), linear discriminant analysis (LDA), and SVM; (3) the combination of the optimal feature subset and CK-SVM achieves a high five-fold cross-validation accuracy of 91.07% on the training set (140 MDD and 140 HC) and 84.16% on the independent test set (60 MDD and 60 HC). The current results suggest that the coherence-based connectivity is a more reliable feature for achieving high and generalizable MDD detection performance in real-life clinical practice.Entities:
Keywords: electroencephalographic (EEG) signal; feature selection; healthcare; machine learning; major depressive disorder (MDD); support vector machine
Mesh:
Year: 2021 PMID: 34940256 PMCID: PMC8699348 DOI: 10.3390/bios11120499
Source DB: PubMed Journal: Biosensors (Basel) ISSN: 2079-6374
Figure 1Comparison of the number of subjects in previous EEG- and ML-based studies.
Figure 2Gender distribution and mean age of participants from each medical center.
Demographic data and results of questionnaires.
| Variable | MDD, | HC, | Effect Size | |
|---|---|---|---|---|
| Gender | 142 F, 58 M | 142 F, 58 M | 1.000 | 0 |
| Age | 53.44 (±16.44) | 51.24 (±17.85) | 0.1969 | 0.1293 |
| BDI-II | 25.79 (±14.30) | 4.18 (±6.74) | 5.60 × 10−59 | 1.9282 |
| PHQ-9 | 12.64 (±7.30) | 2.09 (±3.73) | 4.51 × 10−54 | 1.8150 |
| HADS-A | 10.46 (±4.91) | 3.53 (±3.37) | 1.18 × 10−46 | 1.6425 |
| HADS-D | 10.29 (±5.19) | 2.50 (±2.85) | 7.91 × 10−56 | 1.8556 |
Note: BDI-II, Becks Depression Inventory–II; PHQ-9, Patient Health Questionnaire; HADS, Hospital Anxiety and Depression Scale.
Figure 3Electrode layout. Electrode positions follow the extended 10–20 international system. Reference is at A2. The entire scalp region is divided into four regions for analysis: frontal (brown), central (green), parietal (blue), and temporal (yellow).
Demographic data and scores of questionnaires for participants in the training set.
| Variable | MDD ( | HC ( | Effect Size | |
|---|---|---|---|---|
| Gender | 100 F, 40 M | 97 F, 43 M | 0.7935 | 0.0474 |
| Age | 53.06 (±16.31) | 50.83 (±17.64) | 0.2734 | 0.1312 |
| BDI-II | 25.82 (±14.23) | 3.69 (±5.98) | 1.43 × 10−44 | 2.0201 |
| PHQ-9 | 12.85 (±7.31) | 2.00 (±3.52) | 1.77 × 10−40 | 1.8855 |
| HADS-A | 10.52 (±4.97) | 3.19 (±3.05) | 4.85 × 10−37 | 1.7720 |
| HADS-D | 10.29 (±5.24) | 2.32 (±2.71) | 4.63 × 10−41 | 1.9046 |
Demographic data and scores of questionnaires for participants in the test set.
| Variable | MDD ( | HC ( | Effect Size | |
|---|---|---|---|---|
| Gender | 42 F, 18 M | 45 F, 15 M | 0.6826 | 0.1091 |
| Age | 54.32 (±16.70) | 52.18 (±17.40) | 0.4983 | 0.1240 |
| BDI-II | 25.70 (±14.46) | 5.30 (±8.41) | 4.22 × 10−16 | 1.7237 |
| PHQ-9 | 12.13 (±7.25) | 2.30 (±4.18) | 4.05 × 10−15 | 1.6475 |
| HADS-A | 10.32 (±4.76) | 4.32 (±3.91) | 1.42 × 10−11 | 1.3662 |
| HADS-D | 10.27 (±5.08) | 2.90 (±3.12) | 3.16 × 10−16 | 1.7334 |
Figure 4Topoplots of 5-fold CV classification accuracies in BP, coherence (COH), HFD, and KFD features. The accuracy distribution shown around the occipital area should be ignored, because the EEGs from O1, O2, and Oz were not analyzed in the present study.
Comparison of 5-fold cross validation accuracies among different features, frequency bands, and scalp regions using LDA (in %). In each grid of this table, the numbers inside and outside the brackets denote the accuracies obtained from the features with and without the SBS-based feature selection.
| EEG Features | Frontal | Central | Temporal | Parietal | ALL | |
|---|---|---|---|---|---|---|
| BP |
| 49.53 ± 6.80 | 56.35 ± 4.24 | 53.86 ± 3.50 | 51.80 ± 3.81 | 48.80 ± 6.76 |
|
| 49.63 ± 11.93 | 51.24 ± 6.88 | 52.62 ± 7.25 | 45.83 ± 4.57 | 52.58 ± 8.61 | |
|
| 47.06 ± 7.21 | 51.67 ± 7.20 | 44.49 ± 7.36 | 46.16 ± 6.84 | 42.09 ± 9.95 | |
|
| 52.53 ± 5.36 | 55.00 ± 7.35 | 53.87 ± 6.51 | 50.76 ± 7.64 | 51.74 ± 9.00 | |
|
| 48.28 ± 7.21 | 55.51 ± 2.19 | 46.62 ± 8.46 | 43.94 ± 6.00 | 44.15 ± 11.66 | |
| COH |
| 53.44 ± 8.04 | 54.38 ± 5.86 | 46.70 ± 5.95 | 52.85 ± 3.23 | 52.49 ± 4.85 |
|
| 56.30 ± 7.58 | 58.64 ± 7.65 | 51.86 ± 5.64 | 52.47 ± 10.46 | 49.16 ± 9.78 | |
|
| 52.67 ± 8.48 | 56.08 ± 4.45 | 56.36 ± 8.28 | 52.34 ± 13.05 | 51.32 ± 6.17 | |
|
| 55.43 ± 6.91 | 58.68 ± 2.92 | 55.78 ± 7.18 | 53.15 ± 6.28 | 53.37 ± 7.41 | |
|
| 50.05 ± 6.86 | 58.60 ± 8.13 | 46.13 ± 5.57 | 55.66 ± 5.33 | 49.70 ± 5.41 | |
| HFD | 51.28 ± 6.06 | 57.38 ± 3.30 | 52.23 ± 7.21 | 48.52 ± 6.25 | 50.25 ± 6.29 | |
| 52.93 ± 6.93 | 59.42 ± 3.90 | 55.31 ± 5.98 | 51.47 ± 3.54 | 54.33 ± 4.38 | ||
| 52.93 ± 6.93 | 58.93 ± 3.86 | 54.14 ± 8.45 | 50.56 ± 5.60 | 54.04 ± 4.73 | ||
| KFD | 44.60 ± 8.37 | 41.05 ± 10.18 | 35.65 ± 9.99 | 50.75 ± 10.15 | 46.38 ± 8.46 | |
Figure 5Plots of classification accuracy (vertical) versus the number of features determined by the SBS-based feature selection procedure (horizontal). The solid curves represent the MDD-HC classification accuracy obtained by performing the 5-fold cross validation on the training set using different classifiers. The dotted curves represent the classification accuracy on the test set.
Figure 6Plots of the 61 optimal coherence features of SVM determined by the SBS-based feature selection procedure. Electrodes at the left and right hemispheres are marked in red and blue, respectively. It is noted that the line connected between an electrode pair does not represent that the coherence between the two electrodes is higher than a threshold, but shows that the coherence feature of the electrode pair is one of the 61 optimal coherence features.
Figure A1Features of the K-NN classifier. Among the 33 features, 32 are coherence features and the remaining one feature is the delta BP feature at TP7.
Figure A2Features of the LDA classifier. Among the 63 features, 60 are coherence features and the remaining three are all BP features.
Figure A3Features of the SVM classifier. Among the 62 features, 61 are coherence features and the remaining one is the beta BP feature at Pz.
The numbers and the percentages of BP, COH, HFD, and KFD in the optimal feature subset for each classifier.
| LDA | SVM | ||
|---|---|---|---|
| BP | 3.03% (1) | 4.76% (3) | 1.61% (1) |
| COH | 96.97% (32) | 95.24% (60) | 98.39% (61) |
| HFD | 0 (0) | 0 (0) | 0 (0) |
| KFD | 0 (0) | 0 (0) | 0 (0) |
Comparison of training and testing classification accuracy among different classifiers (in %).
| Classifier (Number of Features) | Training Set | Test Set | ||
|---|---|---|---|---|
| 5-Fold CV Accuracy | (Accuracy | Sensitivity | Specificity) | |
| K-NN ( | 66.43 ± 7.79 | 48.33 | 48.33 | 48.33 |
| LDA ( | 88.21 ± 5.60 | 69.17 | 75.00 | 63.33 |
| SVM ( | 86.07 ± 4.71 | 80.83 | 86.67 | 75.00 |
| SVM ( | 87.50 ± 4.92 | 77.50 | 85.00 | 70.00 |
| CK-SVM ( | 91.07 ± 3.43 | 84.16 | 88.33 | 80.00 |
| CK-SVM ( | 89.28 ± 3.29 | 80.83 | 88.33 | 73.33 |
Figure 7Means and 95% error bars (standard error of the mean) for MDD and HC participants. Left: frontal alpha asymmetry (FAA) calculated by the ratio (F4 − F3)/(F4 + F3) of alpha power. Right: EEG complexity represented by the HFD with kmax = 50. There were no significant differences in FAA (p = 0.66) and HFD-based complexity (p = 0.07) between the MDD and HC groups.
Five-fold cross validation accuracies among different frequency bands and scalp regions using a normalized BP and a LDA classifier (in %). The numbers inside and outside the brackets denote the accuracies obtained from the features with and without the SBS-based feature selection.
| EEG Features | Frontal | Central | Temporal | Parietal | ALL | |
|---|---|---|---|---|---|---|
| Normalized BP |
| 52.14 (56.43) | 57.14 (58.57) | 53.35 (58.21) | 42.86 (51.43) | 50.00 (63.57) |
|
| 53.93 (58.21) | 52.86 (54.64) | 57.14 (58.21) | 43.93 (54.64) | 54.29 (60.36) | |
|
| 51.43 (58.57) | 51.43 (52.86) | 46.43 (53.93) | 46.43 (53.93) | 49.29 (57.50) | |
|
| 54.29 (56.07) | 56.43 (57.86) | 56.07 (57.86) | 51.43 (55.00) | 52.50 (62.86) | |
|
| 54.29 (58.21) | 59.29 (59.29) | 50.00 (56.43) | 46.79 (55.00) | 48.21 (63.21) | |
Figure 8Characteristics of the 61 optimal coherence features selected by the SBS and SVM classifier. Upper-left: The occurring times of each electrode in the 61 features. Bottom-left: Proportion of each band in the 61 features. Remaining: Proportions of the intra-hemispheric (Intra) and inter-hemispheric (Inter) coherence features in each band. The gray part represents the regional connectivity of the electrode pair of FCz and Pz.