| Literature DB >> 35933348 |
Yujie Li1, Yingshan Shen1, Xiaomao Fan2, Xingxian Huang3, Haibo Yu3, Gansen Zhao1, Wenjun Ma1.
Abstract
BACKGROUND: Major depressive disorder (MDD) is a common mental illness, characterized by persistent depression, sadness, despair, etc., troubling people's daily life and work seriously.Entities:
Keywords: Depression detection; EEG; Two-stage feature selection
Mesh:
Year: 2022 PMID: 35933348 PMCID: PMC9357341 DOI: 10.1186/s12911-022-01956-w
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 3.298
Fig. 1Proposed framework for MDD detection and severity prediction: Firstly, in the input module of the framework, the raw EEG signals are derived by Nerron-Spectrum-5 to obtain the EEG rhythm features, and the subjects are diagnosed and scored by a physician to get the HAMD-17 score. Then, in the data processing module, the features are extracted from the EEG rhythm features and are Z-score standardized together with the rhythm features to obtain standardized features. Subjects with HAMD-17 scores greater than 17 are labeled as MDD, and those with HAMD-17 scores less than or equal to 17 are labeled as non-MDD. Moreover, the HAMD-17 score directly served as an indicator of MDD severity assessment. Then, in the feature selection module, PCC carries out the first stage feature selection on standardized features, and RFE carries out the second stage feature selection on reserved features. Finally, LR and SVM are used as classification models to classify subjects into MDD and non-MDD. LNR is used as the regression model to assess the severity of MDD, and the HAMD-17 score predicted by LNR is used as the severity indicator of MDD
MDD severity assessment performance
| Feature selection | Model | |||
|---|---|---|---|---|
| PCC | LNR | 3.6917 | 21.6649 | 0.0644 |
| LNR-DF | 3.5198 | 20.9566 | 0.0927 | |
| RFE | LNR | 1.5306 | 3.4942 | 0.8474 |
| LNR-DF | 1.2450 | 2.3995 | 0.8962 | |
| PCC and RFE | LNR | 1.2799 | 2.7388 | 0.8812 |
| LNR-DF | 0.9123 | 1.2012 | 0.9479 |
MDD detection performance of the proposed framework
| Feature selection | Model | |||||
|---|---|---|---|---|---|---|
| – | LR | 0.4476 | 0.3776 | 0.6167 | 0.5557 | 0.4005 |
| SVM | 0.4524 | 0.4809 | 0.7061 | 0.6139 | 0.4510 | |
| LR-DF | 0.5667 | 0.4728 | 0.6182 | 0.5984 | 0.5050 | |
| SVM-DF | 0.5333 | 0.5000 | 0.6894 | 0.6318 | 0.5144 | |
| PCC | LR | 0.5619 | 0.5061 | 0.6530 | 0.6207 | 0.5235 |
| SVM | 0.5381 | 0.5444 | 0.7591 | 0.6782 | 0.5306 | |
| LR-DF | 0.6238 | 0.6083 | 0.7394 | 0.6961 | 0.5988 | |
| SVM-DF | 0.5619 | 0.6442 | 0.7727 | 0.6969 | 0.5766 | |
| RFE | LR | 0.7381 | 0.8571 | 0.9106 | 0.8466 | 0.7803 |
| SVM | 0.9143 | 0.9667 | 0.9833 | 0.9579 | 0.9385 | |
| LR-DF | 0.8524 | 0.8298 | 0.8955 | 0.8800 | 0.8386 | |
| SVM-DF | 0.9429 | 1.0000 | 1.0000 | 0.9789 | 0.9667 | |
| PCC and RFE | LR | 0.7381 | 0.8571 | 0.9106 | 0.8466 | 0.7803 |
| SVM | 0.9714 | 0.9464 | 0.9652 | 0.9678 | 0.9581 | |
| LR-DF | 0.8238 | 0.9417 | 0.9636 | 0.9127 | 0.8713 | |
| SVM-DF | 0.9714 | 1.0000 | 1.0000 | 0.9895 | 0.9846 |
Performance comparison with cutting-edge MDD detection methods
| Method | ||||
|---|---|---|---|---|
| [ | – | – | 0.9000 | – |
| [ | 0.9490 | 0.8090 | 0.8790 | – |
| [ | 0.9444 | – | 0.8912 | – |
| [ | – | – | 0.8500 | – |
| [ | 0.9666 | 1.0000 | 0.9840 | – |
| [ | 0.9990 | 0.9500 | 0.9800 | 0.9700 |
| [ | – | – | 0.7927 | – |
| [ | – | – | 0.8833 | – |
| SVM-DF | 0.9714 | 1.0000 | 0.9895 | 0.9846 |
Fig. 2MDD severity assessment performance on random selected 10 records
Fig. 3Scatter chart of LNR and LNR-DF with two-stage feature selection
Fig. 4The process of determining and according to LR, SVM, and LNR respectively in feature selection: a LR, b SVM, c LNR
Hyper-parameter of feature selection
| Feature selection | Method | ||||
|---|---|---|---|---|---|
| – | SVM | – | 154 | 0.4510 | – |
| SVM-DF | – | 192 | 0.5144 | – | |
| LNR | – | – | – | – | |
| LNR-DF | – | – | – | – | |
| PCC | SVM | 0.055 | – | 0.5306 | – |
| SVM-DF | 0.055 | – | 0.5766 | – | |
| LNR | 0.210 | – | – | 0.0644 | |
| LNR-DF | 0.185 | – | – | 0.0927 | |
| RFE | SVM | – | 43 | 0.9385 | – |
| SVM-DF | – | 53 | 0.9667 | – | |
| LNR | – | 53 | – | 0.8474 | |
| LNR-DF | – | 66 | – | 0.8962 | |
| PCC and RFE | SVM | 0.040 | 38 | 0.9581 | – |
| SVM-DF | 0.015 | 36 | 0.9846 | – | |
| LNR | 0.010 | 59 | – | 0.8812 | |
| LNR-DF | 0.010 | 63 | – | 0.9479 |
Statistical power analysis between SVM-DF with two-stage feature selection and other MDD detection methods in the framework
| Feature selection | Model | |||||
|---|---|---|---|---|---|---|
| – | LR | 1.47e−05*** | 9.73e−06*** | 7.91e−06*** | 1.58e−06*** | 9.40e−06*** |
| SVM | 1.52e−05*** | 7.80e−06*** | 1.67e−04*** | 6.50e−06*** | 2.80e−06*** | |
| LR-DF | 8.90e−07*** | 6.39e−06*** | 2.43e−05*** | 3.95e−06*** | 4.27e−06*** | |
| SVM-DF | 2.72e−04*** | 4.82e−05*** | 3.28e−05*** | 1.70e−06*** | 3.67e−05*** | |
| PCC | LR | 4.81e−05*** | 7.46e−06*** | 2.64e−05*** | 1.53e−06*** | 3.66e−06*** |
| SVM | 6.10e−06*** | 2.76e−06*** | 3.39e−04*** | 1.11e−05*** | 7.80e−07*** | |
| LR-DF | 8.37e−06*** | 1.44e−05*** | 1.90e−04*** | 5.78e−06*** | 1.30e−07*** | |
| SVM-DF | 4.61e−04*** | 2.99e−06*** | 2.30e−04*** | 2.44e−06*** | 1.26e−05*** | |
| RFE | LR | 3.88e−04*** | 1.42e−03*** | 3.99e−04*** | 2.88e−04*** | 4.41e−04*** |
| SVM | 8.64e−03*** | 2.59e−03*** | 4.07e−03*** | 2.55e−03*** | 2.61e−03*** | |
| LR-DF | 1.97e−03*** | 1.88e−02** | 2.18e−02** | 7.23e−05*** | 1.28e−04*** | |
| SVM-DF | 1.61e−02** | 3.74e−01* | 3.85e−01* | 5.37e−02* | 2.86e−02** | |
| PCC and RFE | LR | 1.44e−04*** | 5.34e−04*** | 1.78e−04*** | 5.68e−05*** | 1.21e−04*** |
| SVM | 2.08e−01* | 1.70e−01* | 1.84e−01* | 9.50e−03*** | 3.60e−03*** | |
| LR-DF | 3.16e−03*** | 4.47e−03*** | 3.99e−03*** | 7.74e−04*** | 9.18e−04*** | |
*P ≥ 0.05; **0.05 > P ≥ 0.01; ***P < 0.01
Statistical power analysis between LNR-DF with two-stage feature selection and other MDD severity assessment methods in the framework
| Feature selection | Model | |||
|---|---|---|---|---|
| – | LNR | 7.43e−06*** | 4.61e−05*** | 2.43e−04*** |
| LNR-DF | 4.07e−06*** | 5.48e−05*** | 9.15e−05*** | |
| PCC | LNR | 4.00e−08*** | 6.00e−08*** | 4.00e−08*** |
| LNR-DF | 7.90e−07*** | 1.40e−07*** | 3.00e−08*** | |
| RFE | LNR | 1.60e−02** | 1.81e−02** | 1.63e−02** |
| LNR-DF | 3.79e−03*** | 7.15e−03*** | 2.78e−03*** | |
| PCC and RFE | LNR | 1.25e−02** | 1.44e−02** | 1.10e−02** |
*P ≥ 0.05; **0.05 > P ≥ 0.01; ***P < 0.01
Fig. 5Weight importance of selected features shared by SVM-DF and LNR-DF