| Literature DB >> 35455821 |
Emrah Aydemir1, Sengul Dogan2, Mehmet Baygin3, Chui Ping Ooi4, Prabal Datta Barua5,6, Turker Tuncer2, U Rajendra Acharya7,8,9.
Abstract
BACKGROUND ANDEntities:
Keywords: EEG classification; NCA; cyclic group of prime order pattern; kNN; machine learning; schizophrenia detection
Year: 2022 PMID: 35455821 PMCID: PMC9027158 DOI: 10.3390/healthcare10040643
Source DB: PubMed Journal: Healthcare (Basel) ISSN: 2227-9032
Automatic SZ classification using ML techniques.
| Author(s) | Year | Data Set Feature | Method |
|---|---|---|---|
| R. Buettner et al. [ | 2020 | EEG [ | Spectral analysis-based feature extraction and classification using random forest. |
| 14 schizophrenias, 14 HCs; | |||
| 1 min segmentation. | |||
| V. Jahmunah et al. [ | 2019 | EEG [ | Nonlinear statistical moment-based feature extraction, feature selection with |
| 14 schizophrenias, 14 HCs; | |||
| 25 s segmentation. | |||
| S.L. Oh et al. [ | 2019 | EEG [ | Convolutional neural network |
| 14 schizophrenias, 14 HCs; | |||
| 25 s segmentation. | |||
| L.S. Mayo et al. [ | 2017 | EEG; | Feature extraction at time and frequency domains, J5 feature selection, and classification with multilayer perceptron. |
| 16 schizophrenias, 31 HCs; | |||
| 0.8 s segmentation. | |||
| L. Zhang [ | 2019 | EEG [ | Event-related potential feature extraction and classification with random forest. |
| 49 schizophrenias, 32 HCs; | |||
| 3 s segmentation. | |||
| Z. Chen et al. [ | 2020 | Magnetic resonance images (MRIs) [ | Image segmentation for detecting gray matter, white matter, and cerebrospinal fluid; two-sample |
| 34 schizophrenias, 34 HCs. | |||
| C.W. Espinola et al. [ | 2021 | Voice; | Acoustic feature extraction, particle swarm optimization (PSO)-based feature selection, and classification using SVM. |
| 20 schizophrenias, 11 HCs; | |||
| 96.9 min HC, 125.7 min schizophrenia | |||
| 10 s segmentation. | |||
| A.N. Chandran et al. [ | 2021 | EEG [ | Time-domain-based feature extraction and classification deploying long short-term memory (LSTM). |
| 14 schizophrenias, 14 HCs; | |||
| 4 s segmentation | |||
| D. Lei et al. [ | 2019 | MRIs | Gray matter, white matter, low-frequency fluctuation, regional homogeneity, structural covariance matrices, and functional connectivity matrices and SVM classifier. |
| Combination of five data sets; | |||
| 295 schizophrenias, 452 HCs; | |||
| H. Akbari et al. [ | 2021 | EEG [ | Graphical feature extraction, forward feature selection algorithm, and classification with kNN. |
| 14 schizophrenias, 14 HCs; | |||
| Z. Aslan and M. Akin [ | 2020 | Two EEG data sets: | Spectrogram images from EEG signals and classification using VGG16 deep network. |
| Data set 1: 45 schizophrenias, 39 HCs [ | |||
| Data set 2: 14 schizophrenias, 14 HCs [ | |||
| 5 s segmentation. |
Attributes of the data set used in this work.
| Feature | Value |
|---|---|
| Groups | 14 Schizophrenic groups, 14 control groups |
| Gender | 28 Patients (14 males, 14 females) |
| Average Age | 27.9 ± 3.3 (7 schizophrenic males) |
| 28.3 ± 4.1 (7 schizophrenic females) | |
| 26.8 ± 2.9 (7 healthy males) | |
| 28.7 ± 3.4 (7 healthy females) | |
| Length of Each EEG Segment | 25 s (250 × 25 = 6250) |
Figure 1Schematic diagram of the proposed CGP17Pat-based schizophrenia detection model.
Generated cyclic group of 17 orders.
|
| 3 | 9 | 10 | 13 | 5 | 15 | 11 | 16 | 14 | 8 | 7 | 4 | 12 | 2 | 6 | 1 |
|
| 5 | 8 | 6 | 13 | 14 | 2 | 10 | 16 | 12 | 9 | 11 | 4 | 3 | 15 | 7 | 1 |
|
| 6 | 2 | 12 | 4 | 7 | 8 | 14 | 16 | 11 | 15 | 5 | 13 | 10 | 9 | 3 | 1 |
|
| 7 | 15 | 3 | 4 | 11 | 9 | 12 | 16 | 10 | 2 | 14 | 13 | 6 | 8 | 5 | 1 |
|
| 10 | 15 | 14 | 4 | 6 | 9 | 5 | 16 | 7 | 2 | 3 | 13 | 11 | 8 | 12 | 1 |
|
| 11 | 2 | 5 | 4 | 10 | 8 | 3 | 16 | 6 | 15 | 12 | 13 | 7 | 9 | 14 | 1 |
|
| 12 | 8 | 11 | 13 | 3 | 2 | 7 | 16 | 5 | 9 | 6 | 4 | 14 | 15 | 10 | 1 |
|
| 14 | 9 | 7 | 13 | 12 | 15 | 6 | 16 | 3 | 8 | 10 | 4 | 5 | 2 | 11 | 1 |
Figure 2The created eight patterns using CGP17, where each pattern is named as P; by using each pattern, 256 features are extracted and our presented CGP17Pat uses these eight patterns together: (a) patterns 1–4; (b) patterns 5–8.
Figure 3The graphical summarization of the presented CGP17Pat. Here, P denotes patterns (see Table 2), and each pattern extracts 256 features. Then, these feature vectors are merged, and 2048 (=256 × 8) features are created.
Fine-tuned hyperparameters of the kNN classifier.
| Hyperparameter | Value |
|---|---|
|
| 1 |
| Distance | Euclidean |
| Weight | None |
| Standardize Data | True |
Parameters of our CGP17Pat-based EEG classification model.
| Phase | Parameters |
|---|---|
| Feature Extraction | MAP: 2-, 4-, and 8-sized overlapping blocks were used |
| CGP17Pat: 16-sized overlapping blocks were used, and eight patterns were deployed | |
| INCA | Range: [100–1000] |
| Classification | Fine kNN with 10-fold cross-validation and LOSO |
| Iterative Hard Majority Voting | The iteration range selected was [ |
The 10-fold cross-validation and LOSO cross-validation results (%) of the CGP17Pat-based model.
| Channel | 10-Fold Cross-Validation | LOSO Cross-Validation | ||||||
|---|---|---|---|---|---|---|---|---|
| Accuracy | Sensitivity | Specificity | Geometric Mean | Accuracy | Sensitivity | Specificity | Geometric Mean | |
| Fp1 | 99.47 | 99.36 | 99.61 | 99.49 | 75.48 | 69.65 | 82.56 | 75.83 |
| Fp2 | 98.95 | 99.04 | 98.84 | 98.94 | 82.22 | 80.03 |
| 82.42 |
| F7 | 99.47 | 99.20 | 99.81 | 99.50 |
|
| 84.30 |
|
| F3 | 99.30 | 99.20 | 99.42 | 99.31 | 73.99 | 69.49 | 79.46 | 74.31 |
| Fz | 98.77 | 98.56 | 99.03 | 98.80 | 71.72 | 70.13 | 73.64 | 71.86 |
| F4 | 99.30 | 99.52 | 99.03 | 99.28 | 71.80 | 66.77 | 77.91 | 72.13 |
| F8 | 99.12 | 99.36 | 98.84 | 99.10 | 77.58 | 77.96 | 77.13 | 77.54 |
| T3 | 99.39 | 99.20 | 99.61 | 99.41 | 79.68 | 80.51 | 78.68 | 79.59 |
| C3 | 99.30 | 99.04 | 99.61 | 99.33 | 71.28 | 77.48 | 63.76 | 70.28 |
| Cz | 98.77 | 98.72 | 98.84 | 98.78 | 71.10 | 68.05 | 74.81 | 71.35 |
| C4 | 99.65 | 99.52 | 99.81 | 99.66 | 74.96 | 69.17 | 81.98 | 75.30 |
| T4 | 99.65 | 99.52 | 99.81 | 99.66 | 81.61 | 82.27 | 80.81 | 81.54 |
| T5 | 99.56 | 99.36 | 99.81 | 99.58 | 81.61 | 80.67 | 82.75 | 81.70 |
| P3 | 99.65 | 99.52 | 99.81 | 99.66 | 72.50 | 73.80 | 70.93 | 72.35 |
| Pz |
|
| 99.81 |
| 76.09 | 80.35 | 70.93 | 75.49 |
| P6 | 99.47 | 99.52 | 99.42 | 99.47 | 79.60 | 77.80 | 81.78 | 79.76 |
| T6 | 99.56 | 99.68 | 99.42 | 99.55 | 74.69 | 71.73 | 78.29 | 74.94 |
| O1 | 99.65 | 99.36 |
| 99.68 | 79.95 | 77.48 | 82.95 | 80.16 |
| O2 | 99.65 | 99.52 | 99.81 | 99.66 | 79.60 | 78.12 | 81.40 | 79.74 |
The calculated voted results (%) according to the 10-fold cross-validation and LOSO cross-validation.
| Validation | Number of Channels | Accuracy | Sensitivity | Specificity | Geometric Mean |
|---|---|---|---|---|---|
| 10-fold | 3 | 99.91 | 99.84 | 100 | 99.92 |
| LOSO | 17 | 84.33 | 77 | 93.22 | 84.72 |
Figure 4The lengths of the optimal feature vectors chosen by INCA.
Figure 5Classification accuracies of the decision tree (DT), quadratic discriminant (QD), logistic regression (LR), naive Bayes (NB), support vector machine (SVM), Fine kNN (kNN), bagged tree (BT), ensemble subspace kNN (ESkNN), and artificial neural network (ANN) for the Fp2 channel with 10-fold cross-validation.
Figure 6The obtained comparative results according to the validation technique.
Figure 7Voted results: confusion matrices of the presented CGP17Pat-based EEG classification model using (a) 10-fold cross-validation and (b) LOSO cross-validation.
Automatic schizophrenia classification based on EEG signals (2019–2021).
| Author(s) | Year | Method | Segmentation | Validation | Result(s) |
|---|---|---|---|---|---|
| R. Buettner et al. [ | 2020 | Spectral analysis, random forest | 1 min | 10-Fold cross-validation | Acc. = 96.77 |
| Bac. = 96.77 | |||||
| Kap. = 93.55 | |||||
| R. Buettner et al. [ | 2019 | Independent component analysis, random forest | - | 10-Fold cross-validation | Acc. = 71.43 |
| Bac. = 80.0 | |||||
| P.T. Krishnan et al. [ | 2020 | Multivariate empirical model decomposition, entropy computation, recursive feature elimination, and SVM | 2 s | 10-Fold cross-validation | Acc. = 93.0 |
| Auc. = 98.31 | |||||
| Sen. = 94.0 | |||||
| Spe. = 92.0 | |||||
| Pre. = 92.71 | |||||
| FScr. = 93.04 | |||||
| A.N. Chandran et al. [ | 2021 | Time-domain feature extraction, long short-term memory (LSTM) | 4 s | Holdout | Acc. = 99.0 |
| Pre. = 99.2 | |||||
| 88:12 | Rec. = 98.9 | ||||
| FScr. = 99.0 | |||||
| K. Singh et al. [ | 2021 | Fast Fourier transform, spectral feature extraction, CNN, and LSTM | 5 s | Holdout | Acc. = 98.96 |
| Sen. = 99.05 | |||||
| 90:10 | Spe. = 98.88 | ||||
| FScr. = 98.87 | |||||
| S.L. Oh et al. [ | 2019 | Custom convolutional neural network (CNN) design, subject and non-subject based testing | 25 s | 10-Fold cross-validation | Non-Sub. |
| Acc. = 98.07 | |||||
| Sen. = 97.32 | |||||
| Spe. = 98.17 | |||||
| Subject | |||||
| Acc. = 81.26 | |||||
| Sen. = 75.42 | |||||
| Spe. = 87.59 | |||||
| M. Baygin [ | 2021 | Tunable Q-factor wavelet transform (TQWT), statistical moment, ReliefF, and kNN | 25 s | 10-Fold cross-validation | Acc. = 99.12 |
| Pre. = 99.04 | |||||
| Rec. = 99.36 | |||||
| Geo. = 99.10 | |||||
| FScr. = 99.20 | |||||
| K. Kim et al. [ | 2021 | Microstate features; statistical, frequency, and time domain features; | 5 s | 10-Fold cross-validation | Acc. = 75.64 |
| Auc. = 80.19 | |||||
| Sen. = 71.93 | |||||
| Spe. = 75.50 | |||||
| M. Krishnaveni et al. [ | 2019 | Non-local mean algorithm, empirical mode decomposition, discrete Fourier transform, mel-warp triangular filter, and optimized backpropagation neural network | - | 10-Fold cross-validation | Acc. = 90.26 |
| Sen. = 88.64 | |||||
| Spe. = 89.17 | |||||
| V. Jahmunah et al. [ | 2019 | Nonlinear feature extraction, | 25 s | 10-Fold cross-validation | Acc. = 92.91 |
| Sen. = 93.45 | |||||
| Spe. = 92.24 | |||||
| H. Akbari et al. [ | 2021 | Graphical feature extraction, forward selection algorithm, and kNN | - | 10-Fold cross-validation | Acc. = 94.80 |
| Sen. = 94.30 | |||||
| Spe. = 95.20 | |||||
| Z. Aslan and M. Akin [ | 2020 | Spectrogram images from EEG signals, VGG16-based CNN | 5 s | - | Acc. = 97.0 |
| Rec. = 97.0 | |||||
| FScr. = 97.0 | |||||
| A. Shoeibi et al. [ | 2021 | CNN and LSTM | 25 s | 5-Fold cross-validation | Acc. = 99.25 |
| Pre. = 98.33 | |||||
| Rec. = 98.86 | |||||
| Auc. = 99.73 | |||||
| M. Sharma and U.R. Acharya [ | 2021 | L1 Norm, ES-KNN | 25 s | 10-Fold cross-validation and LOSO | 10-fold CV |
| Acc. = 99.21 | |||||
| LOSO CV | |||||
| Acc. = 97.2 | |||||
| Our Method | CGP17Pat, MAP, INCA, kNN, and iterative hard majority voting | 25 s | 10-Fold cross-validation and LOSO | 10-fold CV | |
| Acc. = 99.91 | |||||
| LOSO CV | |||||
| Acc. = 84.33 | |||||
Acc. = accuracy; Sen. = sensitivity; Spe. = specificity; Bac. = balanced accuracy; Kap. = kappa; FScr. = F-score; Pre. = precision; Rec. = recall; Geo. = geometric mean; SVM = support vector machine; kNN = k-nearest neighbor; CV = cross-validation.