| Literature DB >> 33364482 |
Sunil Kumar Prabhakar1, Harikumar Rajaguru2.
Abstract
The basic function of the brain is severely affected by alcoholism. For the easy depiction and assessment of the mental condition of a human brain, Electroencephalography (EEG) signals are highly useful as it can record and measure the electrical activities of the brain much to the satisfaction of doctors and researchers. Utilizing the standard conventional techniques is quite hectic to derive the useful information as these signals are highly non-linear and non-stationary in nature. While recording the EEG signals, the activities of the neurons are recorded from various scalp regions which has varied characteristics and has a very low magnitude. Therefore, human interpretation of such signals is very difficult and consumes a lot of time. Hence, with the advent of Computer Aided Diagnosis (CAD) Techniques, identifying the normal versus alcoholic EEG signals has been of great utility in the medical field. In this work, we perform the initial clustering of the alcoholic EEG signals by means of using Correlation Dimension (CD) for easy feature extraction and then the suitable features are selected in it by means of employing various distance metrics like correlation distance, city block distance, cosine distance and chebyshev distance. Proceeding in such a methodology aids and assures that a good discrimination could be achieved between normal and alcoholic EEG signals using non-linear features. Finally, classification is then carried out with the suitable classifiers chosen such as Adaboost.RT classifier, the proposed Modified Adaboost.RT classifier by means of introducing Ridge and Lasso based soft thresholding technique, Random Forest with bootstrap resampling technique, Artificial Neural Networks (ANN) such as Radial Basis Functions (RBF) and Multi-Layer Perceptron (MLP), Support Vector Machine (SVM) with Linear, Polynomial and RBF Kernel, Naïve Bayesian Classifier (NBC), K-means classifier, and K Nearest Neighbor (KNN) Classifier and the results are analyzed. Results report a comparatively high classification accuracy of about 98.99% when correlation distance metrics are utilized with CD and the proposed Modified Adaboost.RT classifier using Ridge based soft thresholding technique.Entities:
Keywords: Alcoholism; Classification; Computer science; Correlation distance; Distance metrics; EEG
Year: 2020 PMID: 33364482 PMCID: PMC7750377 DOI: 10.1016/j.heliyon.2020.e05689
Source DB: PubMed Journal: Heliyon ISSN: 2405-8440
Figure 1A simplified block diagram of the work for easy understanding.
Figure 2Histogram plot (a) CD for alcoholic EEG Signals for a patient.(b) Chi-Square PDF for C D with ten degree of Freedom for. alcoholic EEG Signal of a patient.
Average value of Statistical Parameters of CD Features for Alcoholic EEG Signal of a Subject.
| Sl. No | Statistical Parameter | Numerical Value |
|---|---|---|
| 1 | Mean | 4.091731 |
| 2 | Variance | 10.73556 |
| 3 | Skewness | 0.849836 |
| 4 | Kurtosis | 0.477126 |
Figure 3Histogram of Hilbert transform of normalized CD values for alcoholic EEG signal.
Figure 4Cumulative Distribution Function (CDF) Plot for (a) CD Mean values b) CD Skewness values.
Figure 5(a) Histogram of Chi Square PDF with ten degree of freedom for statistical parameters of CD (b) Histogram of CD with Distance features.
Figure 6Scatter Plot (a) between Correlation and City block Distance Features of CD Values (b) between Cosine and Chebyshev Distance Features of CD Values.
MSE analysis for RBF architecture.
| Sl.No | RBF Architecture | Training MSE | Testing MSE |
|---|---|---|---|
| 1 | 32-1-1 | 0.00019321 | 0.002506079 |
| 2 | 32-2-1 | 0.000545623 | 7.84E-06 |
| 3 | 32-4-1 | 0.000001 | 8.56625E-06 |
| 4 | 32-8-1 | 9.39067E-05 | 7.05094E-06 |
| 5 | 32-16-1 | 1.156E-05 | 2.84625E-06 |
| 6 | 32-32-1 | 1.96E-06 | 4.84E-06 |
| 32-64-1 | 1.44E-06 | 4.69844E-07 | |
| 8 | 32-128-1 | 1.76241E-05 | 1.96E-06 |
| 9 | 32-148-1 | 2.90631E-05 | 4.41E-06 |
MSE analysis for MLP architecture.
| Sl.No | MLP Architecture | Training MSE | Testing MSE |
|---|---|---|---|
| 1 | 32-1-1 | 0.000163929 | 0.000001 |
| 2 | 32-2-1 | 8.836E-05 | 7.84E-06 |
| 3 | 32-4-1 | 1.369E-05 | 4E-06 |
| 4 | 32-8-1 | 0.000036 | 6.4E-06 |
| 5 | 32-16-1 | 4.80644E-05 | 5.24E-06 |
| 6 | 32-32-1 | 1.31567E-05 | 1.156E-05 |
| 32-64-1 | 8.1E-07 | 1E-08 | |
| 8 | 32-128-1 | 4.84E-06 | 4.84E-06 |
| 9 | 32-148-1 | 1E-06 | 0 |
Consolidated Result Analysis of Correlation Distance metric with classifiers.
| Classifiers | PI(%) | Sensitivity (%) | Specificity (%) | Accuracy (%) |
|---|---|---|---|---|
| Adaboost.RT | 89.98 | 90.63 | 100 | 95.315 |
| Ridge Based Modified Adaboost.RT dependent on Soft Thresholding | 98.93 | 97.98 | 100 | 98.99 |
| Lasso Based Modified Adaboost.RT dependent on Soft Thresholding | 96.31292 | 96.4541 | 100 | 98.22705 |
| Random Forest | 95.65 | 95.83 | 100 | 97.915 |
| RBF | 97.87 | 97.92 | 100 | 98.96 |
| MLP | 81.47 | 84.38 | 100 | 92.19 |
| Linear SVM | 84.315 | 86.46 | 100 | 93.23 |
| Polynomial SVM | 82.93 | 85.42 | 100 | 92.71 |
| RBF SVM | 86.37 | 88.0225 | 100 | 94.01125 |
| NBC | 85.7 | 87.5 | 100 | 93.75 |
| K-means | 51.163 | 67.188 | 100 | 83.594 |
| KNN | 84.315 | 86.46 | 100 | 93.23 |
Consolidated Result Analysis of City Block Distance metric with classifiers.
| Classifiers | PI(%) | Sensitivity (%) | Specificity (%) | Accuracy (%) |
|---|---|---|---|---|
| Adaboost.RT | 83.31458 | 86.77778 | 100 | 93.38889 |
| Ridge Based Modified Adaboost.RT dependent on Soft Thresholding | 97.87 | 97.92 | 100 | 98.96 |
| Lasso Based Modified Adaboost.RT dependent on Soft Thresholding | 97.41 | 97.48 | 100 | 98.74 |
| Random Forest | 85.7 | 87.5 | 100 | 93.75 |
| RBF | 90.78 | 91.15 | 100 | 95.575 |
| MLP | 94.49 | 94.79 | 100 | 97.395 |
| Linear SVM | 77.925 | 100 | 80.225 | 90.1125 |
| Polynomial SVM | 84.315 | 100 | 86.46 | 93.23 |
| RBF SVM | 89.98 | 90.63 | 100 | 95.315 |
| NBC | 78.465 | 100 | 82.295 | 91.1475 |
| K-means | 66.66 | 75 | 100 | 87.5 |
| KNN | 88.38 | 100 | 89.59 | 94.795 |
Consolidated Result Analysis of Cosine Distance metric with classifiers.
| Classifiers | PI(%) | Sensitivity (%) | Specificity (%) | Accuracy (%) |
|---|---|---|---|---|
| Adaboost.RT | 76.92 | 81.25 | 100 | 90.625 |
| Ridge Based Modified Adaboost.RT dependent on Soft Thresholding | 94.49 | 94.79 | 100 | 97.395 |
| Lasso Based Modified Adaboost.RT dependent on Soft Thresholding | 97.83 | 100 | 97.88 | 98.94 |
| Random Forest | 80.01 | 83.34 | 100 | 91.67 |
| RBF | 93.615 | 93.75 | 100 | 96.875 |
| MLP | 82.565 | 85.16 | 100 | 92.58 |
| Linear SVM | 88.38 | 89.59 | 100 | 94.795 |
| Polynomial SVM | 93.33 | 93.75 | 100 | 96.875 |
| RBF SVM | 96.76 | 96.875 | 100 | 98.4375 |
| NBC | 63.76 | 73.43 | 100 | 86.71 |
| K-means | 40 | 62.5 | 100 | 81.25 |
| KNN | 77.925 | 80.225 | 100 | 90.1125 |
Consolidated Result Analysis of Chebyshev Distance metric with classifiers.
| Classifiers | PI(%) | Sensitivity (%) | Specificity (%) | Accuracy (%) |
|---|---|---|---|---|
| Adaboost.RT | 93.33 | 100 | 93.75 | 96.875 |
| Ridge Based Modified Adaboost.RT dependent on Soft Thresholding | 92.455 | 92.71 | 100 | 96.355 |
| Lasso Based Modified Adaboost.RT dependent on Soft Thresholding | 97.57681 | 98.40806 | 99.27778 | 98.84292 |
| Random Forest | 89.98 | 90.63 | 100 | 95.315 |
| RBF | 90.78 | 91.15 | 100 | 95.575 |
| MLP | 91.79 | 91.93 | 100 | 95.965 |
| Linear SVM | 85.7 | 100 | 87.5 | 93.75 |
| Polynomial SVM | 81.47 | 100 | 84.38 | 92.19 |
| RBF SVM | 92.017 | 92.19 | 100 | 96.095 |
| NBC | 63.76 | 100 | 73.43 | 86.71 |
| K-means | 51.163 | 67.188 | 100 | 83.594 |
| KNN | 75.86 | 78.15 | 100 | 89.075 |
Consolidated MSE and GDR Result Analysis with Distance metric and Classifiers.
| Consolidated Analysis | Correlation Distance Metric | City Block Distance Metric | Chebyshev Distance Metric | Cosine Distance Metric | ||||
|---|---|---|---|---|---|---|---|---|
| Classifiers | MSE | GDR | MSE | GDR | MSE | GDR | MSE | GDR |
| Adaboost.RT | 1.44E-06 | 90.63 | 8.267E-06 | 86.77778 | 4.9E-07 | 93.33333 | 1.296E-05 | 81.25 |
| Ridge Based Modified Adaboost.RT dependent on Soft Thresholding | 1E-08 | 98.02 | 4E-08 | 97.92 | 8.1E-07 | 92.71 | 3.6E-07 | 94.79474 |
| Lasso Based Modified Adaboost.RT dependent on Soft Thresholding | 1.873E-07 | 96.46086 | 2.5E-09 | 97.48 | 1.239E-07 | 97.6705 | 4E-08 | 97.83482 |
| Random Forest | 2.5E-07 | 95.83958 | 4E-06 | 87.5 | 1.44E-06 | 90.63 | 9E-06 | 83.34 |
| RBF | 4E-08 | 97.92 | 1.21E-06 | 91.15 | 1.21E-06 | 91.15 | 6.4E-07 | 93.75 |
| MLP | 7.84E-06 | 84.38 | 3.6E-07 | 94.79 | 8.1E-07 | 91.93 | 5.76E-06 | 85.16 |
| Linear SVM | 4.84E-06 | 86.46 | 1.44E-05 | 75.32727 | 4E-06 | 85.71429 | 2.25E-06 | 89.59 |
| Polynomial SVM | 6.76E-06 | 85.42 | 4.84E-06 | 84.33958 | 7.29E-06 | 81.4885 | 6.4E-07 | 93.75 |
| RBF SVM | 2.89E-06 | 88.0225 | 1.44E-06 | 90.63 | 6.4E-07 | 92.19 | 9E-08 | 96.875 |
| NBC | 4E-06 | 87.5 | 1.09E-05 | 78.48593 | 2.3E-05 | 63.82979 | 2.21E-05 | 73.45 |
| K-means | 4.1E-05 | 67.18263 | 2.03E-05 | 75 | 4.1E-05 | 67.188 | 8.65E-05 | 62.5 |
| KNN | 5.76E-06 | 85.94 | 2.25E-06 | 88.3804 | 1.68E-05 | 78.12 | 1.37E-05 | 80.21 |
Comparison of our works with previous works.
| Authors | Features Obtained | Classification technique | Classification Accuracy (%) |
|---|---|---|---|
| Patidar et al [ | Correlation Entropy | LS -SVM | 97.02 |
| Faust et al [ | Higher Order Spectra (HOS) Features | Fuzzy Sugeno Classifier (FSC) | 92.40 |
| Acharya et al [ | Entropy, HOS and LLE | SVM with different kernels | 91.7 |
| Kannathal et al [ | CD and entropy | Distinct ranges | 90 |
| Proposed Work | CD with the Correlation Distance Metrics | Adaboost.RT | 95.315 |
| Ridge Based Modified Adaboost.RT dependent on Soft Thresholding | 98.99 | ||
| Lasso Based Modified Adaboost.RT dependent on Soft Thresholding | 98.227 | ||
| Random Forest | 97.915 | ||
| RBF | 98.96 | ||
| MLP | 92.19 | ||
| Linear SVM | 93.23 | ||
| Polynomial SVM | 92.71 | ||
| RBF SVM | 94.011 | ||
| NBC | 93.75 | ||
| K-means | 83.594 | ||
| KNN | 93.23 | ||
| Proposed Work | CD with the City Block Distance Metrics | Adaboost.RT | 93.388 |
| Ridge Based Modified Adaboost.RT dependent on Soft Thresholding | 98.96 | ||
| Lasso Based Modified Adaboost.RT dependent on Soft Thresholding | 98.74 | ||
| Random Forest | 93.75 | ||
| RBF | 95.575 | ||
| MLP | 97.395 | ||
| Linear SVM | 90.112 | ||
| Polynomial SVM | 93.23 | ||
| RBF SVM | 95.315 | ||
| NBC | 91.147 | ||
| K-means | 87.5 | ||
| KNN | 94.795 | ||
| Proposed Work | CD with the Cosine Distance Metrics | Adaboost.RT | 90.625 |
| Ridge Based Modified Adaboost.RT dependent on Soft Thresholding | 97.395 | ||
| Lasso Based Modified Adaboost.RT dependent on Soft Thresholding | 98.94 | ||
| Random Forest | 91.67 | ||
| RBF | 96.875 | ||
| MLP | 92.58 | ||
| Linear SVM | 94.795 | ||
| Polynomial SVM | 96.875 | ||
| RBF SVM | 98.437 | ||
| NBC | 86.71 | ||
| K-means | 81.25 | ||
| KNN | 90.1125 | ||
| Proposed Work | CD with the Chebyshev Distance Metrics | Adaboost.RT | 96.875 |
| Ridge Based Modified Adaboost.RT dependent on Soft Thresholding | 96.355 | ||
| Lasso Based Modified Adaboost.RT dependent on Soft Thresholding | 98.842 | ||
| Random Forest | 95.315 | ||
| RBF | 95.575 | ||
| MLP | 95.965 | ||
| Linear SVM | 93.75 | ||
| Polynomial SVM | 92.19 | ||
| RBF SVM | 96.095 | ||
| NBC | 86.71 | ||
| K-means | 83.594 | ||
| KNN | 89.075 |