| Literature DB >> 25710040 |
Paul Fergus1, David Hignett1, Abir Hussain1, Dhiya Al-Jumeily1, Khaled Abdel-Aziz2.
Abstract
The epilepsies are a heterogeneous group of neurological disorders and syndromes characterised by recurrent, involuntary, paroxysmal seizure activity, which is often associated with a clinicoelectrical correlate on the electroencephalogram. The diagnosis of epilepsy is usually made by a neurologist but can be difficult to be made in the early stages. Supporting paraclinical evidence obtained from magnetic resonance imaging and electroencephalography may enable clinicians to make a diagnosis of epilepsy and investigate treatment earlier. However, electroencephalogram capture and interpretation are time consuming and can be expensive due to the need for trained specialists to perform the interpretation. Automated detection of correlates of seizure activity may be a solution. In this paper, we present a supervised machine learning approach that classifies seizure and nonseizure records using an open dataset containing 342 records. Our results show an improvement on existing studies by as much as 10% in most cases with a sensitivity of 93%, specificity of 94%, and area under the curve of 98% with a 6% global error using a k-class nearest neighbour classifier. We propose that such an approach could have clinical applications in the investigation of patients with suspected seizure disorders.Entities:
Mesh:
Year: 2015 PMID: 25710040 PMCID: PMC4325968 DOI: 10.1155/2015/986736
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Seizure information for each case.
| Case | Number of seizures | Gender | Age |
|---|---|---|---|
| 1 | 7 | F | 11 |
| 2 | 3 | M | 11 |
| 3 | 7 | F | 14 |
| 4 | 4 | M | 22 |
| 5 | 5 | F | 7 |
| 6 | 10 | F | 1.5 |
| 7 | 3 | F | 14.5 |
| 8 | 5 | M | 3.5 |
| 9 | 4 | F | 10 |
| 10 | 7 | M | 3 |
| 11 | 3 | F | 12 |
| 12 | 27 | F | 2 |
| 13 | 10 | F | 3 |
| 14 | 8 | F | 9 |
| 15 | 20 | M | 16 |
| 16 | 8 | F | 7 |
| 17 | 3 | F | 12 |
| 18 | 6 | F | 18 |
| 19 | 3 | F | 19 |
| 20 | 8 | F | 6 |
| 21 | 4 | F | 13 |
| 22 | 3 | F | 9 |
| 23 | 7 | F | 6 |
| 24 | 16 | Unknown | Unknown |
Results for Feature Selection techniques.
| AUCs for Feature Selection techniques | ||||||||
|---|---|---|---|---|---|---|---|---|
| AUCknn | AUCknn | AUCsvn | AUCknn | AUCtreec | AUCknn | AUCloglc | AUCknn | AUCSVN |
| P | q | PC1 | PC2 | PC1 & 2 | LDAi | LDAf | LDAb | GS |
| 90 | 89 | 83 | 88 | 87 | 86 | 88 | 91 | 88 |
|
| ||||||||
| Sensitivities for Feature Selection techniques | ||||||||
| SENSknn | SENSknn | SENSsvn | SENSknn | SENStreec | SENSknn | SENSloglc | SENSknn | SENSloglc |
|
|
| PC1 | PC2 | PC1 & 2 | LDAi | LDAf | LDAb | GS |
|
| ||||||||
| 83 | 84 | 53 | 86 | 80 | 78 | 76 | 84 | 76 |
|
| ||||||||
| Specificities for Feature Selection techniques | ||||||||
| SPECknn | SPECknn | SPECsvn | SPECknn | SPECtreec | SPECknn | SPECloglc | SPECknn | SPECloglc |
|
|
| PC1 | PC2 | PC1 & 2 | LDAi | LDAf | LDAb | GS |
|
| ||||||||
| 83 | 82 | 90 | 81 | 79 | 80 | 85 | 85 | 86 |
List of channels for the five scalp regions.
| Region | Channels |
|---|---|
| 1 | FP1-F7, F7-T7, FP1-F3, F3-C3, T7-FT9 |
| 2 | FP2-F4, F4-C4, FP2-F8, F8-T8, T8-FT10 |
| 3 | T7-P7, P7-O7, C3-P3, P3-O1 |
| 4 | C4-P4, P4-O2, T8-P8, P8-O2 |
| 5 | FZ-CZ, CZ-PZ, FT9-FT10 |
Top five features for the five scalp regions.
| Feature set | Description | Features |
|---|---|---|
| 1 | Top 5 features from region 1 | RMS CH2 0.5–30 Hz |
|
| ||
| 2 | Top 5 features from region 2 | RMS CH16 0.5–30 Hz |
|
| ||
| 3 | Top 5 features from region 3 | RMS CH3 0.5–30 Hz |
|
| ||
| 4 | Top 5 features from region 4 | RMS CH18 4–8 Hz |
|
| ||
| 5 | Top 5 features from region 5 | RMS CH21 0.5–30 Hz |
Figure 1PCA for RMS feature discrimination.
Summary of classifiers considered in this study.
| Classifiers | Features | Validation | Sample sizes |
|---|---|---|---|
| Density-based | Variance | Holdout cross-validation | Original (171 seizures/171 nonseizures) |
| Linear discriminant classifier (LDC) | |||
| Quadratic discriminant classifier (QDC) | Root mean squares |
| |
| Uncorrelated normal density classifier (UDC) | |||
| Linear and polynomial-based | Skewness | Sensitivity/specificity | |
| Polynomial classifier (POLYC) | Kurtosis | ||
| Logistic classifier (LOGLC) | Peak frequency | SMOTE (342 seizures/342 nonseizures) | |
| Nonlinear-based | Median frequency | Receiver operator curve | |
|
| |||
| Decision tree classifier (TREEC) | |||
| Parzen classifier (PARZENC) | Sample entropy | Area under the curve | |
| Support vector classifier (SVC) |
Classifier performance results for top 20 uncorrelated features.
| Classifier | Sensitivity | Specificity | AUC |
|---|---|---|---|
| LDC | 70% | 83% | 54% |
| QDC | 65% | 92% | 62% |
| UDC | 39% | 95% | 65% |
| POLYC | 70% | 83% | 83% |
| LOGLC | 79% | 86% | 89% |
| KNNC |
|
|
|
| TREEC | 78% | 80% | 86% |
| PARZENC | 61% | 86% | 54% |
| SVC | 79% | 86% | 88% |
Cross-validation results for top 20 uncorrelated features.
| Classifiers | 80% holdout: 100 repetitions | Cross-validation, 5-fold, 1 repetition | Cross-validation, 5-fold, 100 repetitions | ||
|---|---|---|---|---|---|
| Mean error | SD | Mean error | Mean error | SD | |
| LDC | 0.2386 | 0.0506 | 0.2427 | 0.2398 | 0.0107 |
| QDC | 0.2179 | 0.0434 | 0.2164 | 0.2171 | 0.0064 |
| UDC | 0.3299 | 0.0431 | 0.3304 | 0.3310 | 0.0035 |
| POLYC | 0.2388 | 0.0507 | 0.2544 | 0.2385 | 0.0107 |
| LOGLC | 0.1771 | 0.0489 | 0.1813 | 0.1734 | 0.0085 |
| KNNC |
|
| 0.1696 | 0.1674 | 0.0148 |
| TREEC | 0.2071 | 0.0510 | 0.1959 | 0.2003 | 0.0157 |
| PARZENC | 0.2651 | 0.0493 | 0.2544 | 0.2640 | 0.0100 |
| SVC | 0.1752 | 0.0416 | 0.1608 | 0.1728 | 0.0072 |
Figure 2Received operator curve for top 20 uncorrelated features.
Classifier performance results from top five uncorrelated features from five head regions.
| Classifier | Sensitivity | Specificity | AUC |
|---|---|---|---|
| LDC | 78% | 88% | 55% |
| QDC | 84% | 86% | 60% |
| UDC | 51% | 91% | 70% |
| POLYC | 78% | 88% | 89% |
| LOGLC | 82% | 84% | 90% |
| KNNC |
|
|
|
| TREEC | 82% | 81% | 89% |
| PARZENC | 81% | 93% | 61% |
| SVC | 85% | 86% | 90% |
Cross-validation results from top five uncorrelated features from five regions.
| Classifiers | 80% holdout: 100 repetitions | Cross-validation, 5-fold, 1 repetition | Cross-validation, 5-fold, 100 repetitions | ||
|---|---|---|---|---|---|
| Mean error | SD | Mean error | Mean error | SD | |
| LDC | 0.1690 | 0.0419 | 0.1696 | 0.1675 | 0.0120 |
| QDC | 0.1493 | 0.0449 | 0.1462 | 0.1509 | 0.0088 |
| UDC | 0.2926 | 0.0440 | 0.2836 | 0.2940 | 0.0037 |
| POLYC | 0.1690 | 0.0419 | 0.1871 | 0.1709 | 0.0091 |
| LOGLC | 0.1734 | 0.0413 | 0.1696 | 0.1648 | 0.0120 |
| KNNC |
|
| 0.0936 | 0.1135 | 0.0101 |
| TREEC | 0.1835 | 0.0460 | 0.1988 | 0.1784 | 0.0202 |
| PARZENC | 0.1328 | 0.0433 | 0.1316 | 0.1325 | 0.0146 |
| SVC | 0.1460 | 0.0378 | 0.1316 | 0.1411 | 0.0101 |
Figure 3Received operator curve for top five uncorrelated features from five head regions.
Classifier performance results for top 20 uncorrelated features using SMOTE.
| Classifier | Sensitivity | Specificity | AUC |
|---|---|---|---|
| LDC | 72% | 84% | 54% |
| QDC | 64% | 94% | 64% |
| UDC | 38% | 95% | 66% |
| POLYC | 72% | 84% | 85% |
| LOGLC | 82% | 88% | 92% |
| KNNC |
|
|
|
| TREEC | 87% | 88% | 92% |
| PARZENC | 75% | 92% | 57% |
| SVC | 82% | 89% | 91% |
Cross-validation results for top 20 uncorrelated features using SMOTE.
| Classifiers | 80% holdout: 100 repetitions | Cross-validation, 5-fold, 1 repetition | Cross-validation, 5-fold, 100 repetitions | ||
|---|---|---|---|---|---|
| Mean error | SD | Mean error | Mean error | SD | |
| LDC | 0.2174 | 0.0328 | 0.2237 | 0.2158 | 0.0073 |
| QDC | 0.2062 | 0.0286 | 0.2003 | 0.2037 | 0.0055 |
| UDC | 0.3322 | 0.0297 | 0.3333 | 0.3314 | 0.0020 |
| POLYC | 0.2174 | 0.0328 | 0.2266 | 0.2148 | 0.0056 |
| LOGLC | 0.1498 | 0.0285 | 0.1477 | 0.1469 | 0.0048 |
| KNNC |
|
| 0.0599 | 0.0614 | 0.0074 |
| TREEC | 0.1234 | 0.0295 | 0.1360 | 0.1227 | 0.0115 |
| PARZENC | 0.1620 | 0.0420 | 0.1462 | 0.1589 | 0.0130 |
| SVC | 0.1428 | 0.0292 | 0.1418 | 0.1412 | 0.0055 |
Figure 4Received operator curve for top 20 uncorrelated features using SMOTE.
Classifier performance results for top five uncorrelated features ranked using LDA backward search feature selection from five regions and oversampled using SMOTE.
| Classifier | Sensitivity | Specificity | AUC |
|---|---|---|---|
| LDC | 82% | 90% | 56% |
| QDC | 87% | 92% | 63% |
| UDC | 52% | 91% | 70% |
| POLYC | 82% | 90% | 92% |
| LOGLC | 88% | 87% | 94% |
| KNNC |
|
|
|
| TREEC | 90% | 90% | 94% |
| PARZENC | 96% | 98% | 82% |
| SVC | 90% | 89% | 93% |
Cross-validation results for top five uncorrelated features ranked using LDA backward search feature selection from five regions and oversampled using SMOTE.
| Classifiers | 80% holdout: 100 repetitions | Cross-validation, 5-fold, 1 repetition | Cross-validation, 5-fold, 100 repetitions | ||
|---|---|---|---|---|---|
| Mean error | SD | Mean error | Mean error | SD | |
| LDC | 0.1359 | 0.0291 | 0.1374 | 0.1308 | 0.0044 |
| QDC | 0.1060 | 0.0267 | 0.1023 | 0.1082 | 0.0043 |
| UDC | 0.2835 | 0.0304 | 0.2851 | 0.2881 | 0.0025 |
| POLYC | 0.1359 | 0.0291 | 0.1301 | 0.1337 | 0.0049 |
| LOGLC | 0.1260 | 0.0262 | 0.1213 | 0.1182 | 0.0072 |
| KNNC |
|
| 0.0278 | 0.0311 | 0.0049 |
| TREEC | 0.0974 | 0.0319 | 0.1082 | 0.0969 | 0.0117 |
| PARZENC | 0.0321 | 0.0170 | 0.0336 | 0.0341 | 0.0054 |
| SVC | 0.1072 | 0.0255 | 0.1067 | 0.1034 | 0.0063 |
Figure 5Received operator curve for top five uncorrelated features ranked using LDA backward search feature selection from five regions and oversampled using SMOTE.
Seizure detection studies and classification results.
| Author | Year | Dataset | Classifier | Patients | Sensitivity (%) | Specificity (%) | Accuracy (%) | FPR/h |
|---|---|---|---|---|---|---|---|---|
| Aarabi et al. [ | 2006 | AMI | BPNN | 6 | 91.00 | 95.00 | 93.00 | 1.17 |
| Acharya et al. [ | 2012 | BONN | PNN, SVM, C4.5, BC, FSC, KNN, GMM | 10 | 94.4–99.4 | 91.1–100 | 88.1–95.9 | — |
| Bao et al. [ | 2008 | BONN | PNN | 10 | — | — | 71–96.8 | — |
| Chandaka et al. [ | 2009 | BONN | SVM | 10 | 92.00 | 100 | 95.96 | — |
| Kannathal et al. [ | 2005 | BONN | ANFIS | 10 | 91.49 | 93.02 | 92.2 | — |
| Kumar et al. [ | 2010 | BONN | EN, RBNN | 10 | — | — | 94.5 | — |
| Kumari and Jose [ | 2011 | BONN | SVM | 5 | 100.00 | 100 | 100 | 0 |
|
Acharya et al. [ | 2012 | BONN | SVM | 10 | 94.38 | 93.23 | 80.9–86.1 | — |
|
Polat and Güneş [ | 2007 | BONN | DTC | 10 | 99.40 | 99.31 | 98.72 | — |
|
Polat and Güneş [ | 2008 | BONN | C4.5 | 10 | 99.49 | 99.12 | 99.32 | — |
|
Song and Liò [ | 2010 | BONN | BPNN, ELM | 10 | 97.26 | 98.77 | 95.67 | — |
| Srinivasan et al. [ | 2007 | BONN | PNN, EN | — | — | 100 | ||
| Subasi [ | 2007 | BONN | MPNN, ME | 10 | 95.00 | 94 | 94.5 | — |
| Subasi and Gursoy [ | 2010 | BONN | SVM | 99-100 | 98.5–100 | 98.75–100 | — | |
|
Übeyli [ | 2008 | BONN | SVM | 10 | 99.25 | 100 | 99.3 | — |
|
Übeyli [ | 2009 | BONN | PNN, SVM, MPNN, CNN, ME, MME, RNN | 10 | 99.20 | 99.78 | 99.2 | — |
| Yuan et al. [ | 2011 | BONN | SVM, BPNN, ELM | 10 | 92.50 | 96 | 96 | — |
| Zheng et al. [ | 2012 | BXH | SVM | 7 | 44.23 | — | — | 1.6–10.9 |
| Khan et al. [ | 2012 | CHBMIT | LDA | 5 | 83.60 | 100 | 91.8 | |
| Nasehi and Pourghassem [ | 2013 | CHBMIT | IPSONN | 23 | 98.00 | — | — | 0.125 |
| Shoeb [ | 2009 | CHBMIT | SVM | 24 | 96.00 | — | — | 0.08 |
|
Acir and Güzeliş [ | 2004 | DEU | SVM | 7 | 90.30 | — | — | |
| Rasekhi et al. [ | 2013 | EUR | SVM | 10 | 73.90 | — | — | 0.15 |
| Park et al. [ | 2011 | FRE | SVM | 18 | 92.5–97.5 | — | — | 0.2–0.29 |
| Patel et al. [ | 2009 | FRE | SVM, LDA, QDA, MDA | 21 | 90.9–94.2 | 59.5–77.9 | 76.5–87.7 | — |
| Patnaik and Manyam [ | 2008 | FRE | BPNN | 21 | 91.29 | 99.19 | — | — |
| Williamson et al. [ | 2011 | FRE | SVM | 21 | 90.80 | — | — | 0.094 |
| Yuan et al. [ | 2012 | FRE | ELM | 21 | 93.85 | 94.89 | 94.9 | 0.35 |
| Bao et al. [ | 2008 | JPH | PNN | 12 | — | — | 94.07 | — |
| Saab and Gotman [ | 2005 | MON | BC | 76.00 | — | — | 0.34 | |
| Grewal and Gotman [ | 2005 | MON2 | BC | 16 | 89.40 | — | — | 0.22 |
| D'Alessandro et al. [ | 2005 | PEN & BON | PNN | 2 | 100.00 | — | — | 1.1 |
| Sorensen et al. [ | 2010 | RIG | SVM | 6 | 77.8–100 | — | — | 0.16–5.31 |
|
Acharya et al. [ | 2012 | SGR & BONN | PNN, SVM | 21 + 10 | — | — | 99.9 | — |
| Buhimschi et al. [ | 1998 | Unknown | PNN | 4 | 62.50 | 90.47 | — | 0.2775 |
|
Subasi [ | 2006 | Unknown | DFNN | 5 | 93.10 | 92.8 | 93.1 | — |