| Literature DB >> 35884796 |
Gaetano Zazzaro1, Luigi Pavone2.
Abstract
BACKGROUND: The development of automated seizure detection methods using EEG signals could be of great importance for the diagnosis and the monitoring of patients with epilepsy. These methods are often patient-specific and require high accuracy in detecting seizures but also very low false-positive rates. The aim of this study is to evaluate the performance of a seizure detection method using EEG signals by investigating its performance in correctly identifying seizures and in minimizing false alarms and to determine if it is generalizable to different patients.Entities:
Keywords: data mining; electroencephalogram; epilepsy; false-alarm rate; intracranial EEG; k-nearest neighbor; machine learning; seizure detection; signal processing
Year: 2022 PMID: 35884796 PMCID: PMC9312966 DOI: 10.3390/biomedicines10071491
Source DB: PubMed Journal: Biomedicines ISSN: 2227-9059
Figure 1Pipeline of data analysis: from raw data to seizure detection rules.
Features extracted by the Training Builder tool.
| ID | Name of Feature | Code | U/B | Count |
|---|---|---|---|---|
| 1 | Kolmogorov Complexity | KC | U | 36 |
| 2 | Log Energy Entropy | LE | U | 36 |
| 3 | Lower-Limit Lempel-Ziv Complexity | LL | U | 36 |
| 4 | Upper-Limit Lempel-Ziv Complexity | LU | U | 36 |
| 5 | Shannon Entropy | SH | U | 36 |
| 6 | Averaged Period | AP | U | 36 |
| 7 | Inverted Time to Peak | IP | U | 36 |
| 8 | Peak Displacement | PD | U | 36 |
| 9 | Predominant Period | PP | U | 36 |
| 10 | Squared Grade | SG | U | 36 |
| 11 | Squared Time to Peak | SP | U | 36 |
| 12 | Hjorth Mobility | HM | U | 36 |
| 13 | Kurtosis | KU | U | 36 |
| 14 | Standard Deviation | SD | U | 36 |
| 15 | Cross Correlation Index | CC | B | 36 MA + 36 MB |
| 16 | Conditional Entropy | CE | B | 36 MA + 36 MB |
| 17 | Dynamic Time Warping | DT | B | 36 MA + 36 MB |
| 18 | Euclidean Distance | ED | B | 36 MA + 36 MB |
| 19 | Joint Entropy | JE | B | 36 MA + 36 MB |
| 20 | Longest Common Sub-Sequence | LC | B | 36 MA + 36 MB |
| 21 | Levenshtein Distance | LD | B | 36 MA + 36 MB |
| 22 | Mutual Information | MI | B | 36 MA + 36 MB |
Figure 2Binary confusion matrix.
Performance metrics.
| # | Symbol | Performance Metric | Definition as | What Does It Measure? |
|---|---|---|---|---|
|
| ||||
| 1 |
|
|
| How good a model is at correctly predicting positive cases |
| 2 |
|
|
| How good a model is at correctly predicting negative cases |
| 3 |
| False-Positive Rate—Fall-out |
| Proportion of incorrectly classified negative cases |
| 4 |
|
|
| Proportion of correctly classified positive cases out of total positive predictions |
| 5 |
|
|
| Proportion of correctly classified negative cases out of total negative predictions |
|
| ||||
| 6 |
| Matthews Correlation Coefficient |
| Correlation between observed and predicted classifications |
|
| ||||
| 7 |
| ROC Area | Area Under the ROC Curve |
|
Final features selected by filters.
|
| E1 (10) | E2 (8) | E3 (4) | ||||
|
| B40 (6) | B70 (16) | |||||
|
| CE-MA (2) | CE-MB (2) | JE-MB (3) | KC (3) | LD-MA (3) | MI-MA (4) | SP (5) |
Description of the sets for data analysis.
|
| Name | Description | Registration Numbers | Use |
|---|---|---|---|---|
| 1 | FULL (D) | 52–76 | This dataset was used in the feature-selection phase by applying filters based on the Information Gain formula and the Pearson Correlation index. | |
| 2 | FULL_SELECTED | The features were selected, achieving 22 final features + “Actual YN” target class. | Disjointed sets IKTAL, INTERIKTAL and T were obtained. | |
| 3 | ORIGINAL TRAINING (T) |
Dataset with 96,135 instances, 95,798 tagged with “ | 52–60 |
This dataset was used for model training. The parameters of the |
| 4 | IKTAL |
About 1.5 h of preictal phase followed by a seizure of almost 95 s. This dataset had 5353 instances, 5258 tagged with “ | 122–123 | This dataset was used for testing the selected models in order to maximize the number of correctly classified positive instances, thus to detect the seizure. |
| 5 | INTERIKTAL |
About 10 h of records of interictal phase. All the 34,853 instances in this set were tagged with “ | 61–70 | This dataset was used for testing the models in order to reduce the number of false positives. |
s of the 9 selected training sets. Each was calculated as a maximum of 10 values, and each of these was obtained by a - trained on a different -undersampled set of .
|
| 1 | 2 | 5 | 10 | 20 | 25 | 50 | 100 | 284.3 |
|
| 0.949 | 0.951 | 0.955 | 0.954 | 0.955 | 0.954 | 0.936 | 0.925 | 0.865 |
Figure 3Recall– on () as varied.
Figure 4Precision– on as varied.
Figure 5– (Recall on ) as varied.
Figure 6Best models selected by 10-fold cross-validation.
Figure 7Curves to compare the results on the IKTAL set.
Figure 8Curves to compare the results on the INTERIKTAL set.
Figure 9All performance metrics of final model.
Figure 10Time variation in the Kolmogorov Complexity in the IKTAL set and focus on errors in the classification by .
Figure 11Time variation in the Kolmogorov Complexity in the INTERIKTAL set and focus on errors in the classification by .
Figure 12Non-consecutive seconds wrongly tagged with “” by .
Figure 13Qualitative analysis of false positives described in Figure 12: one-hour EEG signal from E1, band B40 (a), and a focused view on the time interval where the false positives were detected by the method (b).
Results of on other patients.
| PAT | No. of Seizures | No. of Instances | Origin | Average Seizure Duration (s) | AUC | Recall on | Recall on |
|---|---|---|---|---|---|---|---|
| 17 | 5 | 136,341 | Temporal | 86.16 | - | - | - |
| 3 | 5 | 108,836 | Frontal | 92.66 | 0.8411 | 0.5991 | 0.9826 |
| 4 | 4 | 99,363 | Temporal | 87.39 | 0.9835 | 0.9831 | 0.8739 |
| 11 | 3 | 103,520 | Parietal | 195.86 | 0.5992 | 0.0633 | 0.9992 |
| 13 | 2 | 95,127 | Temporal/Occipital | 158.28 | 0.5715 | 0.0221 | 0.9871 |
| 19 | 3 | 120,819 | Frontal | 12.54 | 0.5838 | 0.1672 | 1.0000 |
| 21 | 5 | 124,774 | Temporal | 89.09 | 0.5000 | 0.0000 | 0.9990 |