Literature DB >> 35755757

EEG-Based Epileptic Seizure Detection via Machine/Deep Learning Approaches: A Systematic Review.

Ijaz Ahmad1,2,3, Xin Wang1,2,3, Mingxing Zhu2,4, Cheng Wang1,2,3, Yao Pi5, Javed Ali Khan6, Siyab Khan7, Oluwarotimi Williams Samuel1,3, Shixiong Chen1,3, Guanglin Li1,3.   

Abstract

Epileptic seizure is one of the most chronic neurological diseases that instantaneously disrupts the lifestyle of affected individuals. Toward developing novel and efficient technology for epileptic seizure management, recent diagnostic approaches have focused on developing machine/deep learning model (ML/DL)-based electroencephalogram (EEG) methods. Importantly, EEG's noninvasiveness and ability to offer repeated patterns of epileptic-related electrophysiological information have motivated the development of varied ML/DL algorithms for epileptic seizure diagnosis in the recent years. However, EEG's low amplitude and nonstationary characteristics make it difficult for existing ML/DL models to achieve a consistent and satisfactory diagnosis outcome, especially in clinical settings, where environmental factors could hardly be avoided. Though several recent works have explored the use of EEG-based ML/DL methods and statistical feature for seizure diagnosis, it is unclear what the advantages and limitations of these works are, which might preclude the advancement of research and development in the field of epileptic seizure diagnosis and appropriate criteria for selecting ML/DL models and statistical feature extraction methods for EEG-based epileptic seizure diagnosis. Therefore, this paper attempts to bridge this research gap by conducting an extensive systematic review on the recent developments of EEG-based ML/DL technologies for epileptic seizure diagnosis. In the review, current development in seizure diagnosis, various statistical feature extraction methods, ML/DL models, their performances, limitations, and core challenges as applied in EEG-based epileptic seizure diagnosis were meticulously reviewed and compared. In addition, proper criteria for selecting appropriate and efficient feature extraction techniques and ML/DL models for epileptic seizure diagnosis were also discussed. Findings from this study will aid researchers in deciding the most efficient ML/DL models with optimal feature extraction methods to improve the performance of EEG-based epileptic seizure detection.
Copyright © 2022 Ijaz Ahmad et al.

Entities:  

Mesh:

Year:  2022        PMID: 35755757      PMCID: PMC9232335          DOI: 10.1155/2022/6486570

Source DB:  PubMed          Journal:  Comput Intell Neurosci


1. Introduction

Epileptic seizure is a well-known chronic neurological and noncommunicable disease, occurring in 4% to 16% of organ recipients and affecting between 60–70 million people worldwide [1]. Epilepsy can be observed at any age, with a higher incidence in infants and the elderly. Every year, around three million people are affected by this disease [1-4]. An epileptic seizure is a sudden abnormality in the brain's electrical activities, manifesting as excessive discharges of neuronal networks in the cerebral cortex and affecting the whole body [2]. It should be noted that the prediction of a seizure is hard, and for some patients, there may be hundreds of seizures in only one day, which may cause irreversible damage to the brain. Therefore, the timely detection and treatment of epilepsy are of great significance to control the development of the disease and improve the life quality of the patients. The most common causes include the shortage of oxygen during childbirth, malformations of organs, and low blood pressure [3, 4]. EEG (Electroencephalogram) is a method that records the neural electrophysiological activity of the brain by applying several electrodes over the subject's head with some criteria. EEG with different waveforms reflects different frequencies. By comparing, clinicians can diagnose some diseases related to the neural system. Several studies about epilepsy monitoring have been carried out based on electroencephalography (EEG) [5, 6], magnetoencephalography (MEG) [6], positron emission tomography (PET) [7], single-photon emission computerized tomography (SPECT) [8], functional magnetic resonance imaging (fMRI) [5], electrocorticography (ECoG) [9], and functional near-infrared spectroscopy (fNIRS) [9]. Compared to other techniques used in epilepsy, EEG signal devices are portable and economical, with their recordings being time-domain, and they can be transformed into frequency domain. EEG signals are produced by ionic currents from the variations in voltage coming from the brain's neurons, which show the brain's electric activity and are widely used in epileptic seizure detection [10, 11]. As shown in Figure 1, neuro-experts have categorized seizures based on the symptoms into two major categories, partial and generalized [4, 12]. A partial seizure can be defined by its symptoms, mainly caused by the affecting on the cerebral hemisphere.
Figure 1

The illustration of seizure types and their subtypes.

Moreover, a partial seizure can also be divided into two main groups: simple-partial and complex-partial. In the simple-partial, the person looks conscious and can generally communicate, while in the complex-partial, the patients behave abnormally, get confused, and typically act by chewing and mumbling. A generalized seizure also has two main parts. Nonconclusive seizure is diagnosed by obvious motor signs, while conclusive seizures are difficult to diagnose for having no motor signs. The person can only stare and not make additional motions or moments [12, 13]. In the epileptic seizure detection task, the neurologists analyze and diagnose the information reflected from EEG signals, such as the waveform, frequency, and amplitude, since EEG signals in a seizure will manifest some special indications like spikes. However, realizing the efficient detection of epilepsy seizures is frequently a time-consuming and exhausting task with the high possibility of human error, relying on clinicians' visual inspection. To be more specific, the limitations of manual epilepsy diagnosis can be listed as follows: It requires the physicians to have plenty of experience in clinical diagnosis and professional skills, making it more subjective and possible for misdiagnosis. Besides, different clinicians may draw an inconsistent conclusion over the same EEG signals based on their experience [4]. EEG signals are weak electrophysiological signals, which means they are easily interfered with by noises and have a sharp decrease in their signal-to-noise ratio (SNR). EEG signals submerged in noise might have some changes in their waveform and make it difficult to diagnose [12]. The amount of EEG signals used to make a diagnosis of epilepsy is large. In the clinical setting, the EEG signals are usually recorded synchronously with video signals to help diagnose using some behavior indications, which further increases the clinicians' workload. It takes clinicians at least 16 hours to go through the EEG signals of the patients and make the diagnosis [11]. In clinical setting, the interruption of reviewing EEG signals and the heavy work load tremendously affect the clinician's judgments on the signals, which may cause misdiagnosis [13]. Based on the aforementioned limitation, finding a technique to solve those problems is worthwhile and important. With artificial intelligence (AI) development, computer-based prediction techniques, including machine/deep learning classifiers, may alleviate these challenges. In recent years, machine/deep learning techniques have been widely used in the clinical diagnosis of diseases, especially in the application of epileptic seizures. These machine/deep learning techniques greatly free the clinicians from the heavy workload, significantly improve the diagnosis efficiency, and provide an objective and accurate diagnosis. Moreover, the number of studies in this area using machine/deep learning (ML/DL) keeps growing rapidly. The keywords “EEG,” “Epilepsy,” “Epileptic Seizures,” “Deep Learning,” and “Machine Learning” were exploited to search articles. The keywords were searched in several citation databases, including IEEE, PubMed, Elsevier, Springer, Wiley, and ArXiv. In addition, Google Scholar was also utilized for further search. Figure 2 shows the number of articles that have been accepted into each citation database. It has been noticed that IEEE, Elsevier, and Springer citation databases included the most accepted articles.
Figure 2

The proportions of accepted paper for the review using different citation database.

Initially, 400 accepted research articles were found in search engines. After keywords and title searches in each citation database, 200 articles were found. Furthermore, full-text searches were conducted manually to select the best-accepted articles for review, 150 best potential articles were presented for the comprehensive review, and 50 articles were excluded. The first excluded criterion was the non-English articles. While the second excluded criterion represents the articles without availability of the performance metrics (accuracy, pre, sens, and spec), as shown in Figure 3.
Figure 3

A framework of how the systematic review was conducted.

In this paper, the main contribution is divided into four parts and is discussed as follows: We have accomplished a comprehensive review of the three key dimensions. Firstly, the analysis of the statistical features and extraction methods of EEG signals in epilepsy seizures were achieved. Secondly, a systematic review of machine/deep learning models was conducted, including their performance, limitations, and associated challenges in epilepsy seizure datasets. Thirdly, we investigated the performance achieved by machine/deep learning models based on the logical results during adequate detection. Throughout the research, we have found that a random forest model is more effective and efficient than other classifiers based on the adequate detection, and the random forest model handles high dimensional of the dataset and retrieves sensible information. For further analysis, we have selected the time-domain feature extraction method with 9-statistical features (standard deviation, kurtosis, skewness, energy, line length, entropy, mean, mode, and Hurst) because they help the machine/deep learning models to retrieve relevant knowledge and the best logical result (accuracy of 98–100%). The comprehensive review will help the researchers identify and use the most efficient machine/deep learning models with statistical feature extraction methods to improve the research in epileptic seizure detection. The paper distributions are as follows: Section 2 shows a framework for seizure detection. Section 3 contains a detailed review of significant features and extraction methods, machine/deep learning models and challenges in seizure detection. Section 4 presents results and discussion.

2. Epilepsy Detection and Classification Process

The procedure for epilepsy seizure detection and classification is described as follows:

2.1. A Framework of Seizure Detection

We present a framework of seizure detection using an EEG seizure dataset in the given context. Four steps are needed to accomplish the seizure detection process, including data collection and preparation, feature extraction and selection, and machine/deep learning techniques to classify the seizure. The whole framework of epileptic seizure detection is given in Figure 4.
Figure 4

A block diagram of epileptic seizure detection using EEG signals and machine/deep learning techniques.

2.2. Data Collection

Firstly, one of the most important parts to achieve seizure detection is data collection. It can be obtained using an EEG monitoring device to collect the EEG signals of the brain. The EEG monitoring device locates the EEG cap on the scalp area presented in 10–20 international systems [14]. The monitoring device records the electrical signals from different electrodes or channels connected with wires to the scalp's surface with various voltage and spatial information [15]. Moreover, these noisy EEG signals have been carefully investigated and monitored by the neuro-expert and categorized into ‘seizure' and ‘non-seizure' states.

2.3. Data Transformation

Data transformation is a difficult step after data collection, which converts the raw EEG signal data into a table format of 2-D. However, this relevant information is not sufficient for analysis to identify seizures. Various features' selection and modalities are applied to give precise information about a seizure.

2.4. Dataset Preparation

After successfully transforming the dataset (data transformation process), the next step is the preprocessing data phase. It is a data mining technique that transforms raw data into a meaningful and understandable format, removing null values, data reduction, and data cleaning of EEG seizure datasets [16].

2.5. Publicly Available Datasets

Using a dataset is crucial for data scientists and experts as it permits them to evaluate their proposed model's performance. Publicly accessible datasets are very important because they offer a benchmark to analyze the results by comparing each dataset. There are many online existing epilepsy-related datasets, and most of the recent research prefers to use the mentioned datasets, which are further illustrated as follows:

2.5.1. CHB-MIT—EEG Dataset

This dataset is generated at Children's Hospital Boston and the Massachusetts Institute of Technology (CHB-MIT) [17, 18] and is publicly accessible on a PhysioNet server. The dataset contains 23 patients: 5 men aged between 3 and 22 years and 17 girls aged from 1.5 to 19 years. Each patient has numerous seizure and nonseizure recording files in European data format (.edf).

2.5.2. Bonn University—EEG Dataset

This dataset is split into five files (A–E) and includes 100 single-channel recordings. Each file has a record of 23.6 s, while all the signals have equal 128 channels recorded using 10–20 international electrodes system [19].

2.5.3. Kaggle—EEG Dataset

The EEG dataset is part of the American Epilepsy Society's epileptic seizures detection challenge. It includes intracranial EEG signals from five dogs and two people who had 48 seizures spanning 627 hours. The EEG signals of dogs were recorded using 16 implanted electrodes, which were sampled at 400 kHz. In comparison, the EEG signals of patients 1 and 2 were recorded using 15 deep electrodes and 24 subdural electrodes, sampled at 5 kHz [20].

2.5.4. Fribourg—EEG Dataset

This EEG dataset contains invasive EEG signals from 21 patients with refractory focal epilepsy monitored at the University Hospital of Fribourg's epilepsy center before surgery. The signals were collected during presurgical epilepsy monitoring. The intracortical grid, strip, and depth electrodes were used to provide direct recording from the focal area, reduce artifacts, and achieve a higher signal-to-noise ratio (SNR) [21].

2.5.5. Bern Barcelona—EEG Dataset

The Barcelona database was compiled by the Bern Hospital's brain department in Barcelona, including intracranial EEG recordings from individuals who have focal epilepsy. Subjects were followed for many days without the use of antiepileptic medications to evaluate whether they were having seizures or needed surgery. The signals were collected using intracortical electrodes from AD-Tech, with one additional reference electrode located between the PZ and FZ positions [22].

2.5.6. Zenodo—EEG Dataset

This dataset has multichannel EEG recordings of 79 human neonates recorded at Helsinki University Hospital, with an average recording length of 74 minutes. Three experts documented 460 seizures, 39 neonates were found to have seizures, and 22 neonates were seizure-free [23]. Table 1 contains a list of the additional information for each dataset. Figure 5 shows the number of each dataset used in epileptic seizures detection based on ML/DL techniques.
Table 1

Presents full description of publicly available EEG dataset datasets for epilepsy seizure detection.

DatasetRecordingNo. of seizureSampling frequencyTimesNo. of patients
CHB-MIT [18, 19]Scalp EEG16325684422
Bonn [20]Surface and IEEGNA173.6139 m10
Freiburg [22]IEEG8725670821
Kaggle [21]IEEG48400/5 KHz6275 dogs, 2 patients
Zenodo [23, 24]Scalp EEG46025674 m79 neonatal
Bern Barcelona [22]IEEG375051283m5
Figure 5

Represents various datasets in different studies for epilepsy seizure detection using ML/DL techniques.

2.6. Feature Extraction and Selection Techniques Applied in Epilepsy Seizure Detection

Feature extraction is considered a core component of any pattern recognition system [24]. It is mainly because the feature extraction process often adopts a mathematically driven algorithm that helps extract relevant information mostly from a raw dataset to better characterize the pattern of interest at any given point in time. In many cases, integrating a feature extraction component in a pattern recognition system often leads to a better performance in accurately distinguishing various patterns of interest and yielding such results faster than the direct usage of the raw data [24]. Thus, feature extraction is considered necessary in developing an efficiently intelligent system for epileptic seizure detection. In the feature extraction stage of such a system, various approaches have been applied to the raw EEG signals toward obtaining information that allows the proper analysis of the underlying phenomenon of interest. The different commonly adopted feature extraction methods for EEG signal characterization are shown in Figure 6. After the feature extraction process task is completed, the resulting signals become more accessible and would certainly become highly informative for classifying the inherent seizure [25]. As mentioned earlier, it should be noted that using ML algorithms directly on the raw data set may produce low accuracy or even inconsistent results and most certainly require a relatively longer time to complete the prediction task [26]. Therefore, it is necessary to adopt a feature extraction technique, and at the same time, choose the best technique since there are several kinds of features for characterizing physiological signals, and selecting efficient statistical features is required when facing a challenging task.
Figure 6

Different commonly adopted feature extraction methods for EEG signal characterization.

Fundamentally, there are two ways in which features are often extracted from the EEG signal of interest, namely handcrafted and automatic extraction. The handcrafted extraction features are multivariate [27] and univariate in both frequency and time domains. In contrast, automatic features include mean [28], kurtosis, skewness, entropy [28], Horthy parameters, statistical moments, and variance [29]. Meanwhile, the most commonly adopted feature that is widely implemented in EEG signal characterization includes time-domain (TD), time-frequency domain (TFD), frequency domain (FD), fourier transform (FT), discrete wavelet transform (DWT), and continuous wavelet transform (CWT)-based features [30]. Abbasi et al. introduced wavelet scalograms (WSs) feature extraction techniques with DL models to detect HI brain injury and got satisfactory results [31]. Logesparan et al. [32] used various statistical feature extraction methods on EEG datasets but concentrated on only two features, “relative power” and “line length,” which produced better performance in seizure detection. Amin et al. [33] introduced tritime domain approaches for features selection with statistical features, namely line length, frequency, and energy in epilepsy seizure detection. They used CHB-MIT and BONN datasets to test the detection accuracy and reached 93–99% by calculating F-score, sensitivity, and specificity. Besides, many researchers implemented a single feature in epileptic seizure detection [34-36]. For example, Guo et al. [37] tested a single feature “line length” with machine learning classifiers ANN to classify EEG signal recordings, and the accuracy was 95.6%. Koolan et al. [34] introduced “line length” as a feature to detect seizures with a specificity of 85% and a sensitivity of 84%. Some researchers used a single feature, “line length,” while others applied many convenient features. However, many researchers have utilized other statistical features, which resulted in lesser accuracy (%) and more computational time (sec). After feature extraction, one of the essential tasks is choosing a collection of informative, small, and compact features that have improved discriminating power. These features serve as the basic blocks for tasks, such as detection, classification, and regression, in biomedical signal processing. They are also one of the most important stages in the data analysis process. Indeed, features are a novel way of representing data, and they may be binary, categorical, or continuous. For example, characteristics, such as the patient's age, health condition, family history, electrode location, or EEG signal descriptors may be considered (voltage, frequency amplitude, phase, etc.). Therefore, it is suggested that the polynomial-based methods are used before applying machine-learning models to derive low-dimensional features. Usually, polynomial features aim to create/add new input features based on the existing features. The “degree” of the polynomial is used to control the number of features added, e.g., a degree of 3 will add two new variables for each input variable. Different polynomial-based methods are available and may be used to decrease computation time and make more effective use of computer resources, which helps them become more popular [44]. Various efficient linear and nonlinear dimensionality reduction methods for feature selection in EEG-based epileptic seizure detection are shown in Table 2.
Table 2

Efficient polynomial-based methods for the features selection of EEG epileptic seizure detection.

Feature selection methodsDescription
[38] principal component analysis (PCA)It was implemented to compress highly correlated features into a lower-dimensional subspace and use in various pattern recognition applications, including EEG signal classification
[39] T-distributed stochastic neighbor embedding (t-SNE)Used to decrease the dimensionality of nonlinear data with a high-dimensionality of complexity to a lower-dimensional subspace. It is extensively utilized to present large amounts of high-dimensional biological data
[40] kernel principal component analysis (KPCA)Used to handle the problem of nonlinear dimensionality reduction and useful for data compression using electroencephalogram (EEG) signals
[41] independent component analysis (ICA)Process multivariate data representing the vast database samples as EEG signal is composed of various random signals
[42] locally linear embedding (LLE) [43] generalized discriminant analysis (GDA)One of the most frequently utilized methods for extracting the nonlinear features uses the EEG signal. GDA is a highly effective method for extracting the nonlinear features of EEG signal data because generalized discriminants are calculated by mapping the training data in large dimensions of space using a kernel function

3. Comprehensive Review of Efficient ML/Deep Learning Classifiers

Various pieces of literature have introduced machine/deep learning models for epileptic seizure detection using EEG signals datasets [45, 46] with statistical features methods and nonlinear parameters. In machine/deep learning models [47-53], random forest classifier (nonblack-box) and support vector machine (SVM), k-nearest neighbor (K-NN), artificial neural networks (ANN), convolutional neural network (CNN), recurrent neural networks (RNN), and autoencoder (AE) (“black-box”) are considered for review because of their remarkable performances in seizure detection.

3.1. Black-Box Classifiers in Seizure Detection

3.1.1. Convolutional Neural Network (2D-CNN)

CNN is a popular deep learning classifier to predict and diagnose medical diseases [54]. Initially, CNN was used for image classification [55]. However, recent 1D-CNN has been modified to two-dimensionalarchitectures, broadly used to apply epileptic seizures and to process the EEG signal. Table 3 presents a review of recent works that adopted 2D-CNN models to predict an epileptic seizure.
Table 3

A review of recent research that applied the 2D-CNN model for seizure prediction with their corresponding accuracies and limitations.

AuthorsMachine-learning approachesFeature selection methodsDatasetPerformance metricsLimitationsAccuracy (%)
Bizopoulos et al. [56]SoftMax, standard networks2D and 3D phase space presents the intrinsic mode and functionsBONNOverall accuracyLow detection accuracy85.30
Antoniades et al. [57]LR, 2D-CNNTime-domainBONNOverall accuracy87.50
Park et al. [58]SoftMax, 2D-CNN2D, 3D phase space presents the intrinsic mode and functionsCHB-MIT, SNUH-HYU dataSpec, sens, time differenceLow sens, spec90.58
Sui et al. [55]SoftMax, 2D-CNNFTKaggleOverall accuracyHigh time complexity91.18
Turk and Ozerdem [59]Softmax, 2D-CNNFrequency-time domain, CWTFreiburgSpec, sens, acc, F-measureLow spec for multi-class93.60
Faust et al. [60]Softmax, 2D-CNNWavelet transformations (DWT)Bern-Barcelona dataEnergy, frequencyLow accuracy94.50
Tian et al. [61]MV-TSK-FS, 2D-CNNFFT, WPDCHB-MITOverall accuracy-95.33
Lecun et al. [62]Res-CNNConventional feature extraction methodBONNOverall acc-95.70
LeCun and Triesch [63]Softmax, 2D-CNNFeature extracts from CNNBern BarcelonaOverall accuracyHigh detection time95.90
San-Segundo et al. [64]SoftMax, 2D-CNNDWTCHB-MITClass accHigh training time96.10
Akut [65]Sigmoid, 2D-CNNFFT, WPDKaggleSpec, sensHigh training time96.15
1D-CNN architecture is also a suitable choice for processing brain activity signals. Because 1D-CNN architecture requires less number of parameters; therefore, its detection time is less than 2D-CNN architecture but have worst classifcation performance. Therefore, 1D-CNN and 2D-CNN are capable of the diagnosis of epileptic seizures. Figure 7 shows the seizure detection accuracies of the various kinds of literature-implemented 2D-CNN models [66-68].
Figure 7

Comparison of accuracies (%) versus authors introducing 2D-CNN models for seizure detection.

3.1.2. Recurrent Neural Networks (RNNs)

The sequential datasets, including videos, texts, and signals, have some characteristics, such as great length and variable, which is hard for a simple deep learning model to process [69]. RNNs model is widely used to overcome these challenges. RNNs are competitive models for processing biomedical signal data and receiving satisfactory results. The following section reviews RNN models commonly used in epileptic seizure detection with their corresponding accuracies. The LSTM model was introduced after the RNNs drawbacks, short-term memory, and vanishing gradient [70-72]. Various pieces of literature using LSTM in seizure detection are available. Golmohammadi et al. [70] presented a 2-layer LSTM model and SoftMax function to evaluate the data and achieved 90% accuracy. The research of [73] demonstrated a 3-layer LSTM architecture model for classification and got satisfactory results, while the literature of [74] evaluated two hybrid models, GRU and LSTM, with the activator function. One of the layers is fully connected with a sigmoid activator in this network. The studies in [71-74] used 10 different architectures of RNN with 31 layers and got the best accuracy (95%). Table 4 and Figure 8 present a review of recent works that adopted LSTM-RNN models to predict an epileptic seizure.
Table 4

A review of recent research that applied the LSTM-RNN model for seizure prediction with their corresponding accuracies.

AuthorsMachine learning approachesFeature selection methodsDatasetPerformance metricsLimitationsAccuracy (%)
Yao et al. [75]SoftMax, LSTMIndependent RNNCHB-MITSen, spec, PrecLow sens, prec88.80
Chen et al. [72]SoftMax, LSTMWavelet transformations (DWT)ZenodoPre, spec, classLow prec90.00
Chen et al. [72]SoftMax, LSTMWavelet transformations (DWT)BONNOverall accuracyHigh detection time91.82
Hussein et al. [76]SoftMax, LSTMTime domain, time-frequency domainFribourgSen, spec92.75
Jaafar and Mohammad [77]SoftMax, LSTMIndependent RNNFreiburg dataOverall accuracyHigh training time93.75
Talathi and Vartak [78]RNN, GRURNNsBONNClass accuracyHigh time complexity94.00
Ahmed-Aristizabal [79]SoftMax, LSTMComputer-based analytical approachesMater advanced epilepsy UnitOverall accuracy95.00
Yao et al. [80]SoftMax, LSTMIndependent RNNBern BarcelonaSen, spec, PrecHigh time complexity96.00
Hussein [81]SoftMax, LSTMFully connected (FC) RNNZenodoSen, specHigh training time96.00
Figure 8

Comparison of accuracies (%) versus authors introducing LSTM-RNNs models for seizure detection.

3.1.3. CNN_RNN

It is competent to use two models for more accurate diagnosis and prediction of epileptic seizures, such as CNN-RNN architecture. The structure of RNN helps process sequential data (time-series processing). In the literature of [82], they applied various preprocessing schemes and used a modified CNN-LSTM with 13 layers along with the sigmoid activation function in their last layers with 91% accuracy. Roy et al. [83] introduced a hybrid architecture CNN-RNN to achieve the best results. Their first experimental works consist of 1-D with a 7-layer hybrid model of CNN-GRU, and the second work has 3-D and CNN-GRU hybrid architecture. An extended study by Ravi Prakash et al. [84] implemented four deep learning architectures, and the accuracy of these experiments achieved 90.60%. Table 5 and Figure 9 presented hybrid architectures (CNN-RNN) applied in different literature on epileptic seizures and their corresponding accuracies.
Table 5

A review of recent research that applied the CNN-RNNs model for seizure prediction with their corresponding accuracies.

AuthorsML/DL approachesFeature selection methodsDatasetPerformance metricsLimitationsAccuracy (%)
Fang et al. [82]ST-GRU ConvNetsTime-domainCHB-MITLatencyLow accuracy77.30
Ravi Prakash et al. [84]Sigmoid, 1D-CNN-LSTMTime-domain featuresFribourgSen, specLow sens, spec83.05
Ravi Prakash et al. [84]CNN-RNN2D, 3D phase space presents the intrinsic mode and functionsMAEU dataOverall accuracy90.22
Ahmedt Aristizabal et al. [85]Sigmoid, 2D CNN-LSTMTime domain featuresTUH dataOverall accuracyHigh detection time92.50
Roy et al. [83]Sigmoid, 2D CNN-LSTMTime-frequency domain featureKaggleSen, specHigh training time93.00
Liang et al. [86]Softmax, 1D CNN-GRU2D, 3D phase space presents the intrinsic mode and functionsBern BarcelonaOverall accuracyHigh time complexity94.16
Choi et al. [87]ID-CNN biGRUFrequency domainCHB-MITSensitivityHigh training time94.40
Figure 9

Comparison of accuracies (%) versus authors introducing CNN-RNN models for seizure detection.

3.1.4. Autoencoders (AEs)

Autoencoder (AE) is an unsupervised machine learning model that presents different input parameters and works with the function (compression, decompression) coupled with a neural network [88, 89]. The pieces of literature [45, 46, 90, 91] used multilayer autoencoders (MAE) to hybridize EM-PCA methods to reduce the dimensions for classification. They also implemented a genetic algorithm (GA), and the experimental results indicated an accuracy of up to 92.78%. Sharathappriya et al. [92] used stacked denoising AE (SDAE), which consisted of three layers of architecture. Qiu et al. [93] also introduced denoising sparse AE (DSpAE) and reported 95% accuracy. The study in [94] consisted of automated EEG with a machine learning-based system. This system has several parts: the first part extracted linear predictive cepstral coefficients (LPCC) as signal features. After that, three paths were used for accurate detection. They proposed SpAE to extract the feature from EEG, and SVM was used for the classification. Sharma et al. [48] achieved average accuracy up to 93.92%. Table 6 presented AE in seizure detection and performance metrics, and an illustration of the authors of various literature with their accuracies was shown in Figure 10.
Table 6

A review of recent research that applied AE in seizure detection with their corresponding accuracies.

AuthorsMachine learning approachesFeature selection methodsDatasetPerformance metricsLimitationsAccuracy (%)
Gasparini et al. [95]SoftMax, SAETime-frequency, CWTReggio Calabria dataSen, specLow Sen, Spec, Acc86.50
Singh and Malhotra. [96]SoftMax, SAEAE and SEBONNSen, spec, accLow Sen, Spec, Acc88.80
Yuan et al. [97]SoftMax, SSpDAESAE, six featuresZenodoROC, PR, F-measure90.64
Yuan et al. [97]SoftMax, SpDAETime-frequencyCHB-MITF1-measure, Confusion MatrixLow detection acc90.82
Hosseini et al. [98]SoftMax, SpAEPCAZenodoPre sen, FPR FNRHigh FNR91.00
Karim et al. [99]SoftMax, SAEDWTBONNConfusion matrixLow prec91.00
Yuan et al. [100]SoftMax, SAEAE and SECHB-MITPre, sen, F-measure92.61
Sharathappriyaa et al. [92]SoftMax, AEHWPT, FDFribourgSen, specHigh time complexity92.67
Karim et al. [101]SoftMax, SpAEAE and SEFribourgConfusion MatrixHigh detection time93
Karim et al. [102]SoftMax, DSAEESD functionKaggleSen, spec94
Wang et al. [103]SoftMax, SSpDAEAE and NSPBONNSen, specPrec not mentioned95
Figure 10

Comparative study of accuracy (%) versus authors introducing AE model for seizure detection.

3.1.5. Conventional ML (ANN, SVM, KNN)

Based on their significant performances, SVM, ANN, and KNN have also been applied in various domains [73, 104], especially in processing brain signal datasets. Various relevant works listed here on seizure detection used different classifiers. Most of the research articles preferred hybrid models. Dorai and Ponnambalam [105] proposed a hybrid model using SVM and KNN to classify these EEG epochs into seizure and nonseizure types. Birjandtalab et al. [106] implemented a Gaussian mixture model (GMM) to diagnose epileptic seizure detection. They achieved satisfactory results of accuracy and an F-measure of 85.1%. This experimental work addressed the class imbalance issue in the given dataset. A detailed review of SVM, ANN, and KNN in seizure detection is shown in Table 7. The literature of [119] recommended ANN classifiers on the EEG brain activity dataset with time-frequency domain features. The implemented classifiers accurately classify the signals into “nonseizure” and “seizure” with 95% accuracy. They used the EEG dataset class combination from A to E. The proposed study by Satapathy et al. [120] applied two models, SVM and neural networks (“black-box” approaches), to the EEG dataset for seizure detection. The outcomes of the given models indicated that the SVM model was more efficient based on the accuracy and time complexity (sec) compared to other networks. Hassan and Subasi [122] used genetic algorithms (GA), SVM, and particle swarm optimization (PSO) to detect a seizure. This approach achieved the best accuracy up to 92.38%. Shoeb and Guttag [115] implemented SVM classifiers and vector features on the CHB-MIT dataset to predict seizures, achieving 93.38% accuracy. Amin et al. [33] also used four classifiers, namely Naïve Bayes, KNN, MLP, and SVM, for classification with the DWT method and relative features. Their experimental result showed 92% accuracy. Raghu et al. [117] introduced the hybrid KNN-SVM model that was implemented on raw EEG data for accurate classification of epileptic seizure detection, and the experimental result indicated an accuracy of up to 90%. Zabihi et al. [121] used an SVM classifier for specific accurate detection to process the dataset with frequency-domain and time-domain features and achieved 93.78% sensitivity and 96.05% specificity.
Table 7

Recently applied ML (SVM, ANN, and KNN) for seizure detection with their corresponding performances.

AuthorsMachine learning approachesFeature selection methodsDatasetPerformance metricsLimitationsAccuracy (%)
Logesparan et al. [107]SVM, ANNLine length featureCHB-MITROCLow accuracy52
Zeiler Fergus [108]QDA, DT, KNN, SVMTime-frequencyBONNSen, specLow sen, pres85
Birjandtalab et al. [106]ANNSpectral powerCHB-MITF-measureHigh detection high86
Chen et al. [109]SVMDWTBONNConfusion MatrixLow sen, pres86.83
Parvez and Paul [110]LS-SVMIMF, DCT-DWT, DCT, SVDFreiburgSpec, sen, AccLow sen, pres for binary classification91.36
Guo and DiPietro [111]K-NNGenetic programmingBONNClass AccLow accuracy93.50
Nicolaou and Georgiou [112]SVMPermutation entropyCHB-MITPre, Rec, F-measureLow prec and accuracy93.55
Ahmad et al. [113]SVMDWTCHB-MITAvg94.8
Zhang et al. [114]ELM, SVMAE and SEBCI LabClass accuracyHigh time complexity95.58
Shoeb and Guttag [115]SVMTime-frequencyCHB- MITSensitivity (sen)96
Chen et al. [116]Naïve Bayes, SVMEnergy, variance, entropy, RMSCHB-MITPre, Rec, F-measureLow pre96.55
Raghu et al. [117]RF, KNN, adaboostTime-frequencyBern-BarcelonaSen, pre, NPR, ROCNFR not mentioned97.6
Mursalin et al. [118]KNN, SVM, RF15-featuresBONNAcc, sen, spec98
Sharma et al. [119]LS-SVM2D, 3D phases, the intrinsic mode, and functionsBONNOverall Acc98.60
Amin et al. [33]Naïve bayes, SVM, KNN, MLPEnergyEPILEPSYClass Acc98.75
Satapathy et al. [120]Neural network, SVMCWT, DWTBONNOverall AccHigh detection time99.1
Zabihi et al. [121]SVMTime-frequencyCHB-MITSen, specHigh time complexity99.32
Hassan and Subasi [122]SVMDWTBONNClass Acc99.38
Fasil and Rajesh. [123]SVMEnergyBONN, BarcelonaClass Acc99.5
Chen et al. [116]LS-SVMEntropies typesBONNSpec, Acc, sen99.58
Selvakumari et al. [124]LS-SVMDWT, FFTClass AccBONNHigh time complexity100
Lahmiri and Shumel [125]KNN and GHEBONNClass Acc100
Kumar et al. [126]DWT based approximate entropy ANN, SVMDWT based approximate entropyCHB-MITOverall AccHigh time complexity100
Tzallas et al. [127]ANNTime-frequency featuresBONNPre, Rec, F-measure100
Lahmiri and Shmuel [125] successfully used the Hurst exponent (HE) to classify the recorded EEG dataset into nonseizure and seizure with up to 97% accuracy. Further study by Lahmiri and Shmuel [125] used SVM to accurately classify seizures with 100% accuracy but less time complexity (sec). Table 7 and Figures 11–13 showed the authors' accuracy of the three models (SVM, ANN, and KNN) in various pieces of literature.
Figure 11

Comparative study of accuracy (%) versus authors introducing the SVM model for seizure detection.

Figure 12

Comparative study of accuracy (%) versus authors introducing the ANN model for seizure detection.

Figure 13

Comparative study of accuracy (%) versus authors introducing the KNN model for seizure detection.

3.2. Nonblack-Box Classifiers in Seizure Detection

The issue of “black-box” classifiers is that it cannot identify human interpretation and classification procedures [128]. Therefore, there is less chance to retrieve sensible knowledge. Because of the limitation of knowledge retrievals, the researchers focus on “nonblack-box classifiers, including random forest and decision trees approach. The literature of [104, 129–132] examined the decision forest and decision tree, and they reported that decision forest classifiers were more effective than implementing a decision tree for its overfitting issues. An algorithm extracts the rules from training data using a decision tree that generates either a limited or a single set of logic rules (for example, whenever C2 entropy value is less than 101.01, class value = seizure) and stops growing the tree by adding more records to the training dataset once the rule is accepted by the algorithm [127]. Besides, the decision forest grows multiple decision trees on the training data with higher accuracy and sensible logic rules. Chen et al. [133] applied a decision tree on the EEG dataset to successfully classify seizures and reported 98.62% accuracy. Decision forest classifiers in [32, 134, 135] were used as ensemble methods for seizure detection, providing remarkable accuracy and creating additional logic rules with decision trees using the training data [120]. Siddiqui and Islam [136, 137] used the hybrid approaches of systematic forest (SySF) and continuously excluding root node (CERN) without epoch reduction to diagnose seizure detection. Another study [116] implemented decision forests with 9 statistical features with the epoch concept. The training dataset was divided into subdatasets, such as (d, d1,…, dn), and the accuracy was tested on each epoch. The limitation of this literature was that a single patient's dataset had been taken. The dataset could be taken from many patients to achieve the best results. Overall, a systematical review of recent studies and their performance of RF were presented in Figure 14 and Table 8. Because of the nonblack nature and advantages (accuracy, logic rules) [36, 139, 141], several researchers implemented a random forest classifier to diagnose seizure detection. Donos et al. [139] introduced a decision forest classifier on statistical features (frequency and time domains) extracted from the EEG dataset and reported that the system presented sensitivity up to 93.8%. Hosseini et al. [141] used the RF with grid search optimization (RF-GSO) approach and achieved an accuracy of 96.7%.
Figure 14

Comparative study of accuracy (%) versus authors introducing RF model for seizure detection.

Table 8

A review of recent research applied random forest in seizure detection with their corresponding accuracies.

AuthorsMachine learning approachesFeature selection methodsDatasetPerformance metricsLimitationsAccuracy (%)
Birjandtalab et al. [138]Random forest-KNNSpectral powerCHB-MITSen, F-measure, prec,Low sens, spec80.87
Donos et al. [139]Random forestTime, frequencyEPILEPSYSensitivitySpec not mentioned93.8
Siddiqui et al. [140]Random forest, boosting, decision forestNine statistical featuresBern BarcelonaPre, Rec, F-measureHigh time complexity96.67
Wang et al. [137]Random forest classifiersStd, dev, energy, energy,STFT, meanBONNClass AccLow sens, spec for multi-class96.7
Lee and Kim [35]Random forest, SVMFrequency, 10-timeUCIROC-AUC98
Sharma et al. [119]Random forestIMFKaggleSen, spec, AccSen, spec not mentioned98.4
Mursalin et al. [118]Random forestDWT, entropyFribourgClass Acc98.45
Mursalin et al. [118]Random forestDWT, entropyZenodoClass AccSen, Spec not mentioned98.45
Alickovic et al. [46]ANN, random forest, SVM, KNNPower, mean, kurtosis, absolute mean std dev, skewnessCHB-MITSen, spec, AccTime complexity100
Wang et al. [137]Forest CERN9-statistical featuresBONN, CHB-MITClass Acc100
Hosseini et al. [141]Random forest classifiersL1-penalized robust regressionBONN, CHB-MITClass Acc100

4. Observed Challenges from Surveyed Literature

Based on the comprehensive survey of existing related literature reviewed, it was observed that the various challenges in diagnosing epileptic seizures could be summarized as follows: The first challenge is that large epileptic seizure datasets are currently not available publicly for extensive validation of the proposed machine learning/DL-based models for epilepsy detection and classification. Many datasets only include specified chunks of EEG signals, which is insufficient for real-world applications, where detection must be done from real-time signals. Because a large amount of dataset is required for the proper validation of a machine learning model for epileptic seizure detection and classification, plenty of efforts have been made to combine available EEG datasets for this purpose. However, it is still difficult to combine these datasets because they have different parameters and were acquired under relatively different sampling conditions [142]. Because machine/deep learning models mostly require substantial computational resources for their implementation in practical settings, which are sometimes difficult to access, a piece of good knowledge about how to optimize the models' performance is necessary for realizing a practical epileptic seizure detection and classification system. For some researchers working in epileptic seizure detection and prediction, especially those in low to medium-income countries, accessing high-performance hardware resources to implement deep learning models is often a key challenge. Although Google has made powerful computing servers accessible (Google Colab platform and so on), there are still limitations regarding the amount of data transferred to such servers and the length of time it takes for the servers to execute the tasks.

5. Discussion

In this study, we have investigated the use of different machine/deep learning-based algorithms for epileptic seizure detection. For instance, the algorithms considered include the conventional ML (ANN, SVM, and KNN), advanced DL (CNN/RNN/LSTM), and the random forest (RF)-based ML because of their remarkable performances in epileptic seizure detection, as reported in previous studies. A summary of the investigation results reported in recent literature are as follows: This systemic survey indicates that conventional ML algorithms (ANN, SVM, KNN) contribute well to the processing of brain datasets (CHB-MIT, BONN, Kaggle, Fribourg, and Bern Barcelona) for seizure detection [106-120]. However, each method has some pros and cons. For instance, SVM is found to be efficient for binary classification. It has better detection accuracy than ANN and KNN, however, it has high computation time complexity (sec), mainly compared to KNN and ANN. In contrast, KNN has low-performance evaluation metrics (precision, recall, and F1-score), including low detection complexity, however, they can handle high dimensional datasets [111, 118, 125]. While introducing a hybrid classification scheme that involves a combination of machine learning models (SVM-KNN or SVM-ANN), an increase in detection accuracy, precision, recall, and F1-score can be achieved compared to using a single ML model [33, 108, 118, 126]. Even though hybrid models could achieve better prediction accuracy than single models, they are more computationally efficient than their single model counterparts, further limiting their implementation in practical applications [104, 132]. Additionally, a major challenge with conventional ML algorithms is that it is difficult to understand the logical procedure followed to arrive at their prediction outcomes and is largely unexplainable for patterns and the logic rules hidden inside the models (the blackbox concept). Thus, they are not recommended for extracting useful information from datasets. On the other hand, advanced ML/DL (CNN/RNN/LSTM) aid the automatic extraction of high-dimensional features, which may not be easily achieved with conventional ML schemes. For instance, the RNN model is normally faster than CNN and LSTM in execution time but has relatively lower accuracy, precision, and recall. In contrast, LSTM has time complexity issues using CHB-MIT and BONN and other datasets for seizure detection [72, 77, 80]. Besides, the hybrid models (a combination of two or more DL models) were found to perform better in accurately classifying seizures at the expense of more computation time. When considering time complexity, accuracy, precision, and recall issues with the conventional ML and advanced ML (DL) algorithms, decision tree-driven schemes, such as random forest classifiers, may be good. It is partly because of their ensemble nature and multiple logic rules [127]. They can achieve fairly good classification results as shown in the previous sections [134-142]. Decision tree-based models can handle a relatively large number of datasets and are less time-consuming and mostly yield high accuracy, precision, and recall. From adopting the conventional ML models for epileptic seizure detection, feature extract constitutes an essential component of the entire scheme. Hence, it is important to select proper feature extraction methods for characterizing the EEG signals. Recent studies that investigated and analyzed a range of features had indicated that the time-domain feature extraction methods with 9-statistical features (standard deviation, kurtosis, skewness, energy, line length, entropy, mean, mode, and Hurst) would be appropriate for epileptic seizure detection [126]. It is because the mentioned features have been reported to achieve average accuracies in the range of 98–100% when used with ML/DL models for epileptic seizure classification based on EEG signals. Furthermore, it is significant to select a smaller subset of useful features by adopting a selection technique to reduce the model's complexity. It leads to the survey of various feature selection methods adopted mainly for dimensionality reduction. The investigation study showed that Kernel principal component analysis (KPCA) was a suitable nonlinear reduction technique for feature selection. KPCA offers the following major benefits over other feature selection methods: Nonlinear data is successfully handled. No nonlinear optimization is required. KPCA calculations are very easy and are similar to conventional PCA calculations. The number of PCs does not need to be set before modeling [143]. KPCA is a suitable encoding method for data with a nonlinear manifold structure. It is widely used in various datasets, including applied health data, sensor data, and facial pictures.

6. Conclusion

A comprehensive review of efficient machine/deep learning models and feature extraction and selection methods has been performed in this research. This study focused on the conventional ML (ANN/SVM/KNN), advanced ML/DL (CNN/RNN/LSTM), and tree-base ML (RF) because of their remarkable performance in the application of epileptic seizure detection. This paper concluded that decision forest classifiers are the most suitable, effective, and recommended for future research in epilepsy seizure detection. Its nonblack-box nature produces explainable logic rules, multiple sensible knowledge (adequate detection), high accuracy, low detection complexity, high precision, and recall, reveals relevant information (seizure localization), and can handle high volumes of datasets. At the same time, blackbox classifiers, such as conventional ML (ANN SVM KNN) and advanced ML/DL (CNN/RNN/LSTM), cannot create logic rules, including high detection accuracy but have high time complexity. Furthermore, according to the literature review, as for the selection of appropriate features and feature extraction method, we selected the time-domain features extraction method and 9-statistical features (standard deviation, kurtosis, skewness, energy, line length, entropy, mean, mode, and Hurst) because these features provided higher accuracy (%). At the same time, Kernel principal component analysis (KPCA) is a suitable nonlinear polynomial-based method for feature selection. Future research will further study machine learning issues regarding epileptic seizure detection with suitable features.
  56 in total

1.  Feature extraction and classification for EEG signals using wavelet transform and machine learning techniques.

Authors:  Hafeez Ullah Amin; Aamir Saeed Malik; Rana Fayyaz Ahmad; Nasreen Badruddin; Nidal Kamel; Muhammad Hussain; Weng-Tink Chooi
Journal:  Australas Phys Eng Sci Med       Date:  2015-02-04       Impact factor: 1.430

2.  Indications of nonlinear deterministic and finite-dimensional structures in time series of brain electrical activity: dependence on recording region and brain state.

Authors:  R G Andrzejak; K Lehnertz; F Mormann; C Rieke; P David; C E Elger
Journal:  Phys Rev E Stat Nonlin Soft Matter Phys       Date:  2001-11-20

3.  Nonrandomness, nonlinear dependence, and nonstationarity of electroencephalographic recordings from epilepsy patients.

Authors:  Ralph G Andrzejak; Kaspar Schindler; Christian Rummel
Journal:  Phys Rev E Stat Nonlin Soft Matter Phys       Date:  2012-10-12

4.  Cross-Subject Seizure Detection in EEGs Using Deep Transfer Learning.

Authors:  Baocan Zhang; Wennan Wang; Yutian Xiao; Shixiao Xiao; Shuaichen Chen; Sirui Chen; Gaowei Xu; Wenliang Che
Journal:  Comput Math Methods Med       Date:  2020-05-08       Impact factor: 2.238

5.  Detection Analysis of Epileptic EEG Using a Novel Random Forest Model Combined With Grid Search Optimization.

Authors:  Xiashuang Wang; Guanghong Gong; Ni Li; Shi Qiu
Journal:  Front Hum Neurosci       Date:  2019-02-21       Impact factor: 3.169

Review 6.  Imaging the Functional Neuroanatomy of Parkinson's Disease: Clinical Applications and Future Directions.

Authors:  Fulvio Lauretani; Yari Longobucco; Giulia Ravazzoni; Elena Gallini; Marco Salvi; Marcello Maggio
Journal:  Int J Environ Res Public Health       Date:  2021-02-28       Impact factor: 3.390

7.  Automatic seizure detection based on time-frequency analysis and artificial neural networks.

Authors:  A T Tzallas; M G Tsipouras; D I Fotiadis
Journal:  Comput Intell Neurosci       Date:  2007

Review 8.  Methods of EEG signal features extraction using linear analysis in frequency and time-frequency domains.

Authors:  Amjed S Al-Fahoum; Ausilah A Al-Fraihat
Journal:  ISRN Neurosci       Date:  2014-02-13

9.  Identifying lesions in paediatric epilepsy using morphometric and textural analysis of magnetic resonance images.

Authors:  Sancgeetha Kulaseharan; Azad Aminpour; Mehran Ebrahimi; Elysa Widjaja
Journal:  Neuroimage Clin       Date:  2019-01-04       Impact factor: 4.881

10.  A Deep Learning-Based Classification Method for Different Frequency EEG Data.

Authors:  Tingxi Wen; Yu Du; Ting Pan; Chuanbo Huang; Zhongnan Zhang
Journal:  Comput Math Methods Med       Date:  2021-10-21       Impact factor: 2.238

View more
  4 in total

1.  A Novel Decentralized Blockchain Architecture for the Preservation of Privacy and Data Security against Cyberattacks in Healthcare.

Authors:  Ajitesh Kumar; Akhilesh Kumar Singh; Ijaz Ahmad; Pradeep Kumar Singh; Pawan Kumar Verma; Khalid A Alissa; Mohit Bajaj; Ateeq Ur Rehman; Elsayed Tag-Eldin
Journal:  Sensors (Basel)       Date:  2022-08-08       Impact factor: 3.847

2.  A Robust End-to-End Deep Learning-Based Approach for Effective and Reliable BTD Using MR Images.

Authors:  Naeem Ullah; Mohammad Sohail Khan; Javed Ali Khan; Ahyoung Choi; Muhammad Shahid Anwar
Journal:  Sensors (Basel)       Date:  2022-10-06       Impact factor: 3.847

3.  A Classification Model of EEG Signals Based on RNN-LSTM for Diagnosing Focal and Generalized Epilepsy.

Authors:  Tahereh Najafi; Rosmina Jaafar; Rabani Remli; Wan Asyraf Wan Zaidi
Journal:  Sensors (Basel)       Date:  2022-09-25       Impact factor: 3.847

4.  Development and Validation of Embedded Device for Electrocardiogram Arrhythmia Empowered with Transfer Learning.

Authors:  Rizwana Naz Asif; Sagheer Abbas; Muhammad Adnan Khan; Kiran Sultan; Maqsood Mahmud; Amir Mosavi
Journal:  Comput Intell Neurosci       Date:  2022-10-07
  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.