| Literature DB >> 30526571 |
Xiaoyan Wei1, Lin Zhou2, Ziyi Chen3, Liangjun Zhang1, Yi Zhou4.
Abstract
BACKGROUND: Automated seizure detection from clinical EEG data can reduce the diagnosis time and facilitate targeting treatment for epileptic patients. However, current detection approaches mainly rely on limited features manually designed by domain experts, which are inflexible for the detection of a variety of patterns in a large amount of patients' EEG data. Moreover, conventional machine learning algorithms for seizure detection cannot accommodate multi-channel Electroencephalogram (EEG) data effectively, which contains both temporal and spatial information. Recently, deep learning technology has been widely applied to perform image processing tasks, which could learns useful features from data and process multi-channel data automatically. To provide an effective system for automatic seizure detection, we proposed a new three-dimensional (3D) convolutional neural network (CNN) structure, whose inputs are multi-channel EEG signals.Entities:
Keywords: Convolutional neural network; Epilepsy; Multi-channel; Seizure detection; Three-dimensional
Mesh:
Year: 2018 PMID: 30526571 PMCID: PMC6284363 DOI: 10.1186/s12911-018-0693-8
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
The details of collected data
| ID | Sex | Age | Channels | State | Time | Seizure | IT |
|---|---|---|---|---|---|---|---|
| 1 | F | 36 | 22 | AS→SS | 8 h | 14 | 654 s |
| 2 | F | 22 | 22 | AS→SS | 48 h | 12 | 274 s |
| 3 | F | 36 | 22 | AS→SS | 8 h | 14 | 1386s |
| 4 | F | 40 | 22 | AS→SS | 24 h | 6 | 302 s |
| 5 | M | 6 | 22 | AS→SS | 24 h | 21 | 453 s |
| 6 | F | 16 | 22 | AS→SS | 24 h | 7 | 329 s |
| 7 | F | 16 | 22 | AS→SS | 24 h | 8 | 254 s |
| 8 | F | 28 | 22 | AS→SS | 24 h | 5 | 400 s |
| 9 | F | 31 | 22 | AS→SS | 24 h | 9 | 423 s |
| 10 | M | 51 | 22 | AS→SS | 24 h | 30 | 1064s |
| 11 | M | 20 | 22 | AS→SS | 24 h | 19 | 4072 s |
| 12 | M | 46 | 22 | AS→SS | 24 h | 6 | 208 s |
| 13 | F | 15 | 22 | AS→SS | 24 h | 8 | 137 s |
Notes: AS Awake stage, SS Sleep stage, IT Ictal time
Fig. 1The single-channel EEG recordings illustrating typical brain states. The typical brain states of epilepsy patients include pre-ictal, ictal and inter-ictal three states. An hour segment before each seizure was defined as a pre-ictal state. Neurophysiology experts annotated ictal state. EEG data of the signal that were neither pre-ictal nor ictal defined as inter-ictal states. The figure represents the whole process of brain electrical signal seizure
Fig. 2Overview of the pipeline used for seizure detection using 3D CNN
Fig. 32D and 3D image reconstruction for multi-channel EEG. a 2D image reconstruction on a multi-channel time series results in an image in 2D (multiple frames as multiple channels). b 3D image reconstruction on multi-channel time series results in 3D image volume, preserving temporal information of the input signal. The z-axis is the channel number, x is the size of the time window, y is the value of the signal
The parameters of the 3D CNN
| Layer | Hidden Layer | Related parameters (kernel, kernel size, stride, dropout) | ||
|---|---|---|---|---|
| 1 | Conv3D + LeakyReLU | 64 | 3*3*3 | 1*1*1 |
| 2 | Max Pooling | 2*2*2 | 2*2*2 | |
| 3 | Conv3D + LeakyReLU | 128 | 3*3*3 | 1*1*1 |
| 4 | Max Pooling | 2*2*2 | 1*2*2 | |
| 5 | Conv3D + LeakyReLU | 256 | 3*3*3 | 1*1*1 |
| 6 | Conv3D + LeakyReLU | 256 | 3*3*3 | 1*1*1 |
| 7 | Max Pooling | 2*2*2 | 1*2*2 | |
| 8 | Fully connected | 4096 | ||
| 9 | Fully connected | 2048 | ||
| Softmax | ||||
Fig. 4The architecture of 3D CNN. 3D CNN network has 4 convolution layers, 3 max-pooling layers, and 2 fully connected layers, followed by a softmax output layer. All conv3D kernels are 3*3*3 with stride 1 in both three dimensions; all pooling layer kernels are 2*2*2. The first fully connected layer has 4096 output units and the second fully connected layer has 2048 output units
Comparison between batch normalization and group normalization
| Method | Batch size = 10 | ||
|---|---|---|---|
| Epoch = 1 | Epoch = 50 | Epoch = 200 | |
| BN | 74% | 84% |
|
| GN | 79% | 87% |
|
BN Batch normalization, GN Group normalization. Bold number represents the largest number is that column
The details of 2D CNN structure
| Layer | Hidden Layer | Related parameters (kernel, kernel size, stride, dropout) | ||
|---|---|---|---|---|
| 1 | Conv2D + LeakyReLU+BN | 32 | 5*5 | 1 |
| 2 | Max Pooling | 3*3 | 2 | |
| 3 | Conv2D + LeakyReLU+BN | 64 | 3*3 | 1 |
| 4 | Max Pooling | 2*2 | 2 | |
| 5 | Conv2D + LeakyReLU+BN | 128 | 3*3 | 1 |
| 6 | Max Pooling | 2*2 | 2 | |
| 7 | Conv2D + LeakyReLU+BN | 256 | 3*3 | 1 |
| 8 | Max Pooling | 2*2 | 2 | |
| 9 | Conv2D + LeakyReLU+BN | 256 | 3*3 | 1 |
| 10 | Max Pooling | 2*2 | 2 | |
| 11 | Fully connected | 2048 | ||
| Dropout | 0.5 | |||
| 12 | Fully connected | 1024 | ||
| Dropout | 0.5 | |||
| Softmax | ||||
Fig. 5The architecture of 2D CNN. 2D CNN network has 5 convolutions, 5 max-pooling and 2 fully connected layers with a dropout rate of 0.5, followed by a softmax output layer. Conv2D kernels are 3*3 with stride 1or 5*5 with stride1; pooling layer kernels are 2*2 with stride 2 or 3*3 with stride 2. The first fully connected layer has 2048 output units and the second fully connected layer has 1024 output units
Obfuscation matrix of prediction results and actual results
| Prediction | Total | |||
|---|---|---|---|---|
| Object | Non-object | |||
| Actual | Object | True Postive(TP) | False Postive(FP) | TP + FP |
| Non-object | False Negtive(FN) | True Negtive(TN) | FN + TN | |
| Total | TP + FN | FP + TN | TP + FP + FN + TN | |
Each row of the matrix represents the instances in a predicted class while each column represents the instances in an actual class
Classification result based on 2DCNN model using single and multi-channel
| Prediction | Accuracy | Specificity | Sensitivity | ||||
|---|---|---|---|---|---|---|---|
| Inter-ictal | Pre-ictal | ictal | |||||
| Single channel | Inter-ictal | 813 | 124 | 63 | 87.53% | 90.65% | 81.30% |
| Pre-ictal | 92 | 864 | 44 | 90.20% | 92.1% | 86.40% | |
| Ictal | 95 | 34 | 871 | 92.13% | 94.65% | 87.10% | |
| Multi channel | Inter-ictal | 822 | 119 | 59 | 88.13% | 91.10% | 82.20% |
| Pre-ictal | 107 | 838 | 55 | 89.20% | 91.90% | 83.80% | |
| Ictal | 71 | 43 | 886 | 92.40% | 94.30% | 88.60% | |
Classification results based on 2D and 3DCNN model using multi-electrode
| Prediction | Accuracy | Specificity | Sensitivity | ||||
|---|---|---|---|---|---|---|---|
| Inter-ictal | Pre-ictal | ictal | |||||
| 3D CNN | Inter-ictal | 861 | 81 | 58 | 90.73% | 93.05% | 86.10% |
| Pre-ictal | 77 | 894 | 29 | 92.57% | 94.15% | 89.40% | |
| Ictal | 62 | 36 | 902 | 93.83% | 94.15% | 90.20% | |
| 2D CNN | Inter-ictal | 822 | 119 | 59 | 88.13% | 91.10% | 82.20% |
| Pre-ictal | 107 | 838 | 55 | 89.20% | 91.90% | 83.80% | |
| Ictal | 71 | 43 | 886 | 92.40% | 94.30% | 88.60% | |
Performance comparison of different methods
| Method | Mean accuracy | Running time |
|---|---|---|
| ApEn+DWT + SVM [ | 91.25% | 85.1 s |
| 2D CNN | 89.91% | 3.8 s |
| 3D CNN |
|
|
SVM Support vector machines, ApEn Approximate entropy, DWT Discrete wavelets transform. The bold number denotes the largest number in that column