| Literature DB >> 30105544 |
Petr Nejedly1,2,3, Jan Cimbalnik4, Petr Klimes4,5, Filip Plesinger5, Josef Halamek5, Vaclav Kremen6,7, Ivo Viscor5, Benjamin H Brinkmann6,7, Martin Pail8, Milan Brazdil8,9, Gregory Worrell6,7, Pavel Jurak5.
Abstract
Manual and semi-automatic identification of artifacts and unwanted physiological signals in large intracerebral electroencephalographic (iEEG) recordings is time consuming and inaccurate. To date, unsupervised methods to accurately detect iEEG artifacts are not available. This study introduces a novel machine-learning approach for detection of artifacts in iEEG signals in clinically controlled conditions using convolutional neural networks (CNN) and benchmarks the method's performance against expert annotations. The method was trained and tested on data obtained from St Anne's University Hospital (Brno, Czech Republic) and validated on data from Mayo Clinic (Rochester, Minnesota, U.S.A). We show that the proposed technique can be used as a generalized model for iEEG artifact detection. Moreover, a transfer learning process might be used for retraining of the generalized version to form a data-specific model. The generalized model can be efficiently retrained for use with different EEG acquisition systems and noise environments. The generalized and specialized model F1 scores on the testing dataset were 0.81 and 0.96, respectively. The CNN model provides faster, more objective, and more reproducible iEEG artifact detection compared to manual approaches.Entities:
Keywords: Artifact probability matrix (APM); Convolutional neural networks (CNN); Intracranial EEG (iEEG); Noise detection
Mesh:
Year: 2019 PMID: 30105544 PMCID: PMC6459786 DOI: 10.1007/s12021-018-9397-6
Source DB: PubMed Journal: Neuroinformatics ISSN: 1539-2791
Fig. 1Example of pathological activity seen in epileptogenic brain, and characterized as a sharp wave transient. The blow up of the pathological transient shows a high frequency oscillation (HFO) riding on spike peak
Fig. 2Example of physiological iEEG in channels B1-B2, compared to muscle
Number of 3-s length segments from St. Anne’s University Hospital and Mayo Clinic in each class based on manual scoring by experts
| St. Anne’s University Hospital | Mayo Clinic | |
|---|---|---|
| Classification category | Segments in category | Segments in category |
| Physiological iEEG | 66,581 | 44,259 |
| Pathological iEEG | 18,184 | 6099 |
| Noise and muscle activity | 13,777 | 25,389 |
| Power line noise (50hz/60hz) | 13,825 | 22,420 |
| Total | 112,367 | 98,167 |
Fig. 4Flowchart illustrating training for generalized model. A- St. Anne’s University Hospital (FNUSA) dataset (Table 1), B- Mayo clinic dataset (Table 1), C- training of generalized model, D- generalized model results for out of institution/out of patient testing Mayo Clinic dataset (Table 3)
Fig. 3Flowchart of the CNN system. As an input of the CNN a z-score of raw data for each 3-s epoch was used as well as envelograms in five different frequency bands. Drop out layers, Batch Normalization Layers and L2 regularization is used to control for over-training. The Artefacts Probability Matrix (APM) was generated from a resulting image
Classification results of the generalized iEEG model (trained on St Anne’s University Hospital training dataset). Testing was performed on Mayo Clinic testing dataset
| Classification Category | Recall | PPV | F1 |
|---|---|---|---|
| Noise and muscle activity | 0.91 | 0.88 | 0.89 |
| Physiological iEEG | 0.87 | 0.93 | 0.90 |
| Pathological iEEG | 0.74 | 0.56 | 0.64 |
| Average | 0.863 | 0.804 | 0.81 |
Fig. 5Flowchart illustrating training for specialized model.A- Mayo clinic dataset (Table 1), B- retraining of generalized model, C- generalized model results for out of sample testing Mayo Clinic dataset (Table 5)
Classification results of the specialized Mayo Clinic data model (generalized model retrained by small Mayo Clinic training dataset). Testing was performed on Mayo Clinic testing dataset
| Classification Category | Recall | PPV | F1 |
|---|---|---|---|
| Power line noise (60hz) | 0.96 | 0.99 | 0.98 |
| Noise and muscle activity | 0.98 | 0.96 | 0.97 |
| Physiological iEEG | 0.97 | 0.97 | 0.97 |
| Pathological iEEG | 0.91 | 0.90 | 0.90 |
| Average | 0.96 | 0.95 | 0.96 |
A confusion matrix of the generalized iEEG model (trained on St Anne’s University Hospital training dataset). Tensting was performed on complete Mayo Clinic dataset
| Automated Classification | |||||
|---|---|---|---|---|---|
| Gold Standard | Noise and muscle activity | Physiological iEEG | Pathological iEEG | Total | |
| Noise and muscle activity | 23,253 | 1241 | 895 | 25,389 | |
| Physiological iEEG | 3073 | 38,647 | 2539 | 44,259 | |
| Pathological iEEG | 20 | 1549 | 4530 | 6099 | |
| Total | 26,346 | 41,437 | 7964 | ||
A confusion matrix of the specialized Mayo Clinic data model (generalized model retrained by small Mayo Clinic training dataset). Testing was performed on Mayo Clinic testing dataset
| Automated Classification | ||||||
|---|---|---|---|---|---|---|
| Gold Standard | Power line noise (60hz) | Noise and muscle activity | Physiological iEEG | Pathological iEEG | Total | |
| Power line noise (60hz) | 19,949 | 171 | 487 | 19 | 20,626 | |
| Noise and muscle activity | 44 | 23,084 | 183 | 47 | 23,358 | |
| Physiological iEEG | 87 | 531 | 39,625 | 475 | 40,718 | |
| Pathological iEEG | 0 | 29 | 454 | 5128 | 5611 | |
| Total | 20,080 | 23,815 | 40,749 | 5669 | ||
Fig. 6Example of binarized Artifacts Probability Matrix (APM). Yellow stripes indicate artifacts with a probability higher than 95%. Y-axis shows iEEG channels and X-axis shows time in seconds
Fig. 7Example of binarized pathology probability matrix (PPM). Yellow stripes indicate patology segments with a probability higher than 95%.Y-axis shows iEEG channels and X-axis shows time in seconds