| Literature DB >> 32425753 |
Jacob M Williams1, Ashok Samal1, Prahalada K Rao2, Matthew R Johnson3.
Abstract
Many recent developments in machine learning have come from the field of "deep learning," or the use of advanced neural network architectures and techniques. While these methods have produced state-of-the-art results and dominated research focus in many fields, such as image classification and natural language processing, they have not gained as much ground over standard multivariate pattern analysis (MVPA) techniques in the classification of electroencephalography (EEG) or other human neuroscience datasets. The high dimensionality and large amounts of noise present in EEG data, coupled with the relatively low number of examples (trials) that can be reasonably obtained from a sample of human subjects, lead to difficulty training deep learning models. Even when a model successfully converges in training, significant overfitting can occur despite the presence of regularization techniques. To help alleviate these problems, we present a new method of "paired trial classification" that involves classifying pairs of EEG recordings as coming from the same class or different classes. This allows us to drastically increase the number of training examples, in a manner akin to but distinct from traditional data augmentation approaches, through the combinatorics of pairing trials. Moreover, paired trial classification still allows us to determine the true class of a novel example (trial) via a "dictionary" approach: compare the novel example to a group of known examples from each class, and determine the final class via summing the same/different decision values within each class. Since individual trials are noisy, this approach can be further improved by comparing a novel individual example with a "dictionary" in which each entry is an average of several examples (trials). Even further improvements can be realized in situations where multiple samples from a single unknown class can be averaged, thus permitting averaged signals to be compared with averaged signals.Entities:
Keywords: EEG; MVPA; cognitive neuroscience; deep learning; machine learning
Year: 2020 PMID: 32425753 PMCID: PMC7203477 DOI: 10.3389/fnins.2020.00417
Source DB: PubMed Journal: Front Neurosci ISSN: 1662-453X Impact factor: 4.677
Figure 1Cognitive task and sample EEG data. (A) Participants viewed pairs of one of three categories of images at the beginning of each trial of the cognitive task, with blank-screen fixation intervals before and after. Other task components followed the presentation of the images, but those elements of the task are not presented or analyzed here. (B) Single representative trial of EEG data after pre-processing and downsampling. Electrode labels are according to the standard 10–20 and 10–10 systems for EEG electrode placement.
Figure 2PTC neural network architecture. The input is two 31 × 37 EEG signals, stacked together in a 2 × 31 × 37 3D array. Batch normalization is immediately applied to each signal, and the output of this is both passed through the 12 filter banks in the Convolution2D layer of Block 1 and passed directly to the Batch Normalization block. Thus, the Batch Normalization layer of Block 1 outputs 12 + 2 = 14 images of size 31 × 37. This process is repeated for a total of four blocks. No concatenation is performed in the dimensionality reduction block, and the 3 × 31 × 37 feature map is flattened and passed to the final dense layers before classification. All convolutional and dense layers use the Leaky ReLU activation function unless otherwise specified.
Baseline accuracy (percent, with chance = 33.33%).
| Single | 63.67 | 59.51 | |
| Averaged | 81.54 | 82.52 |
Bold values indicate highest accuracy in each row.
PTC accuracy summary (percent).
| Single-to-Single | 56.03 | 49.21 |
| Single-to-Average | 71.25 | 61.53 |
| Average-to-Average | 86.15 | 83.32 |
| (Chance) | 50.00 | 33.33 |
Confusion matrices for PTC analyses (percent).
| Same | Different | Same | Different | Same | Different | ||
| Same | 53.71 | 46.29 | 60.96 | 39.04 | 79.39 | 20.61 | |
| Different | 42.74 | 57.26 | 23.50 | 76.50 | 10.47 | 89.53 | |
Figure 3Confusion matrices by individual stimulus categories for all classifiers. Values presented as proportions rather than percentages for readability.