| Literature DB >> 35884656 |
Yuji Kawai1, Kazuki Tachikawa2, Jihoon Park1,3, Minoru Asada1,3,4,5.
Abstract
The integrated gradients (IG) method is widely used to evaluate the extent to which each input feature contributes to the classification using a deep learning model because it theoretically satisfies the desired properties to fairly attribute the contributions to the classification. However, this approach requires an appropriate baseline to do so. In this study, we propose a compensated IG method that does not require a baseline, which compensates the contributions calculated using the IG method at an arbitrary baseline by using an example of the Shapley sampling value. We prove that the proposed approach can compute the contributions to the classification results reliably if the processes of each input feature in a classifier are independent of one another and the parameterization of each process is identical, as in shared weights in convolutional neural networks. Using three datasets on electroencephalogram recordings, we experimentally demonstrate that the contributions obtained by the proposed compensated IG method are more reliable than those obtained using the original IG method and that its computational complexity is much lower than that of the Shapley sampling method.Entities:
Keywords: EEG signal classification; Shapley sampling; deep learning; explainability; integrated gradients
Year: 2022 PMID: 35884656 PMCID: PMC9313049 DOI: 10.3390/brainsci12070849
Source DB: PubMed Journal: Brain Sci ISSN: 2076-3425
Figure 1Diagram of the proposed method. (A) Preprocessing to compute the compensation amount (orange dashed arrow), given as the difference between the contribution to the classification obtained using the IG method with an arbitrary user-defined baseline (blue arrow) and the contribution to the classification obtained using the SS method that implicitly exhibits an appropriate baseline (green arrow). (B) Proposed IG method that compensates for the divergence from the appropriate baseline. This is equivalent to the path integral along the orange path.
Figure 2Model architecture for EEG signal classification using the CNN with constraints. Time-series signals measured by electrode for are fed into temporal convolution layers for . The input features are not spatially convolved owing to the independent constraint. The processing consists of the same parameterization owing to the identical constraint. If a classifier satisfies these constraints, the compensated IG can reliably compute the spatial contributions of to the outputs.
Spearman’s correlation coefficient compared to contributions obtained using the Shapley sampling. The temporal and spatiotemporal CNNs, respectively, correspond to one- and two-dimensional convolutional CNNs.
| Temporal CNNs | Spatiotemporal CNNs | ||||||
|---|---|---|---|---|---|---|---|
| Dataset | Class | C-IG (Proposed) | SS | IG | C-IG | SS | IG |
| PhysioNet | N1 |
| 0.970 | 0.180 | |||
| N2 |
| 0.953 | 0.655 | ||||
| N3 |
|
| 0.925 | ||||
| R | 0.963 |
| 0.665 | ||||
| W |
| 0.972 | 0.326 | ||||
| CHB-MIT | Szr. |
| 0.994 | 0.817 | 0.806 |
| 0.695 |
| No Szr. |
| 0.990 | 0.293 | 0.917 |
| −0.037 | |
| UCI EEG | Alc. |
| 0.991 | 0.260 | 0.793 |
| 0.323 |
| Ctr. |
| 0.992 | 0.258 | 0.739 |
| 0.331 | |
C-IG, compensated integrated gradients (proposed method); SS, Shapley sampling; IG, integrated gradients with the zero-input baseline; N1–3, non-rapid eye movement 1–3; R, rapid eye movement;W, wakefulness; Szr., seizure; No Szr., non-seizure; Alc., alcoholism; Ctr., control. The numbers in bold indicate the best performance of the C-IG, SS and IG methods.
Computational complexity of explanation methods in the UCI EEG dataset.
| Method | Computational Complexity |
|---|---|
| IG | 40,000 |
| SS | 12,200,000 |
| C-IG | 650,000 |
IG, integrated gradients with the zero-input baseline; SS, Shapley sampling; C-IG, compensated integrated gradients (proposed method).
Classification accuracy of models for three datasets.
| Dataset | Model | Accuracy |
|---|---|---|
| PhysioNet | Temporal CNNs | 81.6% |
| CHB-MIT | Temporal CNNs | 83.4% |
| Spatiotemporal CNNs | 88.7% | |
| UCI EEG | Temporal CNNs | 69.5% |
| Spatiotemporal CNNs | 77.1% |
Figure 3Contributions of EEG electrodes over the scalp to classify alcoholism (A) and control (B) using the temporal CNNs. The average contributions of the methods applied to 100 data examples from the UCI EEG dataset are shown. The red and blue areas represent positive and negative contributions to the classifications, respectively. The proposed method involves the compensated integrated gradients (left panels), the Shapley sampling shows the desired contributions as a reference (middle panels), and integrated gradients show the contributions obtained using the original integrated gradients method with the zero-input baseline (right panels).
Figure 4Contributions of EEG electrodes over the scalp to classify alcoholism (A) and control (B) using the spatiotemporal CNNs. The average contributions of the methods applied to 100 data examples from the UCI EEG dataset are shown. The red and blue areas represent positive and negative contributions to the classifications, respectively. The proposed compensated integrated gradients method is shown (left panels), while the Shapley sampling shows the desired contributions as a reference (middle panels), and integrated gradients show the contributions obtained using the original integrated gradients method with the zero-input baseline (right panels).