| Literature DB >> 35755868 |
Guobin Shen1,2, Dongcheng Zhao1, Yi Zeng1,3,4,2,5.
Abstract
The spiking neural network (SNN) mimics the information-processing operation in the human brain. Directly applying backpropagation to the training of the SNN still has a performance gap compared with traditional deep neural networks. To address the problem, we propose a biologically plausible spatial adjustment that rethinks the relationship between membrane potential and spikes and realizes a reasonable adjustment of gradients to different time steps. It precisely controls the backpropagation of the error along the spatial dimension. Secondly, we propose a biologically plausible temporal adjustment to make the error propagate across the spikes in the temporal dimension, which overcomes the problem of the temporal dependency within a single spike period of traditional spiking neurons. We have verified our algorithm on several datasets, and the experimental results have shown that our algorithm greatly reduces network latency and energy consumption while also improving network performance.Entities:
Keywords: SNN; backpropagation; biologically plausible spatial adjustment; biologically plausible temporal adjustment; low energy consumption; low latency; spiking neural network; surrogate gradient
Year: 2022 PMID: 35755868 PMCID: PMC9214320 DOI: 10.1016/j.patter.2022.100522
Source DB: PubMed Journal: Patterns (N Y) ISSN: 2666-3899
Classification accuracy on MNIST, CIFAR10, and CIFAR100 datasets
| Models | Training method | MNIST | CIFAR10 | CIFAR100 |
|---|---|---|---|---|
| Spiking CNN | conversion | – | 82.95 | – |
| BackRes | BP | – | 84.98 | – |
| ContinueSNN | conversion | 99.44 | 90.85 | – |
| Spike-Norm | conversion | – | 91.55 | – |
| STBP | BP | 99.42 | 50.7 | – |
| HM2BP | BP | 99.49 | – | – |
| LISNN | BP | 99.5 | – | – |
| BNTT | BP | – | 90.5 | 66.6 |
| STBP NeuNorm | BP | – | 90.53 | |
| BackEISNN | BP | 99.67 | 90.93 | – |
| SBPSNN | BP | 99.59 | 90.95 | – |
| TSSL-BP | BP | 99.53 | 91.41 | – |
| ST-RSBP | BP | 99.62 | – | – |
| RNL | conversion | 99.51 | 93.45 | 75.1 |
| SNASNet-Fw | NAS + BP | – | 93.64 | 70.06 |
| SNASNet-Bw | NAS + BP | – | 94.12 | 73.04 |
| Our method | BP | 99.67 | 92.15 | 68.28 |
| Our method ResNet34 | BP | – | 94.51 | 69.32 |
Classification accuracy on N-MNIST, DVS-Gesture, and DVS-CIFAR10 datasets
| Models | Method | N-MNIST | DVS-Gesture | DVS-CIFAR10 |
|---|---|---|---|---|
| HM2-BP | BP | 98.88 | – | – |
| SLAYER | BP | 99.2 | 93.64 | – |
| TSSL-BP 30 | BP | 99.28 | – | – |
| IIRSNN | BP | 99.28 | – | – |
| TSSL-BP 100 | BP | 99.4 | – | – |
| STBP | BP | 99.44 | – | – |
| LISNN | BP | 99.45 | – | – |
| STBP NeuNorm | BP | 99.53 | – | 60.5 |
| BNTT | BP | – | – | 63.2 |
| SALT | BP | – | – | 67.1 |
| STBP-tdBN | BP | – | 96.87 | 67.8 |
| LMCSNN | BP | 99.61 | 97.57 | 74.8 |
| BackEISNN | BP | 99.57 | – | – |
| Our method | BP | 99.71 | 98.96 | 78.95 |
Classification accuracy on Google Speech Commands dataset
| Models | Method | Accuracy |
|---|---|---|
| Sample-level | DNN | 92.53 |
| Attention RNN | DNN | 93.9 |
| Sample-level + SE | DNN | 93.95 |
| Harmonic filters | DNN | 96.39 |
| Our method | SNN | 94.2 |
The ablation study of the two adjustments on DVS-Gesture and DVS-CIFAR10 datasets
| Baseline | BPSA | BPSA + BPTA | |
|---|---|---|---|
| DVS-Gesture | 93.92 | 97.56 | 98.96 |
| DVS-CIFAR10 | 71.40 | 75.30 | 78.95 |
Figure 1The test accuracy curve on DVS-Gesture of our method and the baseline
Figure 2The firing frequency of different convolutional layers on MNIST of our method and the baseline
The energy-efficiency study of our model with baseline on different datasets
| Dataset | Accuracy (%) | Firing rate | EE = |
|---|---|---|---|
| MNIST | 99.58/99.42 | 0.082/0.183 | 35.1/15.7 |
| N-MNIST | 99.61/99.32 | 0.097/0.176 | 29.6/16.3 |
| CIFAR10 | 92.33/89.49 | 0.108/0.214 | 26.6/13.4 |
| DVS-Gesture | 98.26/93.92 | 0.083/0.165 | 34.6/17.4 |
| DVS-CIFAR10 | 77.76/71.40 | 0.097/0.177 | 29.5/16.2 |
Represented as baseline/our method.
Figure 3The test accuracy of different simulation lengths on DVS-Gesture dataset with our method and the baseline
The test accuracy on DVS-Gesture dataset of different simulation lengths of our method and the baseline
| T = 32 | T = 16 | T = 8 | T = 4 | |
|---|---|---|---|---|
| BPSA + BPTA | 98.27 | 98.26 | 96.18 | 92.01 |
| BPSA | 96.53 | 97.56 | 94.44 | 89.58 |
| Baseline | 95.49 | 93.92 | 84.03 | 73.96 |
Figure 4The forward and backward process of spiking neural networks
The dotted lines of different colors indicate the impact on the network at different time steps. The earlier spiking node will have more influence on the parameter update.
Figure 5The temporal backpropagation of LIF neurons
The information can only propagate within a single-spike period and cannot propagate cross spikes.
Figure 6The temporal residual pathway helps the error transfer from time step to time step t