| Literature DB >> 29674961 |
Paul Ferré1,2, Franck Mamalet2, Simon J Thorpe1.
Abstract
We present a novel strategy for unsupervised feature learning in image applications inspired by the Spike-Timing-Dependent-Plasticity (STDP) biological learning rule. We show equivalence between rank order coding Leaky-Integrate-and-Fire neurons and ReLU artificial neurons when applied to non-temporal data. We apply this to images using rank-order coding, which allows us to perform a full network simulation with a single feed-forward pass using GPU hardware. Next we introduce a binary STDP learning rule compatible with training on batches of images. Two mechanisms to stabilize the training are also presented : a Winner-Takes-All (WTA) framework which selects the most relevant patches to learn from along the spatial dimensions, and a simple feature-wise normalization as homeostatic process. This learning process allows us to train multi-layer architectures of convolutional sparse features. We apply our method to extract features from the MNIST, ETH80, CIFAR-10, and STL-10 datasets and show that these features are relevant for classification. We finally compare these results with several other state of the art unsupervised learning methods.Entities:
Keywords: Spike-Timing-Dependent-Pasticity; neural network; unsupervised learning; vision; winner-takes-all
Year: 2018 PMID: 29674961 PMCID: PMC5895733 DOI: 10.3389/fncom.2018.00024
Source DB: PubMed Journal: Front Comput Neurosci ISSN: 1662-5188 Impact factor: 2.380
Figure 1Processing chain for the region WTA.
Weight update given x, y, and w following the proposed learning rule (Equation 9).
| −1 | − | +1 | |
| +1 | − | −1 |
Figure 2Architecture of the network in the MNIST experiment.
Figure 3Eight 5 × 5 features learned from MNIST dataset on raw images.
MNIST accuracy.
| SDNN (Kheradpisheh et al., | 98.40 |
| Two layer SNN (Diehl and Cook, | 95.00 |
| PCA-Net (Chan et al., | 98.94 |
| Our method | 98.49 |
Figure 4Architecture of the network in the ETH80 experiment.
ETH80 results.
| HMAX (Riesenhuber and Poggio, | 69.0 |
| SDNN (Kheradpisheh et al., | 82.8 |
| Our method | 75.2 |
Figure 5Architecture of the network in the CIFAR-10 experiment.
Figure 6(A) Sixty-four filters of size 7 × 7 learned with our method on the CIFAR-10 dataset. (B) The weights distribution of the network's first layer trained on CIFAR-10.
CIFAR-10 results.
| Triangle k-means (1,600 features) (Coates et al., | Yes | 50,000 | 79.6 |
| Triangle k-means (100 features) (Coates et al., | Yes | 50,000 | 55.5 |
| PCA-Net (Chan et al., | Yes | 50,000 | 78.67 |
| LIF CNN (Hunsberger and Eliasmith, | No | 50,000 | 82.95 |
| Regenerative Learning (Panda and Roy, | Yes | 20,000 | 70.6 |
| Our method (64 features) | Yes | 5,000 | 71.2 |
| CNN random frozen filters | No | 50,000 | 55.3 |