| Literature DB >> 22016720 |
Stefan Scholze1, Stefan Schiefer, Johannes Partzsch, Stephan Hartmann, Christian Georg Mayr, Sebastian Höppner, Holger Eisenreich, Stephan Henker, Bernhard Vogginger, Rene Schüffny.
Abstract
State-of-the-art large-scale neuromorphic systems require sophisticated spike event communication between units of the neural network. We present a high-speed communication infrastructure for a waferscale neuromorphic system, based on application-specific neuromorphic communication ICs in an field programmable gate arrays (FPGA)-maintained environment. The ICs implement configurable axonal delays, as required for certain types of dynamic processing or for emulating spike-based learning among distant cortical areas. Measurements are presented which show the efficacy of these delays in influencing behavior of neuromorphic benchmarks. The specialized, dedicated address-event-representation communication in most current systems requires separate, low-bandwidth configuration channels. In contrast, the configuration of the waferscale neuromorphic system is also handled by the digital packet-based pulse channel, which transmits configuration data at the full bandwidth otherwise used for pulse transmission. The overall so-called pulse communication subgroup (ICs and FPGA) delivers a factor 25-50 more event transmission rate than other current neuromorphic communication infrastructures.Entities:
Keywords: configurable pulse delays; configuration over AER; packet-based AER; serial AER in VLSI
Year: 2011 PMID: 22016720 PMCID: PMC3191349 DOI: 10.3389/fnins.2011.00117
Source DB: PubMed Journal: Front Neurosci ISSN: 1662-453X Impact factor: 4.677
Figure 1Overview of one wafer module of the FACETS/BrainScaleS waferscale neuromorphic system (Schemmel et al., .
Figure 2Logical structure of the off-wafer packet-based network and the on-wafer routing grid.
Figure 3Approximation for the region of senders to a single wafer for locally connected networks.
Figure 4Number of external spike sources for a fully utilized wafer with Gaussian (local) connectivity.
Figure 5Block diagram of DNC with its eight HICANN interfaces and the FPGA connection (Scholze et al., .
Figure 6Block diagram of the logical FPGA structure (Hartmann et al., .
Figure 7Photograph of the PCS in its measurement setup. Compared to the PCS in Figure 1, the heat pipes and heat sink have been removed for better visibility.
Characteristics of the presented DNC, overall PCS, and comparable implementations.
| Reference | Interfaces: event rate, pulse event size, and type | Sum of all interfaces | Event error detection | Config. over AER | Topol. remap. | Additional functionality | ||
|---|---|---|---|---|---|---|---|---|
| Host/PC | Inter-board | Neuro. chip | ||||||
| DNC | – | 500 Mevent/s, 24 bit, LVDS serial link | 364 Mevent/s, 24 bit, LVDS serial link | 864 Mevent/s | Yes | Yes | No | Event sorting, configurable delays |
| Overall PCS | 62 Mevents/s, 32 bit, GBit Ethernet | 1.28 Gevent/s, 28 bit, MultiGBit transceiver | 1.46 Gevent/s, 24 bit, LVDS serial link | 2.8 Gevent/s | Yes | Yes | Yes | Large event storage and playback |
| Serrano-Gotarredona et al. ( | 6 Mevent/s, 16 bit, USB2.0 | 25 Mevent/s, 16 bit, parallel | 25 Mevent/s, 16 bit, parallel | 56 Mevent/s | Partially | ? | Yes | Video to/from event transform |
| Berge and Häfliger ( | Not implemented | 41.7 Mevent/s, 20 bit, MultiGBit transceiver | 41.7 Mevent/s, 16 bit, parallel | 83 Mevent/s | No | Planned | No | Speed scaling possible to ca. 200 Mevent/s |
| Fasnacht et al. ( | 5 Mevent/s, 64 bit, USB2.0 | 78 Mevent/s, 32 bit, serial ATA | 30 Mevent/s, 16 bit, parallel | 113 Mevent/s | Yes | Partially | Yes | Asynchronous flow control |
| Merolla et al. ( | Not implemented | 45 Mevent/s, custom serial | 45 Mevent/s, custom serial | 90 Mevent/s | No | Partially | Yes | Traffic distribution by broadcast-mesh structure |
| Mayr et al. ( | 160 Mevent/s parallel synchronous | Not implemented | 160 Mevent/s, parallel synchronous | 320 Mevent/s | No | No | Yes | Event compression, distributed routing |
“Partially” in the sixth column means that error detection as part of standardized interfaces such as USB has been implemented, but no error detection exists for the customized (e.g., serial) communication links. The entries for Serrano-Gotarredona et al. (.
Figure 8Model of a synfire chain with feedforward inhibition employing a controllable delay, figure adapted from Kremkow et al. (.
Figure 13State space diagrams for the synfire chain. (A–D) Number of spikes per neuron in the last excitatory population of the synfire chain with respect to the parameters of the initial stimulation for ideal (A,C) and hardware (B,D) transmission and delays 4 ms (A,B) and 10 ms (C,D). The solid lines indicate the fitted separatrix between successful transmission of the spike volley to the end of the chain and diminishing activity. (E,F) Separatrices for different delay settings with ideal (E) and hardware (F) transmission.
Figure 9Delay measurement of the FPGA–DNC–HICANN connection. (A) sent/received spikes (B) measured delays (C) delay jitter.
Figure 10Bandwidth measurement of a single FPGA–DNC–HICANN connection. (A) received spike rate (B) spike loss.
Figure 11A sample pulse pattern of the synfire chain, with pulses of the inhibitory FS populations transmitted via the PCS. Upper plot: pulses sorted by population, from the bottom: initial stimulus, resultant activity of the first inhibitory FS population before (FS1) and after (FS1trans) transmission, activity of the first excitatory RS population (RS1), and the corresponding patterns for the second group of populations (compare Figure 8). Lower plot: given spike density and resulting histogram of the stimulus in the upper plot. Parameters are a = 4, σ = 20 ms, FS-to-RS (hardware) delay 10 ms.
Figure 12Transmission of spike volleys by the PCS. Pulse packets for 10 spike sources with varying number of spikes per neuron a and temporal spread σ were transmitted via the PCS, configured with an 8-ms delay, and the received spike train characterized: (A) fraction of actually transmitted pulses, (B) temporal spread ratio of received and sent pulse packets.