| Literature DB >> 30210278 |
Runchun Wang1, André van Schaik1.
Abstract
We present a massively-parallel scalable multi-purpose neuromorphic engine. All existing neuromorphic hardware systems suffer from Liebig's law (that the performance of the system is limited by the component in shortest supply) as they have fixed numbers of dedicated neurons and synapses for specific types of plasticity. For any application, it is always the availability of one of these components that limits the size of the model, leaving the others unused. To overcome this problem, our engine adopts a unique novel architecture: an array of identical components, each of which can be configured as a leaky-integrate-and-fire (LIF) neuron, a learning-synapse, or an axon with trainable delay. Spike timing dependent plasticity (STDP) and spike timing dependent delay plasticity (STDDP) are the two supported learning rules. All the parameters are stored in the SRAMs such that runtime reconfiguration is supported. As a proof of concept, we have implemented a prototype system with 16 neural engines, each of which consists of 32768 (32k) components, yielding half a million components, on an entry level FPGA (Altera Cyclone V). We verified the prototype system with measurement results. To demonstrate that our neuromorphic engine is a high performance and scalable digital design, we implemented it using TSMC 28nm HPC technology. Place and route results using Cadence Innovus with a clock frequency of 2.5 GHz show that this engine achieves an excellent area efficiency of 1.68 μm2 per component: 256k (218) components in a silicon area of 650 μm × 680 μm (∼0.44 mm2, the utilization of the silicon area is 98.7%). The power consumption of this engine is 37 mW, yielding a power efficiency of 0.92 pJ per synaptic operation (SOP).Entities:
Keywords: FPGA; neural engine; neuromorphic engineering; spike timing dependant delay plasticity; spike timing dependant plasticity; spiking neural networks
Year: 2018 PMID: 30210278 PMCID: PMC6123369 DOI: 10.3389/fnins.2018.00593
Source DB: PubMed Journal: Front Neurosci ISSN: 1662-453X Impact factor: 4.677
Arrangement of the neural states.
| Bit[7:4] | Bit[3:0] | |
|---|---|---|
| LIF neuron | Membrane voltage ( | PSC |
| STDP adaptor | The exponential time window. Bit[7] indicates the polarity of the time window (0 and 1: triggered by pre- and post-synaptic spike). Bit[6:4] are the value of the exponential time window. | Bit[3]: single active signal of the pre-synaptic spike Bit[2:0]: synaptic weight (positive only) |
| STDDP adaptor | Ramp value | Axonal delay |
Functions of the physical component.
| Time-driven tasks | Event-driven tasks | |
|---|---|---|
| LIF Neuron | (1)Apply the leak to the PSC and the membrane voltage. | (1) Update the PSC with the incoming pre-synaptic spikes. |
| STDP adaptor | (1)Apply the leak to the time window for the exponential decay. | (1) Start the time window when there is an incoming spike. |
| STDDP adaptor | (1) Increase the ramp value by 1 when the ramp is active. | (1) Start the ramp when there is a pre-synaptic spike. |
Device utilization Altera cyclone 5CSXFC6D6F31C6.
| Adaptive Logic Modules (ALMs) | RAMs | DSPs | |
|---|---|---|---|
| Neural engine | 1250 | 256k bit | 18 |
| Event-driven module | 946.4 | N/A | 16 |
| Time-driven module | 76.5 | N/A | 2 |
| Controller | 227.1 | N/A | N/A |
| Prototype system | 29212/41910 | 4.6 M bit/5.6 M bit | 112/112∗ |
FPGA’s power dissipation estimated by PowerPlay.
| Total power dissipation | 1.95 W |
| I/O power dissipation | 45.6 mW |
| Core static power dissipation | 434.5 mW |
| Core dynamic power dissipation | 1.48 W |
| One Neural engine dynamic power dissipation | 37.75 mW |
| DDR interface dynamic power dissipation | 67.16 mW |
| USB Interface dynamic power dissipation | 20.61 mW |
| Debug module dynamic power dissipation | 133.1 mW |
| Routing dynamic power dissipation∗ | 695.3 mW |
Parameters of the LIF neuron.
| τEPSC | 6 ms |
| τlPSC | 6 ms |
| τmem | 5.5 ms |
| τrfc | 3 ms |
| 1 |
Comparison of with the state of the art of spiking neuromorphic circuits, adapted and completed from (Frenkel et al., 2018).
| HICANN | NeuroGrid | ROLLS | DYNAPs | IFAT | TrueNorth | Loihi | Odin | This work | |||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Implementation | Mixed-signal | Mixed-signal | Mixed-signal | Mixed-signal | Mixed-signal | Mixed-signal | Digital | Digital | Digital | Digital | Digital |
| Technology | 180 nm | 180 nm | 180 nm | 28 nm | 90 nm | 28 nm | 45 nm | 28 nm | 14 nm | 28 nm | 28 nm |
| Area of a neurosynaptic core [mm2] | 26.3 | 168 | 51.4 | 7.5 | 0.31 | 0.36 | 0.8/1.15 | 0.095 | 0.4 | 0.086 | 0.44 |
| Neurons per core | 512 | 64k | 256 | 256 | 2k | 64 | 256 | 256 | max. 1024 | 256 | 256k† |
| Synaptic weight storage | 4-bit SRAM | Off-chip | Capacitor | 12-bit CAM | Off-chip | 4-bit SRAM | 1-bit/4-bit SRAM | 1-bit SRAM | 1- to 9-bit SRAM | (3+l)-bit SRAM | 3-bit SRAM |
| Embedded online learning | STDP | No | SDSP | No | No | SDSP | Probabilistic STDP | No | Programmable | SDSP | STDP STDDP |
| Synapse per core | 112k | – | 128k | 16k | – | 8k | 64k | 64k | 1M to 114k (1-to 9-bit) | 64k | 256k† |
| Neuron model | Adaptive exponential IF | Adaptive quadratic IF | Adaptive LIF | Adaptive LIF | 2-compartment LIF | Adaptive exponential IF | Configurable LIF | Configurable LIF | Adaptive LIF | Phenomenological | 2-compartment LIF |
| Time constant | Accelerated | Biological | Biological | Biological | Biological | Bio to accel | Biological | Biological | N/A | Bio to accel | Bio to accel |
| Neuron core density [neur/mm2]∗ | 19.5 | 390 | 5 | 34 | 6.5k | 178 | 827/575 | 2.6k | Max. 640 | 3.0k | 582k† |
| Synapse core density [syn/mm2]∗ | 4.3k | – | 2.5k | 2.1k | – | 22.2k | 207k/144k | 673k | 640k to 71k | 741k | 582k† |
| Supply voltage | 1.8 V | 3.0 V | 1.8 V | 1.3 V-1.8 V | 1.2 V | 0.75 V, 1.0 V | 0.53 V–1.0 V | 0.7 V–1.05 V | 0.5 V–1.25 V | 0.55 V–1.0 V | 0.9 V |
| Energy per SOP | N/A | 941 pJ | 77 fJ | 134 fJ–417 fJ | 22 pJ | >850 pJ° | N/A | 26p at 0.775 V | >23.6 pJ at 0.75‡ | 9.8 pJ at 0.55 V | 0.92 pJ |