| Literature DB >> 30034334 |
Zhenshan Bing1, Claus Meschede1, Florian Röhrbein1, Kai Huang2, Alois C Knoll1.
Abstract
Biological intelligence processes information using impulses or spikes, which makes those living creatures able to perceive and act in the real world exceptionally well and outperform state-of-the-art robots in almost every aspect of life. To make up the deficit, emerging hardware technologies and software knowledge in the fields of neuroscience, electronics, and computer science have made it possible to design biologically realistic robots controlled by spiking neural networks (SNNs), inspired by the mechanism of brains. However, a comprehensive review on controlling robots based on SNNs is still missing. In this paper, we survey the developments of the past decade in the field of spiking neural networks for control tasks, with particular focus on the fast emerging robotics-related applications. We first highlight the primary impetuses of SNN-based robotics tasks in terms of speed, energy efficiency, and computation capabilities. We then classify those SNN-based robotic applications according to different learning rules and explicate those learning rules with their corresponding robotic applications. We also briefly present some existing platforms that offer an interaction between SNNs and robotics simulations for exploration and exploitation. Finally, we conclude our survey with a forecast of future challenges and some associated potential research topics in terms of controlling robots based on SNNs.Entities:
Keywords: brain-inspired robotics; learning control; neurorobotics; spiking neural network; survey
Year: 2018 PMID: 30034334 PMCID: PMC6043678 DOI: 10.3389/fnbot.2018.00035
Source DB: PubMed Journal: Front Neurorobot ISSN: 1662-5218 Impact factor: 2.650
Figure 1(A) Neuron (Wikipedia, 2017b). The figure is attributed to Quasar Jarosz at English Wikipedia, CC BY-SA 3.0, https://commons.wikimedia.org/w/index.php?curid=7616130. (B) Synapse (Wikipedia, 2017e). The figure is attributed to Thomas Splettstoesser (https://www.scistyle.com) - Own work, CC BY-SA 4.0, https://commons.wikimedia.org/w/index.php?curid=41349545.
Figure 2Number of publications whose abstract contains the terms “robot” and “spiking neural network” in the IEEE Explore and Elsevier Scopus database, respectively. Number of publications whose title contains the terms “robot” and its main text contains the term “spiking neural network” in the Springer database. All the data is from 2000 to 2016.
Figure 3General design framework for learning-inspired SNN-based robot control.
Figure 4Control architecture of feed-forward SNN. A R-STDP SNN is used to achieve lane-keeping task. The sensor input is the event sequence from DVS and the two LIF output neurons are used to decode motor speed. All these neurons are connected with R-STDP synapse in an “all to all” fashion.
Figure 5Control architecture of recurrent-SNN. A recurrent layer of state neurons is used to control the state of the agent and receives signals from the content population, which decides the target position according to different time step.
Learning rules based on STDP/Hebbian learning.
| Unsupervised | Two-wheel vehicle | 5 Proximity Sensors | Implementing an SNN on a resistive memory device and apply it to navigation tasks | Sarim et al., |
| Mobile vehicle Casis-I | 16 Ultrasonic Sensors | A behavior-based target-approaching navigation controller composed of three sub-controllers: the obstacle-avoidance, wall-following, and goal-approaching SNN controllers. | Wang et al., | |
| TriBot Robot | Distance Sensor, Contact Sensor, | Using SNN to make robot navigate in an unknown environment and avoid obstacles | Arena et al., | |
| Supervised | Two-wheel insect | 4 Proximity Sensors | Implementing an indirect training SNN in digital CMOS to navigate with obstacles | Hu et al., |
| Two-wheel insect | 2 Terrain, 2 Target | Indirectly train an SNN by RBFs to determine precise spike timings and minimize a desired objective function | Zhang et al., | |
| Aircraft | IMU | Indirectly training an SNN to approximate an optimal flight controller | Foderaro et al., | |
| 4-DoF robotic arm | 4 Joint Encoder, 3 Spatial direction of end-effector | Using supervised learning to train a single-layer network to control a robotic arm with 4 degrees of freedom in 3D space | Bouganis and Shanahan, | |
| 2-DoF robotic arm | Sensorimotor | Using supervised learning to train a spiking model of the cerebellum to control a robotic arm | Carrillo et al., | |
| Conditioning | Simulated fly | Olfactory Receptor | Implementing an SNN inspired by Drosophila olfactory system to simulate flight | Faghihi et al., |
| Lego EV3 robotic platform | Camera, Infrared sensor Colour/light sensor | Learning and unlearning autonomously locomotion based on visual-input with reinforced/aversive reflex-response | Jimenez-Romero, | |
| Two-wheel robot | 3 Proximity Sensors, 1 RGB Sensor | Using Reward-dependent STDP learning rule to allow OC and CC learning | Dumesnil et al., | |
| Foraging Ants | Olfactory Sensors, Nociceptor | Learns to associate olfactory sensor input with different behaviors through a single-layer SNN | Jimenez-Romero et al., | |
| Lego NXT 2.0 | Color sensor, Touch Sensor | Using SNN to sustain OC in multiple learning scenarios | Cyr et al., | |
| Two-wheel Vehicle | Light Sensors | Using light sensors in a target-reaching task to punish wrongful behavior | Iwadate et al., | |
| Two-wheel Vehicle | 5 Proximity Sensors, 9 IR Sensors, Vibration Sensor | Using infrared, ultrasound and visual neurons as CS and vibration neurons as US | Cyr and Boukadoum, | |
| Mobile vehicle Casis-I | 16 Ultrasonic Sensors | A learning algorithm combining operant conditioning and a shunting neural dynamics model is applied to the path planning | Wang et al., | |
| TriBot Robot | Distance Sensor, Camera, Contact Sensor, | Using target distance as CS, while contact sensors work as US causing an unconditioned response | Arena et al., | |
| R-STDP | Flapping Insect | GPS and IMU | Indirectly training an SNN-based controller for adaptive flight control | Clawson et al., |
| 1-DoF robotics arm | 5 Proximity Sensors | Using an SNN trained by a global reward and punishment signal to reach arbitrary targets | Spüler et al., | |
| Musculoskeletal arm, WAM robot | Encoders | Using a cortical spiking model composed of several hundred spiking model-neurons to control a two-joint arm | Dura-Bernal et al., | |
| CARL-SJR | Tactile Sensors | Using SNN to provide feedback to users by displaying bright colors on its surface. | Chou et al., | |
| Two-wheel vehicle | 2 Proximity Sensors | Implement a version of DA-modulated STDP on a food foraging task | Evans, | |
| Foraging Simulator | Visual Sensors | Using reward-STDP based SNN to solve a grid-based foraging task | Skorheim et al., | |
| DfRobotShop Rover | Camera, Light Sensor | Using an SNN and external flash to reinforce the goal-directed and adaptive behaviors | Helgadottir et al., | |
| 2-DoF robotics arm | Sensorimotor | Using an SNN based on R-STDP to control a two-joint virtual arm to reach to a fixed target | Neymotin et al., | |
| 1-DoF robotics arm | Encoder | Using an SNN to control a single-joint arm for target reaching | Chadderdon et al., | |
Figure 6Supervised Hebbian training of a synapse: The weight of the synapse between pre and post-synaptic neurons, N and N, is adjusted by the timing of the pre-synaptic spike-train s and external post-synaptic training signal s.
Figure 7Classical Conditioning with STDP synapse between N and N: An unconditioned stimulus (US) A or B causes the post-synaptic neuron N to fire. The conditioned stimulus (CS) firing shortly before its associated US will adjust its weights so that N will fire even in the absence of US. Due to the Hebbian learning rule, the synaptic weight is unchanged when the other, unrelated stimulus causes N to fire.
Figure 8Reward-modulated STDP synapse between N and N: Depending on the post-synaptic output spike-train, a reward r is defined that modulates the weight change of the synapse.
Other learning rules.
| Evolutionary algorithms | Speedometer, Proximity Sensors | Using evolutionary algorithm to train SNN and compare results with multi-layer perceptron | Markowska and Koldowski, | ||
| Quadrotor | GPS | Using evolutionary algorithm to generate high utility topology/weight combinations in the SNN | Howard and Elfes, | ||
| Two-wheel Vehicle | 5 IR Sensors | Using SNN to mimic the behaviors captured under control of a heuristic rule program | Batllori et al., | ||
| Khepera Robot (Two-wheel Vehicle) | Linear Camera | Using evolution to rapidly generate SNN capable of navigating in a textual environment | Floreano et al., | ||
| Two-wheel Vehicle | 4 IR Sensors | A use-dependent synaptic modification algorithm of SNN for obstacle-avoidance vehicle behavior | Alnajjar and Murase, | ||
| Two-wheel Vehicle | 9 Ultrasonic Sensors, 4 Bump Sensors | Using an adaptive GA to evolve the SNN online through interaction with the real environment | Hagras et al., | ||
| Fuzzy logical | Two-wheel Vehicle | 7 Ultrasonic Sensors (5 in front, 2 at back) | Using SNN to mimic the knowledge of a fuzzy controller | Kubota, | |
| Liquid state machine | Hexapod Robot | Visual Sensor (Distance, Height) | Mushroom bodies in drosophila are modeled as a recurrent SNN under LSM paradigm | Arena et al., | |
| 2-Dof Ball Balance Platform | Position and Velocity | Using a cortical network (LSM) to learn under a supervised learning rule for position control | Probst et al., | ||
| Khepera Robot | 8 IR Sensors | Using Randomly generated recurrent SNN to operate real-time obstacle avoidance | Burgsteiner, | ||
Taxonomy of platforms for robotics control based on SNN.
| Platform | Neurorobotics Platform | Design, import, and simulate different robot bodies and diverse brain models in rich environments | Falotico et al., | |
| Musculoskeletal Robots | Combining Myorobotics with SpiNNaker the proof of principle of a system that can scale to dozens of neurally controlled, physically compliant joints. | Richter et al., | ||
| Retina simulation | The retina simulation platform is integrated in the NRP. | Ambrosano et al., | ||
| Neural self-driving vehicle simulation framework | A visual encoder from camera images to spikes inspired by the silicon retina, and a steering-wheel decoder based on an agonist antagonist muscle model. | Kaiser et al., | ||
| iSpike | Interface between SNN simulators and the iCub humanoid robot | Gamez et al., | ||
| AnimatLab | Provide functions, such as robot modeling, two neural models, and plugins for importing other models. | Cofer et al., | ||