| Literature DB >> 32174804 |
Sergey A Lobov1,2, Alexey N Mikhaylov1, Maxim Shamshin1, Valeri A Makarov1,3, Victor B Kazantsev1,2.
Abstract
Development of spiking neural networks (SNNs) controlling mobile robots is one of the modern challenges in computational neuroscience and artificial intelligence. Such networks, being replicas of biological ones, are expected to have a higher computational potential than traditional artificial neural networks (ANNs). The critical problem is in the design of robust learning algorithms aimed at building a "living computer" based on SNNs. Here, we propose a simple SNN equipped with a Hebbian rule in the form of spike-timing-dependent plasticity (STDP). The SNN implements associative learning by exploiting the spatial properties of STDP. We show that a LEGO robot controlled by the SNN can exhibit classical and operant conditioning. Competition of spike-conducting pathways in the SNN plays a fundamental role in establishing associations of neural connections. It replaces the irrelevant associations by new ones in response to a change in stimuli. Thus, the robot gets the ability to relearn when the environment changes. The proposed SNN and the stimulation protocol can be further enhanced and tested in developing neuronal cultures, and also admit the use of memristive devices for hardware implementation.Entities:
Keywords: learning; memristive devices; neural competition; neuroanimat; neurorobotics; spike-timing-dependent plasticity; spiking neural networks; synaptic competition
Year: 2020 PMID: 32174804 PMCID: PMC7054464 DOI: 10.3389/fnins.2020.00088
Source DB: PubMed Journal: Front Neurosci ISSN: 1662-453X Impact factor: 4.677
FIGURE 1Experimental setup. (A) Mapping of the sensory and motoneurons in the mobile LEGO robot. (B) Simple SNN controlling basic robot movements and providing unconditional responses to touch stimuli. (C) Signaling pathways. Touch (top) and sonar (bottom) sensory neurons receive stimulating trains of rectangular pulses from the corresponding sensors. Then, motoneurons drive the robot’s motors.
FIGURE 2The shortest pathway rule. STDP potentiates the shortest pathways and inhibits alternative connections (W, τ are the weight and axonal delay of the coupling from neuron j to neuron i). (A,B) Left: Initial situation. Right: After STDP. The link width corresponds to the synaptic strength. Presynaptic spikes in a unidirectional chain precede postsynaptic spikes and STDP potentiates synaptic couplings. (C) The shortcut from neuron N1 to N3 makes the coupling from N2 to N3 “unnecessary” and STDP depresses it. (D) Spikes in the network and evolution of synaptic weights.
FIGURE 3Associative learning based on the spatial properties of STDP. (A) The initial SNN. (B) Potentiation of the coupling w and depression of w during simultaneous stimulation of neuron N3 and N1 (US pulses are applied with a delay of 10 ms relative to CS pulses in order to comply with the STDP protocol).
FIGURE 4Model of classical conditioning. (A) The design of a two-channel SNN by duplicating the single-channel SNN (Figure 3) sharing some neurons. The neural circuit includes neurons N1–N4 involved in learning. Motoneurons N5 and N6 provide turning the robot away from an obstacle. (B) The SNN after learning. PA, parallel association: N1 (N2) is associated with N3 (N4), couplings w and w are potentiated. DA, diagonal association: N2 (N1) is associated with N3 (N4), couplings w and w are potentiated. (C) Application of a stimulus to the touch and sonar sensors. (D) Evolution of the average weights of parallel (w) and diagonal (w) couplings under classical conditioning. Arrows PA and DA denote the time instants of the beginning of learning with correspondent scheme of the US mapping; touchL (touchR) is the time course of triggering the left (right) touch sensor.
FIGURE 5Operant conditioning. (A) Trajectory of the robot in the first 2 min of the experiment. Exclamation marks indicate the positions of collisions with obstacles. (B) Same as in (A) but after learning. (C) Evolution of the weights of parallel (w) and diagonal (w) couplings (compare to Figure 4D). Beige and green-blue bars correspond to periods (A,B), respectively.