| Literature DB >> 24782758 |
Philip J Tully1, Matthias H Hennig2, Anders Lansner3.
Abstract
Learning and memory operations in neural circuits are believed to involve molecular cascades of synaptic and nonsynaptic changes that lead to a diverse repertoire of dynamical phenomena at higher levels of processing. Hebbian and homeostatic plasticity, neuromodulation, and intrinsic excitability all conspire to form and maintain memories. But it is still unclear how these seemingly redundant mechanisms could jointly orchestrate learning in a more unified system. To this end, a Hebbian learning rule for spiking neurons inspired by Bayesian statistics is proposed. In this model, synaptic weights and intrinsic currents are adapted on-line upon arrival of single spikes, which initiate a cascade of temporally interacting memory traces that locally estimate probabilities associated with relative neuronal activation levels. Trace dynamics enable synaptic learning to readily demonstrate a spike-timing dependence, stably return to a set-point over long time scales, and remain competitive despite this stability. Beyond unsupervised learning, linking the traces with an external plasticity-modulating signal enables spike-based reinforcement learning. At the postsynaptic neuron, the traces are represented by an activity-dependent ion channel that is shown to regulate the input received by a postsynaptic cell and generate intrinsic graded persistent firing levels. We show how spike-based Hebbian-Bayesian learning can be performed in a simulated inference task using integrate-and-fire (IAF) neurons that are Poisson-firing and background-driven, similar to the preferred regime of cortical neurons. Our results support the view that neurons can represent information in the form of probability distributions, and that probabilistic inference could be a functional by-product of coupled synaptic and nonsynaptic mechanisms operating over several timescales. The model provides a biophysical realization of Bayesian computation by reconciling several observed neural phenomena whose functional effects are only partially understood in concert.Entities:
Keywords: Bayes' rule; Hebbian learning; intrinsic excitability; naïve Bayes classifier; spiking neural networks; synaptic plasticity and memory modeling
Year: 2014 PMID: 24782758 PMCID: PMC3986567 DOI: 10.3389/fnsyn.2014.00008
Source DB: PubMed Journal: Front Synaptic Neurosci ISSN: 1663-3563
Figure 1Reconciling neuronal and probabilistic spaces using the spike-based BCPNN architecture for a postsynaptic minicolumn with activity A cartoon of the derived network incorporates H = 5 hypercolumns each containing n = 4 minicolumns that laterally inhibit each other (red lines) to perform a WTA operation via local inhibitory interneurons (red circles). The dotted gray area is represented by B in detail. (B) Weighted input rates x1… are summed and passed through a transfer function to determine the amount of output activation. Connections w can be viewed as synaptic strengths (black lines, semicircles) or inverted directed acyclic graph edges representing the underlying generative model of a naïve Bayes classifier.
Figure 2Schematic flow of BCPNN update equations reformulated as spike-based plasticity. (A) The S pre- (A–D, red) and S postsynaptic (A–D, blue) neuron spike trains are presented as arbitrary example input patterns. Each subsequent row (B–D) corresponds to a single stage in the EWMA estimate of the terms used in the incremental Bayesian weight update. (B) Z traces low pass filter input spike trains with τ = τ. (C) E traces compute a low pass filtered representation of the Z traces at time scale τ. Co-activity now enters in a mutual trace (C,D, black). (D) E traces feed into P traces that have the slowest plasticity and longest memory, which is established by τ.
Figure 3Delayed reward learning using A pair of neurons fire randomly and elicit changes in the pre- (red) and postsynaptic (blue) Z traces of a BCPNN synapse connecting them. Sometimes by chance (pre before post*, synchronous+, post before pre#), the neurons fire coincidentally and the degree of overlap of their Z traces (inset, light blue), regardless of their order of firing, is propagated to the mutual eligibility trace E. (B) A reward (pink rectangular function, not to scale) is delivered as external supervision. Resulting E traces are indicated (gray line, τ = 100 ms and black line, τ = 1000 ms). (C) Behavior of color corresponding P traces and weights (inset) depends on whether or not the reward reached the synapses in ample time.
When parameters are not explicity listed in the text, they are interleaved below, following (Nordlie et al., .
| Neuron model | Leaky IAF |
| Synapse model | Conductance-based with |
| Channel model | K+ channel |
| Input model | Fixed-rate Poisson spike trains |
| Measured quantities | Spike activity, connection strengths, biases, voltages |
| Leaky IAF dynamics | Subthreshold membrane potential |
| Spiking: If | |
| Parameters | |
| ϕ = 50 pA current scaling factor Figures | |
| Activity-dependent hyperpolarizing | K+/CAN current of neuron |
| See Equation 8 for calculation of | |
| Parameters | τ |
| τ | |
| τ | |
| ϵ = 1/( | |
| α-shape PSC dynamics | Excitatory |
| BCPNN synapse | Synaptic strength between |
| See Equation 8 for calculation of | |
| Parameters | τ |
| τ | |
| τ | |
| τ | |
| τ | |
| κ = 1.0, 0.0 to freeze plasticity (Figure | |
| Poisson generator | |
| | |
| | |
| | |
| | |
Figure 4Spike-based BCPNN estimates abstract BCPNN for different input patterns. (A) Pre- and (B) postsynaptic input spike trains. Activation patterns (shaded rectangles) of abstract BCPNN units and corresponding Poisson spike trains (vertical bars) firing at fmax Hz elicited in IAF neurons are differentiated by color. (C) Weight and bias (inset) development under different protocol for the abstract (dotted) and spike-based (solid) versions of the learning rule. Spiking simulations were repeated 100 times and averaged, with standard deviations illustrated by the shaded regions.
Figure 5STDP function curves are shaped by the Schematic representation of the STDP conditioning protocol. Each pre (blue)—post (green) pairing is repeated for each time difference Δt = t1 − t2 illustrated in (C–E). (B) Weight dependence for positive (Δt = 0 ms, solid line) and negative (Δt = 50 ms, dashed line) spike timings. Compare to Figure 5 of Bi and Poo (1998). (C) Relative change in peak synaptic amplitude using τ = 5 ms, τ = 5 ms, τ = 100 ms, and τ = 10000 ms. This curve is reproduced in (D–F) using dotted lines as a reference. (D) The width of the LTP window is determined by the magnitude of the Z trace time constants. When τ is changed to 2 ms, the coincident learning window shifts right. (E) Instead when τ is changed to 2 ms, it shifts left. Note that a decrease in τ is thus qualitatively consistent with the canonical STDP kernel. (F) Changing the P trace time constant influences the amount of LTD. When τ is doubled to 20,000 ms, the learned correlations tend to decay at a slower rate.
Figure 6The BCPNN learning rule exhibits a stable equilibrium weight distribution. (A) Progression of averaged rates of firing (3 s bins) for the presynaptic (blue) and postsynaptic (black) neurons in the network. (B) Setup involves 1000 Poisson-firing presynaptic neurons that drive one postsynaptic cell. (C) The BCPNN synaptic strengths recorded every 100 ms (blue, dotted white line is their instantaneous mean) has an initial transient but then remains steady throughout the entire simulation despite deviation amongst individual weights within the equilibrium distribution. (D) BCPNN weight histogram plotted for the final time epoch is unimodal and approximately normally distributed (blue line, μ0 = 0.0 and σ0 = 0.38).
Figure 7A shift in the weight distribution of correlated neurons arises from structured input. (A) Progression of averaged rates of firing (3 s bins) for the externally stimulated uncorrelated (blue) and correlated (pictured C = 0.2, red) presynaptic neurons, along with the postsynaptic (black) neuron they drive in the network. (B) Setup involves 900 uncorrelated and 100 correlated presynaptic neurons that drive one postsynaptic cell. (C) Synaptic strengths recorded every 100 ms from the correlated group gradually specialize over time vs. their uncorrelated counterparts, resulting in a change in the mean distribution of weights (white dotted lines for each, here C = 0.2). (D) Weight histograms plotted for the final time epoch are unimodal and approximately normally distributed (C = 0.2, μ0 = −0.03, μ0 = 0.34 and σ0 ≈ σ+ = 0.18). (E) The separation between these distributions is expressed as d1, which increases as a function of the input correlation coefficient. (F) Summed weights for the correlated (red), uncorrelated (blue), and combined (black) in the final epoch as a function of C. (G) Same as in (F) but with τ and τ increased by a factor of 2. In both instances, the combined weights remain relatively constant around w = 0, although lower time constants induce more substantial differences between the correlated and uncorrelated weights. Error bars depict the standard deviation gathered from 50 repeated trials.
Figure 8Exponential activation function of a lowly firing IAF neuron is shifted by an injection of a hyperpolarizing current proportional to β Voltage trace and resulting long tail distribution of a membrane potential histogram from an IAF neuron approaching firing threshold of −55 mV (bin size = 0.15 mV). (B) The input-output curve of an IAF neuron with 30 inputs each firing at values listed along the abscissa (black, simulated; blue, see Materials and Methods for theoretical IAF rate). For low firing frequencies at or below 20 Hz, the function is approximately exponential (red-dotted fit: y = 0.48e0.29( − 0.47). (C) The bias term shows logarithmically increasing firing rate values of the neuron for which it is computed. (D) When hyperpolarizing current proportional to β is applied, neurons that have previously been highly active will be more easily excitable (e.g., yellow curve) compared to neurons that have had little recent history of firing (e.g., blue curve). Error bars depict the standard deviation gathered from 50 repeated experiments.
Figure 9The bias term reproduces the graded persistent activity found in entorhinal cortical neurons. (A) Stimulation protocol. Repetitive depolarizing followed by hyperpolarizing current injections (switch occurs at black arrow) of the IAF neuron including β. (B) Peristimulus time histogram (2 s bin width) of the elicited discharge. Red bars indicate the time averaged activity of each 1 min post-stimulus interval. Time averaged activity of 1 min post-stimulus intervals using 0.3 nA depolarizing steps each lasting 2 s (stars, red-dotted line: linear fit). (C) Underlying P trace evolution during the simulation.
Figure 10Spiking BCPNN performs a simple Bayesian inference. (A) Network architecture with excitatory (black) and inhibitory (gray) connections between local minicolumns. Input neurons of groups X and Y each project to the output layer X′ (green) and Y′ (blue), which mutually inhibit each other via an inhibitory WTA population (gray). (B) Posterior probability distributions are reflected by the output rates of postsynaptic neuron pools X′ and Y′ (colors corresponding to A) in 1 s bins during recall. (C) Evolution of the mean weight matrix during training, where each cell represents the averaged activity for all 900 connections. Three snapshots were taken during learning: one at the beginning, one a tenth of the way through, and one at the end of the simulation. Weights that were developed in alternating 200 ms intervals were initially volatile, but eventually settled into a symmetrical terminal weight structure.