Literature DB >> 32820723

Learning prediction error neurons in a canonical interneuron circuit.

Loreen Hertäg1,2, Henning Sprekeler1,2.   

Abstract

Sensory systems constantly compare external sensory information with internally generated predictions. While neural hallmarks of prediction errors have been found throughout the brain, the circuit-level mechanisms that underlie their computation are still largely unknown. Here, we show that a well-orchestrated interplay of three interneuron types shapes the development and refinement of negative prediction-error neurons in a computational model of mouse primary visual cortex. By balancing excitation and inhibition in multiple pathways, experience-dependent inhibitory plasticity can generate different variants of prediction-error circuits, which can be distinguished by simulated optogenetic experiments. The experience-dependence of the model circuit is consistent with that of negative prediction-error circuits in layer 2/3 of mouse primary visual cortex. Our model makes a range of testable predictions that may shed light on the circuitry underlying the neural computation of prediction errors.
© 2020, Hertäg and Sprekeler.

Entities:  

Keywords:  neural circuits; neuroscience; none; prediction-error neurons; predictive processing; sensorimotor processing; synaptic plasticity; visual system

Mesh:

Year:  2020        PMID: 32820723      PMCID: PMC7442488          DOI: 10.7554/eLife.57541

Source DB:  PubMed          Journal:  Elife        ISSN: 2050-084X            Impact factor:   8.140


Introduction

Changes in sensory inputs can arise from changes in our environment, but also from our own movements. When you walk through a room full of people, your perspective changes over time, and you will experience a global visual flow. Superimposed on this global change are local changes generated by the movements of the people around you. An essential task of sensory perception is to disentangle these different origins of sensory inputs, because the appropriate behavioral responses to environmental and to self-generated changes are often different. Am I approaching a person or is she approaching me? A common assumption is that perceptual systems subtract from the sensory data an internal prediction (Bell, 1981; Rao and Ballard, 1999; Friston, 2005; Spratling, 2010; Franklin and Wolpert, 2011; den Ouden et al., 2012; Kennedy et al., 2014; Keller and Mrsic-Flogel, 2018), which is calculated from an efference copy of the motor signals our brain has issued. Changes in the external world then take the form of mismatches – or prediction errors – between internal predictions and sensory data (Wolpert et al., 1995). This comparison requires an accurate prediction system that adapts to ongoing changes in the environment or in behavior. An efficient way to ensure a flexible adaptation is to render the prediction circuits experience-dependent by minimizing prediction errors (Wolpert et al., 2011). Neural hallmarks of prediction errors are found throughout the brain. Dopaminergic neurons in the basal ganglia and the striatum (Schultz and Dickinson, 2000) encode a reward prediction error (mismatch between expected and received reward), and subsets of neurons in visual cortex (Keller et al., 2012; Zmarz and Keller, 2016; Attinger et al., 2017), auditory cortex (Eliades and Wang, 2008; Keller and Hahnloser, 2009) and barrel cortex (Ayaz et al., 2019) code for a mismatch between feedback and feedforward information. While neural correlates of prediction errors have been found broadly, the circuit level mechanisms that underlie their computation are poorly understood. Given that prediction errors involve a subtraction of expectations from sensory data (for an alternative implementation employing divisive inhibition, see Spratling, 2008; Spratling, 2017; Spratling, 2019), the relevant circuits likely involve both excitatory and inhibitory pathways (Attinger et al., 2017). Negative prediction-error (nPE) neurons, which are activated only when sensory signals are weaker than predicted, are likely to receive excitatory predictions counterbalanced by inhibitory sensory signals. Conversely, positive prediction-error (pPE) neurons, which respond only when sensory signals exceed the internal prediction, may receive excitatory sensory signals counterbalanced by inhibitory predictions (Rao and Ballard, 1999; Keller and Mrsic-Flogel, 2018). How the complex inhibitory circuits of the cortex (Markram et al., 2004; Rudy et al., 2011; Pfeffer et al., 2013; Jiang et al., 2015; Tremblay et al., 2016; Wamsley and Fishell, 2017) support the computations of these prediction errors is not resolved and neither are the activity-dependent forms of plasticity that would allow these circuits to refine the prediction machine. For prediction-error neurons, fully predicted sensory signals should cancel with the internal prediction and hence trigger no response. We therefore hypothesized that an experience-dependent formation and refinement of prediction-error circuits can be achieved by balancing excitation and inhibition in an activity-dependent manner. Using a computational model comprised of excitatory pyramidal cells and three types of inhibitory interneurons, we show that nPE neurons can be learned by inhibitory synaptic plasticity rules that balance excitation and inhibition in principal cells. We find that the circuit shows a similar experience dependence as observed in V1 (Attinger et al., 2017). Depending on which interneuron classes receive motor predictions and which receive sensory signals, the plasticity rules shape different, fully functional variants of the prediction circuit. Using simulated optogenetic experiments, we show that these variants have identifiable fingerprints in their reaction to optogenetic activation or inactivation of different interneuron classes. Finally, we demonstrate that the inhibitory prediction circuits can be learned by biologically plausible forms of homeostatic inhibitory synaptic plasticity, which only rely on local information available at the synapses.

Results

We studied a rate-based network model of layer 2/3 of rodent V1 to investigate how prediction-error (PE) neurons develop. In the following, we will focus primarily on negative prediction-error (nPE) neurons. In V1, nPE neurons have been studied more extensively, which allows us to compare our results with experimental findings. However, the same approaches and principles derived for nPE neurons can also be applied to positive prediction-error (pPE) neurons (see Appendix 2—figures 1 and 2). The network model includes excitatory pyramidal cells (PCs) as well as inhibitory parvalbumin-expressing (PV), somatostatin-expressing (SOM) and vasoactive intestinal peptide-expressing (VIP) interneurons (Figure 1a). The relative abundance of the four cell types and the probability of the respective synaptic connections are chosen in line with electrophysiological studies (see Materials and methods). While all inhibitory neurons are modeled as point neurons (Wilson and Cowan, 1972), we used a two-compartment model for PCs with a rectifying active dendritic process that allowed nonlinear dynamics akin to dendritic calcium spikes (Murayama et al., 2009) (see Materials and methods and Appendix 1).
Appendix 2—figure 1.

Multi-pathway balance of excitation and inhibition in different pPE neuron circuits.

(a) Excitatory, inhibitory, disinhibitory and dis-disinhibitory pathways onto PCs that need to be balanced in pPE neuron circuits. Input to the soma of PCs and PV neurons is varied (c–f). VIP neurons receive visual input, SOM neurons receive a motor-related prediction thereof. (b) Test stimuli: Feedback (FB), mismatch (MM) and playback (PB) phases of 1 s each. (c) PCs receive visual input. PV neurons receive visual and motor inputs (left, top). When all visual (V) and motor (M) pathways are balanced (left, bottom), PCs act as pPE neurons (right). PV neuron activity increases in both feedback and playback phases but remains at baseline during mismatch. Responses normalized between −1 and 1 such that baseline is zero. (d) Same as in (c) but PV neurons receive motor predictions only. (e) Same as in (c) but PCs receive no visual input. PV neurons remain at baseline in the absence of visual input to the soma of PCs during feedback and mismatch. (f) Same as in (c) but PCs receive no visual input and PV neurons receive motor predictions only. PV neurons remain at baseline in the absence of visual input to the soma of PCs during feedback and mismatch.

Appendix 2—figure 2.

Balancing excitation and inhibition gives rise to positive prediction-error neurons.

(a) Network model with excitatory PCs and inhibitory PV, SOM and VIP neurons. Connections from PCs not shown for the sake of clarity. Somatic compartment of PCs and VIP receive visual input, apical dendrites of PCs, SOM and PV neurons receive a motor-related prediction thereof. Connections marked with an asterisk undergo experience-dependent plasticity. (b) During plasticity, the network is exposed to a sequence of feedback (coupled sensorimotor experience) and mismatch phases (black square, no visual flow despite motor-related predictions). Stimuli last for 1 s and are alternated with baseline phases (absence of visual input and motor predictions). (c) Left: Before plasticity, somatic excitation (light red) and inhibition (light blue) in PCs are not balanced. Excitatory and inhibitory currents shifted by ±20 pA for visualization. The varying net excitatory current (gray) causes the PC population rate to deviate from baseline. Right: Response relative to baseline () of all PCs in feedback (FB), mismatch (MM) and playback (PB) phase, sorted by amplitude of mismatch response. None of the PCs are classified as pPE neurons. (d) Same as in (c) after plasticity. Somatic excitation and inhibition are balanced. PC population rate fluctuates around baseline. All PCs classified as pPE neurons. Connection strength from VIP onto SOM neurons is set to 0.8.

Figure 1.

Balancing excitation and inhibition gives rise to negative prediction-error neurons.

(a) Network model with excitatory PCs and inhibitory PV, SOM and VIP neurons. Connections from PCs not shown for the sake of clarity. Somatic compartment of PCs, SOM and PV neurons receive visual input, apical dendrites of PCs and VIP neurons receive a motor-related prediction thereof. Connections marked with an asterisk undergo experience-dependent plasticity. (b) During plasticity, the network is exposed to a sequence of feedback (coupled sensorimotor experience) and playback phases (black square, visual input not predicted by motor commands). Stimuli last for 1 s and are alternated with baseline phases (absence of visual input and motor predictions). (c) Left: Before plasticity, somatic excitation (light red) and inhibition (light blue) in PCs are not balanced. Excitatory and inhibitory currents shifted by ±20 pA for visualization. The varying net excitatory current (gray) causes the PC population rate to deviate from baseline. Right: Response relative to baseline () of all PCs in feedback (FB), mismatch (MM) and playback (PB) phase, sorted by amplitude of mismatch response. None of the PCs are classified as nPE neurons (indicated by gray shading to the right). (d) Same as in (c) after plasticity. Somatic excitation and inhibition are balanced. PC population rate remains at baseline. All PCs classified as nPE neurons (also indicated by black shading to the right). (e) Left: Mismatch response increases with the difference between visual and motor input. Right: nPE neuron response during playback does not change with the difference between visual and motor input but remains at baseline.

(a) Network model as in Figure 1. Connections colored and marked with an asterisk undergo experience-dependent plasticity. (b) PCs receive visual input. Connections onto PCs follow an inhibitory plasticity rule akin to Vogels et al., 2011 (triangle). SOM→PV and VIP→PV synapses approximate a back-propagation of error (diamond). The averaged weights converge to a steady-state. Weights are normalized to the theoretically derived values for nPE neurons (see Materials and methods). (c) Same as in (b) but SOM→PV and VIP→PV synapses change in proportion to the difference between the excitatory recurrent drive onto PV neurons and a target value (square). (d) Same as in (b) but visual drive onto PCs is absent. SOM→PV and VIP→PV synapses follow an inhibitory plasticity rule akin to Vogels et al., 2011 (triangle). Connections from PCs onto PV neurons establish a baseline for PV neurons by an anti-Hebbian plasticity rule (inverted triangle) .

(a) Network model as in Figure 1 but without VIP→PV synapses. PV neurons receive visual input. (b) Population response () of PCs in feedback (dark gray) and playback phase (light gray) for varying SOM→PV (top), SOM→VIP (middle) and VIP→SOM (bottom) connections. For all values tested, firing rate during feedback and playback deviates from baseline. (c) Response () of all PCs in feedback, mismatch and playback phase, sorted by amplitude of mismatch response. Most PCs change their firing rate only mildly in feedback and/or playback phase. As indicated by the gray/black shading to the right, many of the PCs are classified as nPE neurons. (d–f) Same as in (a) but PV neurons receive motor predictions. (f) All PCs change their firing rate in response to all stimulation patterns. None of the PCs are classified as nPE neurons (indicated by gray shading to the right). .

Network model, its inputs and the training set are shown in Figure 1. Model setup modified to enable the formation of nPE neurons while abiding to Dale’s law: PCs receive 0.5 x visual input. External excitatory input onto the dendrites is set to 2.8 to mitigate an excess of dendritic inhibition during baseline. Additional non-linearity for synapses from SOM neurons onto the apical dendrites of PCs: , where denotes the total dendritic activity and is a sigmoid function given by . (a) Before plasticity, somatic excitation (light red) and inhibition (light blue) in PCs are not balanced. Excitatory and inhibitory currents are shifted by ±20 pA for visualization. The varying net excitatory current (gray) causes the PC population rate to deviate from baseline. (b) Left: Response () of all PCs in feedback, mismatch and playback phase, sorted by amplitude of mismatch response. All PCs change their firing rate in response to all stimulation patterns. None of the PCs are classified as nPE neurons (indicated by gray shading to the right). Right: Population responses of PV, SOM and VIP neurons in all phases. Responses are normalized between −1 and 1 such that baseline is zero. (c) Same as in (a) after plasticity. Somatic excitation and inhibition are balanced. PC population rate remains at baseline. (d) Same as in (b) after plasticity. Almost all PCs classified as nPE neurons (indicated by black/gray shading to the right). PV neurons are less active during the playback phase than during the feedback phase.

Balancing excitation and inhibition gives rise to negative prediction-error neurons.

(a) Network model with excitatory PCs and inhibitory PV, SOM and VIP neurons. Connections from PCs not shown for the sake of clarity. Somatic compartment of PCs, SOM and PV neurons receive visual input, apical dendrites of PCs and VIP neurons receive a motor-related prediction thereof. Connections marked with an asterisk undergo experience-dependent plasticity. (b) During plasticity, the network is exposed to a sequence of feedback (coupled sensorimotor experience) and playback phases (black square, visual input not predicted by motor commands). Stimuli last for 1 s and are alternated with baseline phases (absence of visual input and motor predictions). (c) Left: Before plasticity, somatic excitation (light red) and inhibition (light blue) in PCs are not balanced. Excitatory and inhibitory currents shifted by ±20 pA for visualization. The varying net excitatory current (gray) causes the PC population rate to deviate from baseline. Right: Response relative to baseline () of all PCs in feedback (FB), mismatch (MM) and playback (PB) phase, sorted by amplitude of mismatch response. None of the PCs are classified as nPE neurons (indicated by gray shading to the right). (d) Same as in (c) after plasticity. Somatic excitation and inhibition are balanced. PC population rate remains at baseline. All PCs classified as nPE neurons (also indicated by black shading to the right). (e) Left: Mismatch response increases with the difference between visual and motor input. Right: nPE neuron response during playback does not change with the difference between visual and motor input but remains at baseline.

Learning prediction-error circuits with different forms of homeostatic plasticity.

(a) Network model as in Figure 1. Connections colored and marked with an asterisk undergo experience-dependent plasticity. (b) PCs receive visual input. Connections onto PCs follow an inhibitory plasticity rule akin to Vogels et al., 2011 (triangle). SOM→PV and VIP→PV synapses approximate a back-propagation of error (diamond). The averaged weights converge to a steady-state. Weights are normalized to the theoretically derived values for nPE neurons (see Materials and methods). (c) Same as in (b) but SOM→PV and VIP→PV synapses change in proportion to the difference between the excitatory recurrent drive onto PV neurons and a target value (square). (d) Same as in (b) but visual drive onto PCs is absent. SOM→PV and VIP→PV synapses follow an inhibitory plasticity rule akin to Vogels et al., 2011 (triangle). Connections from PCs onto PV neurons establish a baseline for PV neurons by an anti-Hebbian plasticity rule (inverted triangle) .

VIP→PV synapses are not required for the formation of nPE neurons.

(a) Network model as in Figure 1 but without VIP→PV synapses. PV neurons receive visual input. (b) Population response () of PCs in feedback (dark gray) and playback phase (light gray) for varying SOM→PV (top), SOMVIP (middle) and VIPSOM (bottom) connections. For all values tested, firing rate during feedback and playback deviates from baseline. (c) Response () of all PCs in feedback, mismatch and playback phase, sorted by amplitude of mismatch response. Most PCs change their firing rate only mildly in feedback and/or playback phase. As indicated by the gray/black shading to the right, many of the PCs are classified as nPE neurons. (d–f) Same as in (a) but PV neurons receive motor predictions. (f) All PCs change their firing rate in response to all stimulation patterns. None of the PCs are classified as nPE neurons (indicated by gray shading to the right). .

Balancing excitation, somatic and dendritic inhibition gives rise to nPE neurons in a model in which an excess of dendritic inhibition is forwarded to the soma.

Network model, its inputs and the training set are shown in Figure 1. Model setup modified to enable the formation of nPE neurons while abiding to Dale’s law: PCs receive 0.5 x visual input. External excitatory input onto the dendrites is set to 2.8 to mitigate an excess of dendritic inhibition during baseline. Additional non-linearity for synapses from SOM neurons onto the apical dendrites of PCs: , where denotes the total dendritic activity and is a sigmoid function given by . (a) Before plasticity, somatic excitation (light red) and inhibition (light blue) in PCs are not balanced. Excitatory and inhibitory currents are shifted by ±20 pA for visualization. The varying net excitatory current (gray) causes the PC population rate to deviate from baseline. (b) Left: Response () of all PCs in feedback, mismatch and playback phase, sorted by amplitude of mismatch response. All PCs change their firing rate in response to all stimulation patterns. None of the PCs are classified as nPE neurons (indicated by gray shading to the right). Right: Population responses of PV, SOM and VIP neurons in all phases. Responses are normalized between −1 and 1 such that baseline is zero. (c) Same as in (a) after plasticity. Somatic excitation and inhibition are balanced. PC population rate remains at baseline. (d) Same as in (b) after plasticity. Almost all PCs classified as nPE neurons (indicated by black/gray shading to the right). PV neurons are less active during the playback phase than during the feedback phase. A subset of inhibitory synapses – chosen based on a mathematical analysis (see Materials and methods, or Appendix 1) – are subject to experience-dependent plasticity, which aims at minimizing deviations of the PC firing rate from a baseline rate. These deviations can be interpreted as prediction errors. Learning hence strives to adapt the inhibitory circuit such as to reduce these errors. While the synapses onto both the somatic and dendritic compartments of PCs follow an inhibitory plasticity rule akin to Vogels et al., 2011, the inhibitory synapses onto PV neurons follow an approximated backpropagation of error rule akin to Rumelhart et al., 1986. Specifically, the former rule changes the synapses onto PCs in proportion to the presynaptic interneuron activity and the deviation of PC activity from a baseline rate (see Materials and methods, Equation 14). The latter rule changes the synapses onto PV neurons in proportion to the presynaptic interneuron activity and the averaged deviation of the postsynaptic PCs from their baseline rate (see Materials and methods, Equation 16). Earlier work has shown that such forms of plasticity establish a balance of excitation and inhibition (Vogels et al., 2011; Mackwood et al., 2020). All neurons in the model receive excitatory background input that ensures reasonable baseline activities in the absence of visual input and motor-related internal predictions (‘baseline’). In addition, we stimulated the network with time-varying external inputs representing actual and predicted visual stimuli (Figure 1a,b). We reasoned that during natural conditions, movements lead to sensory inputs that are fully predicted by internal motor commands (‘feedback phase’, Attinger et al., 2017), while unexpected external changes in the environment should generate unpredicted sensory signals (‘playback phase’, Attinger et al., 2017). Situations in which internal motor commands are not accompanied by corresponding sensory signals should be rare (‘feedback mismatch phase’, Attinger et al., 2017). During plasticity, we therefore stimulated the circuit with a sequence consisting of feedback and playback phases (‘quasi-natural training’, Figure 1b).

Negative prediction-error neurons emerge by balancing excitation and inhibition

Before the onset of plasticity, synaptic connections were randomly initialized, leading to PCs receiving unbalanced excitation and inhibition. Therefore, all PCs change their firing rate in response to both feedback and playback stimuli, indicating the absence of nPE neurons (Figure 1c). During quasi-natural sensorimotor experience, inhibitory plasticity strengthens or weakens inhibitory synapses to diminish the firing rate deviations of PCs from their baseline firing rate (Figure 1—figure supplement 1). At the same time, dendritic inhibition mediated by SOM interneurons was sufficiently strengthened to suppress the motor prediction arriving at the apical dendrite. After synaptic plasticity, somatic excitation and inhibition are balanced on a stimulus-by-stimulus basis (Figure 1d–e). PCs merely show small and transient onset/offset responses to feedback and playback stimuli. In contrast, all PCs show an increase in activity for feedback mismatch stimuli (Figure 1d), which scales with the size of the difference between actual and predicted visual input (Figure 1e). Hence, inhibitory synaptic plasticity generates nPE neurons by balancing excitation and inhibition in PCs for quasi-natural conditions.
Figure 1—figure supplement 1.

Learning prediction-error circuits with different forms of homeostatic plasticity.

(a) Network model as in Figure 1. Connections colored and marked with an asterisk undergo experience-dependent plasticity. (b) PCs receive visual input. Connections onto PCs follow an inhibitory plasticity rule akin to Vogels et al., 2011 (triangle). SOM→PV and VIP→PV synapses approximate a back-propagation of error (diamond). The averaged weights converge to a steady-state. Weights are normalized to the theoretically derived values for nPE neurons (see Materials and methods). (c) Same as in (b) but SOM→PV and VIP→PV synapses change in proportion to the difference between the excitatory recurrent drive onto PV neurons and a target value (square). (d) Same as in (b) but visual drive onto PCs is absent. SOM→PV and VIP→PV synapses follow an inhibitory plasticity rule akin to Vogels et al., 2011 (triangle). Connections from PCs onto PV neurons establish a baseline for PV neurons by an anti-Hebbian plasticity rule (inverted triangle) .

Balance of excitation, inhibition and disinhibition in different functional prediction circuits

The circuit we studied so far was motivated by the widely accepted view that PCs and SOM and PV interneurons show visual responses (Ko et al., 2011; Yang et al., 2013; Larkum, 2013a; Xue et al., 2014; Harris and Shepherd, 2015; Lee et al., 2016; Attinger et al., 2017), while long-range (motor) predictions arrive in the superficial layers of V1 and target VIP neurons (Fu et al., 2014; Harris and Shepherd, 2015; Tremblay et al., 2016; Attinger et al., 2017) and the apical and distal compartments of PCs (Larkum, 2013a; Leinweber et al., 2017; Attinger et al., 2017). Because this view is not uncontested (Fu et al., 2014) and it has been shown that interneuron types can receive both feedforward and feedback inputs (Wall et al., 2016), we systematically varied the inputs to the different neuron classes. We first studied circuit variations in which PCs and PV neurons receive visual and/or motor signals (Figure 2, see also Figure 2—figure supplement 1).
Figure 2.

Multi-pathway balance of excitation and inhibition in different nPE neuron circuits.

(a) Excitatory, inhibitory, disinhibitory and dis-disinhibitory pathways onto PCs that need to be balanced in nPE neuron circuits. Input to the soma of PCs and PV neurons is varied (c–f). SOM neurons receive visual input, VIP neurons receive a motor-related prediction. (b) Test stimuli: Feedback (FB), mismatch (MM) and playback (PB) phases of 1 s each. (c) PCs and PV neurons receive visual input (left, top). When all visual (V) and motor (M) pathways are balanced (left, bottom), PCs act as nPE neurons (right). PV neuron activity increases in both feedback and playback phases. Responses normalized between −1 and 1 such that baseline is zero. (d) Same as in (c) but PV neurons receive motor predictions. (e) Same as in (c) but PCs receive no visual input. PV neurons remain at baseline in the absence of visual input to the soma of PCs. (f) Same as in (c) but PCs receive no visual input and PV neurons receive motor predictions. PV neurons remain at baseline in the absence of visual input to the soma of PCs.

(a) Excitatory, inhibitory, disinhibitory and dis-disinhibitory pathways onto PCs that need to be balanced in nPE neuron circuits. Input to the soma of PCs and PV neurons is varied (c–d). SOM neurons receive visual input, VIP neurons receive a motor-related prediction thereof. (b) Test stimuli: Feedback (FB), mismatch (MM) and playback (PB) phases of 1 s each. (c) PCs receive visual input (left, top). When all visual (V) and motor (M) pathways are balanced (left, bottom), PCs act as nPE neurons (right). PV neuron activity increases in both feedback and playback phases. Responses normalized between −1 and 1 such that baseline is zero. (d) Same as in (c) but PCs receive no visual input. PV neurons remain at baseline in the absence of visual input to the soma of PCs.

Figure 2—figure supplement 1.

Multi-pathway balance of excitation and inhibition in different nPE neuron circuits with both visual and motor input onto PV neurons.

(a) Excitatory, inhibitory, disinhibitory and dis-disinhibitory pathways onto PCs that need to be balanced in nPE neuron circuits. Input to the soma of PCs and PV neurons is varied (c–d). SOM neurons receive visual input, VIP neurons receive a motor-related prediction thereof. (b) Test stimuli: Feedback (FB), mismatch (MM) and playback (PB) phases of 1 s each. (c) PCs receive visual input (left, top). When all visual (V) and motor (M) pathways are balanced (left, bottom), PCs act as nPE neurons (right). PV neuron activity increases in both feedback and playback phases. Responses normalized between −1 and 1 such that baseline is zero. (d) Same as in (c) but PCs receive no visual input. PV neurons remain at baseline in the absence of visual input to the soma of PCs.

Multi-pathway balance of excitation and inhibition in different nPE neuron circuits.

(a) Excitatory, inhibitory, disinhibitory and dis-disinhibitory pathways onto PCs that need to be balanced in nPE neuron circuits. Input to the soma of PCs and PV neurons is varied (c–f). SOM neurons receive visual input, VIP neurons receive a motor-related prediction. (b) Test stimuli: Feedback (FB), mismatch (MM) and playback (PB) phases of 1 s each. (c) PCs and PV neurons receive visual input (left, top). When all visual (V) and motor (M) pathways are balanced (left, bottom), PCs act as nPE neurons (right). PV neuron activity increases in both feedback and playback phases. Responses normalized between −1 and 1 such that baseline is zero. (d) Same as in (c) but PV neurons receive motor predictions. (e) Same as in (c) but PCs receive no visual input. PV neurons remain at baseline in the absence of visual input to the soma of PCs. (f) Same as in (c) but PCs receive no visual input and PV neurons receive motor predictions. PV neurons remain at baseline in the absence of visual input to the soma of PCs.

Multi-pathway balance of excitation and inhibition in different nPE neuron circuits with both visual and motor input onto PV neurons.

(a) Excitatory, inhibitory, disinhibitory and dis-disinhibitory pathways onto PCs that need to be balanced in nPE neuron circuits. Input to the soma of PCs and PV neurons is varied (c–d). SOM neurons receive visual input, VIP neurons receive a motor-related prediction thereof. (b) Test stimuli: Feedback (FB), mismatch (MM) and playback (PB) phases of 1 s each. (c) PCs receive visual input (left, top). When all visual (V) and motor (M) pathways are balanced (left, bottom), PCs act as nPE neurons (right). PV neuron activity increases in both feedback and playback phases. Responses normalized between −1 and 1 such that baseline is zero. (d) Same as in (c) but PCs receive no visual input. PV neurons remain at baseline in the absence of visual input to the soma of PCs. We found that inhibitory plasticity establishes nPE neurons independent of the input configuration onto PCs and PV neurons (Figure 2c–f, right). The emerging connectivity of the interneuron circuits varied, however. For PCs not to respond above baseline in feedback and playback phase, various excitatory, inhibitory, disinhibitory and dis-disinhibitory pathways need to be balanced. An informative example is the input configuration in which PCs receive visual input and PV neurons receive motor predictions (Figure 2d). In this case, visual inputs arrive at the PCs as direct excitation, as disinhibition through the SOM-PV pathway, and as dis-disinhibition via the SOM-VIP-PV pathway (Figure 2a). To keep the PCs at their baseline during the playback phase, these three pathways need to be balanced (Figure 2d, left). Similarly, motor signals arrive at the PCs as inhibition from PV neurons, dis-inhibition via the VIP-PV pathway, dis-dis-inhibition via the VIP-SOM-PV pathway and as direct excitation to the dendrite that is canceled by SOM-mediated inhibition. Again, all these pathways need to be balanced to keep the PCs at their baseline for fully predicted visual stimuli (Figure 2d, left). Analog balancing arguments hold for other input configurations (Figure 2c–f, left). Note that this multi-pathway balance applies primarily to somatic inputs to PCs. During feedback and playback phases, this provides a complete picture, because the dendrites are deactivated by inhibition. During mismatch phases, this dendritic inhibition is withdrawn and the dendrites provide additional excitatory input to the soma that can drive mismatch responses. While the flow of visual and motor information in the learned inhibitory microcircuit is different for different input configurations, the neural responses of the different interneuron classes provide limited information about the input configuration. PV neuron activity reflects whether PCs receive visual input: If PCs receive visual input, PV responses increase during feedback and playback phases to balance the sensory input at the soma of PCs (Figure 2c–d, right). If PCs receive no visual input, PV neurons remain at their baseline firing rate (Figure 2e–f, right), which is in contradiction to the experimentally observed increase of PV neurons during feedback (see Attinger et al., 2017). The activity of SOM and VIP neurons varies between playback, feedback and mismatch phases (in line with experimental results, see Attinger et al., 2017), but is independent of the input configuration for PCs and PV interneurons (Figure 2c–f, right). In summary, inhibitory plasticity can establish functional nPE circuits irrespective of the inputs onto the soma of PCs and PV neurons. Although the underlying circuits vary substantially in the specific balance of pathways, the neural activity patterns only weakly reflect the underlying information flow.

Simulated optogenetic manipulations disambiguate prediction-error circuits

We hypothesized that the need to simultaneously balance several pathways offers a way to disambiguate the different prediction-error circuits by optogenetic manipulations. To test this, we systematically suppressed or activated PV, SOM and VIP interneurons in each input configuration after inhibitory plasticity had established the respective nPE circuit. We found that in our model, such simulated optogenetic experiments are highly informative about the underlying input configuration (Figure 3). For example, PV neuron inactivation changes the response of nPE neurons during feedback, playback and mismatch phases if and only if the PCs receive visual inputs. VIP inactivation renders nPE neurons silent unless PV neurons receive motor predictions, in which case they are transformed into positive prediction-error (pPE) neurons. Since SOM and VIP neurons are mutually inhibiting (see e.g. Pfeffer et al., 2013), the same information can be gained by an over-activation of SOM neurons that effectively silences VIP neurons.
Figure 3.

Simulated optogenetic manipulations of PV, SOM and VIP neurons disambiguate prediction-error circuits.

(a) Left: nPE neuron circuit in which PCs and PV neurons receive visual input. Inactivation (middle) or activation (right) of PV (first row), SOM (second row) or VIP neurons (third row). Optogenetic manipulations change responses of nPE neurons (Ctrl) in feedback, mismatch and playback phases. Responses normalized between −1 and 1 such that baseline is zero. Inactivation input is -. Activation input is . (b) Same as in (a) but PV neurons receive motor-related prediction. (c) Same as in (a) but PCs receive no visual input. (d) Same as in (a) but PCs receive no visual input and PV neurons receive a motor-related prediction.

(a) nPE neuron circuit in which PCs and PV neurons receive visual input. In/activation of PV, SOM or VIP neurons changes net current in nPE neurons in feedback (blue), mismatch (red), playback (orange) and baseline (green) phase. Gray shading denotes currents below rheobase of PC. (b) Same as in (a) but PV neurons receive motor-related prediction. (c) Same as in (a) but PCs receive no visual input. (d) Same as in (a) but PCs receive no visual input and PV neurons receive a motor-related prediction.

Simulated optogenetic manipulations of PV, SOM and VIP neurons disambiguate prediction-error circuits.

(a) Left: nPE neuron circuit in which PCs and PV neurons receive visual input. Inactivation (middle) or activation (right) of PV (first row), SOM (second row) or VIP neurons (third row). Optogenetic manipulations change responses of nPE neurons (Ctrl) in feedback, mismatch and playback phases. Responses normalized between −1 and 1 such that baseline is zero. Inactivation input is -. Activation input is . (b) Same as in (a) but PV neurons receive motor-related prediction. (c) Same as in (a) but PCs receive no visual input. (d) Same as in (a) but PCs receive no visual input and PV neurons receive a motor-related prediction.

Net currents in PCs after in/activation of PV, SOM or VIP neurons elucidate prediction-error circuits.

(a) nPE neuron circuit in which PCs and PV neurons receive visual input. In/activation of PV, SOM or VIP neurons changes net current in nPE neurons in feedback (blue), mismatch (red), playback (orange) and baseline (green) phase. Gray shading denotes currents below rheobase of PC. (b) Same as in (a) but PV neurons receive motor-related prediction. (c) Same as in (a) but PCs receive no visual input. (d) Same as in (a) but PCs receive no visual input and PV neurons receive a motor-related prediction. Changes in neuronal activity due to optogenetic manipulations depend on a variety of factors such as baseline firing rates and saturation effects (Phillips and Hasenstaub, 2016). For instance, while an excess of inhibition is not observable when PCs exhibit vanishingly small baseline activity, it leads to a firing rate decrease otherwise. Moreover, in/activation of interneuron types within a recurrent network may also have ambiguous consequences contingent on potential saturation effects in other cell types. These ambiguities can be partially resolved by measuring currents rather than firing rates, during baseline, feedback, mismatch and playback phases. Indeed, we found that the net currents in PCs after in/activation of PV, SOM or VIP neurons are highly informative about the underlying input configuration (Figure 3—figure supplement 1).
Figure 3—figure supplement 1.

Net currents in PCs after in/activation of PV, SOM or VIP neurons elucidate prediction-error circuits.

(a) nPE neuron circuit in which PCs and PV neurons receive visual input. In/activation of PV, SOM or VIP neurons changes net current in nPE neurons in feedback (blue), mismatch (red), playback (orange) and baseline (green) phase. Gray shading denotes currents below rheobase of PC. (b) Same as in (a) but PV neurons receive motor-related prediction. (c) Same as in (a) but PCs receive no visual input. (d) Same as in (a) but PCs receive no visual input and PV neurons receive a motor-related prediction.

When we compared our results with optogenetic experiments in which SOM or VIP neurons are either inactivated or activated during mismatch or running (Attinger et al., 2017), it shows that a homogeneous input configuration in which all PCs receive visual input while all PV neurons receive a motor-related prediction thereof is unlikely (Figure 3b). All other variants of nPE circuits exhibit mismatch responses during SOM/VIP neuron manipulation that are in line with the ones observed experimentally. However, the responses observed in the feedback phase (when compared with ‘during running’, see Attinger et al., 2017) deviate from all the conditions we simulated, indicating that the interneurons do not receive exclusively sensory or motor inputs, but rather a combination of actual and predicted visual input. In summary, our model predicts that optogenetic experiments may unveil a unique fingerprint for nPE circuits that differ in their inputs onto PCs and PV neurons.

Fraction of nPE neurons is modulated by inputs to SOM and VIP interneurons

In the model considered so far, all PCs developed into nPE neurons during learning, irrespective of the inputs to PCs and PV interneurons. However, nPE neurons represent only a small fraction of neurons in mouse V1 (Keller et al., 2012; Saleem et al., 2013; Zmarz and Keller, 2016; Attinger et al., 2017). Given that in our model, motor predictions arriving at the apical dendrites are canceled by SOM neuron-mediated inhibition, we hypothesized that the fraction of PCs that develop into nPE neurons depends on the distribution of visual and motor input onto SOM and VIP neurons. To test this, we allow neurons of both SOM and VIP populations to receive either visual input or a motor prediction thereof. A fraction f of SOM neurons and a fraction of VIP neurons receive visual input. The remaining SOM and VIP neurons receive a motor-related prediction (Figure 4a). When the majority of SOM neurons receive visual inputs and the majority of VIP neurons receive motor predictions (), all PCs develop into nPE neurons (Figure 4b, left). Reducing the proportion of SOM neurons that receive visual input (and, equivalently, the proportion of VIP neurons that receive the motor prediction), the fraction of nPE neurons decreases (Figure 4b, middle). Non-nPE neurons remain at their baseline in all three phases, show a suppression during mismatch or develop into pPE neurons that respond only during playback. pPE neurons only emerge when the inputs to SOM and VIP neurons are reversed such that most SOM neurons receive motor predictions (Figure 4b, right).
Figure 4.

Fraction of nPE neurons depends on SOM and VIP neuron inputs.

(a) Network model with excitatory PCs and inhibitory PV, SOM and VIP neurons. Connections from PCs not shown for the sake of clarity. Somatic compartment of PCs, PV neurons, a fraction f of SOM neurons and a fraction of VIP neurons receive visual input. The remaining SOM and VIP neurons receive motor predictions. (b) Response relative to baseline () of all PCs in feedback, mismatch and playback phases, sorted by amplitude of mismatch response. The fraction of nPE neurons that develop during learning decreases with f (also indicated by black and gray shading to the right). The increasing fraction of non-nPE neurons comprises neurons that remain at their baseline in all three phases, show a suppression during mismatch or develop into positive prediction-error neurons that respond only during playback.

Fraction of nPE neurons depends on SOM and VIP neuron inputs.

(a) Network model with excitatory PCs and inhibitory PV, SOM and VIP neurons. Connections from PCs not shown for the sake of clarity. Somatic compartment of PCs, PV neurons, a fraction f of SOM neurons and a fraction of VIP neurons receive visual input. The remaining SOM and VIP neurons receive motor predictions. (b) Response relative to baseline () of all PCs in feedback, mismatch and playback phases, sorted by amplitude of mismatch response. The fraction of nPE neurons that develop during learning decreases with f (also indicated by black and gray shading to the right). The increasing fraction of non-nPE neurons comprises neurons that remain at their baseline in all three phases, show a suppression during mismatch or develop into positive prediction-error neurons that respond only during playback. In summary, the fraction of nPE neurons that develop during learning depends on the distribution of visual input and motor predictions onto both SOM and VIP neurons.

Experience-dependence of mismatch and interneuron responses

Attinger et al., 2017 showed that the number of nPE neurons and the strength of their mismatch responses decreases when mice are trained in artificial conditions, during which a mouse was shown the visual information of a different mouse, such that motor predictions and visual flow were uncorrelated (‘non-coupled training’). We reasoned that this training paradigm should include baseline phases where both animals sit still and phases, during which the speeds of the two animals differ. To test whether the model shows the same experience-dependence, we generated a modified training paradigm, which includes baseline phases and phases during which the visual inputs and motor-related predictions are statistically independent (‘random gain training’, Figure 5a). We found that the number of nPE neurons and their mismatch responses also decrease for random gain trained relative to quasi-natural trained networks (Figure 5b). This decrease is primarily due to changes in PCs and PV neurons, while the responses of SOM and VIP neurons during the mismatch phase are largely independent of the training paradigm (Figure 5c). Hence, the experience-dependence of the model circuit is in line with that of nPE neurons in rodent V1 (Attinger et al., 2017).
Figure 5.

Experience-dependence of nPE and PV neurons.

(a) The network is either exposed to a sequence of baseline, feedback and playback phases (quasi-natural training, QT), to baseline phases and phases during which the visual inputs and motor-related predictions are statistically independent (random gain training, RT) or perfectly coupled sensorimotor experience (coupled training, CT) (b) The number of nPE neurons that develop during learning (top) and their mismatch responses (bottom) are smaller for RT than for QT networks. 90% of SOM and 50% of VIP neurons receive visual input. (c) Population response () of PCs, PV, SOM and VIP neurons during mismatch phase. SOM and VIP neurons show the same mismatch response for QT and RT, PCs and PV neurons show stronger responses in QT than in RT. 90% of SOM and 50% of VIP neurons receive visual input. (d) Responses during mismatch (top) and playback (bottom) for QT and CT networks. CT networks can exhibit a decrease in activity during playback phase. Connections from VIP to PV neurons are non-plastic and fixed to −0.3.

(a) During plasticity, the network is exposed to a sequence of feedback phases only, representing perfectly coupled sensorimotor experience. Network model shown in Figure 1. Connections from VIP to PV neurons are non-plastic. (b–c) Model in which an excess of dendritic inhibition does not affect the soma of PCs. Connection strength from VIP to PV neurons fixed to −0.3. (b) Response () of all PCs in feedback, mismatch and playback phase, sorted by amplitude of mismatch response. All PCs increase their activity during mismatch phase but decrease their firing rate during playback phase. The decrease of PC activity during playback is a result of an excess of somatic inhibition mediated by PV neurons. (c) Population responses of PV, SOM and VIP neurons in all phases. Responses normalized between −1 and 1 such that baseline is zero. (d–e) Model in which an excess of dendritic inhibition is forwarded to the soma of PCs. Connection strength from VIP to PV neurons fixed to −0.18. External excitatory input onto the dendrites is set to 2.8 to mitigate an excess of dendritic inhibition during baseline. (d) Same as in (b). The decrease of PC activity during playback is a result of an excess of dendritic inhibition mediated by SOM neurons. (e) Same as in (c). PV neurons are less active during the playback phase than during the feedback phase.

Experience-dependence of nPE and PV neurons.

(a) The network is either exposed to a sequence of baseline, feedback and playback phases (quasi-natural training, QT), to baseline phases and phases during which the visual inputs and motor-related predictions are statistically independent (random gain training, RT) or perfectly coupled sensorimotor experience (coupled training, CT) (b) The number of nPE neurons that develop during learning (top) and their mismatch responses (bottom) are smaller for RT than for QT networks. 90% of SOM and 50% of VIP neurons receive visual input. (c) Population response () of PCs, PV, SOM and VIP neurons during mismatch phase. SOM and VIP neurons show the same mismatch response for QT and RT, PCs and PV neurons show stronger responses in QT than in RT. 90% of SOM and 50% of VIP neurons receive visual input. (d) Responses during mismatch (top) and playback (bottom) for QT and CT networks. CT networks can exhibit a decrease in activity during playback phase. Connections from VIP to PV neurons are non-plastic and fixed to −0.3.

Coupled-trained networks can produce nPE neurons that decrease their activity in playback phase.

(a) During plasticity, the network is exposed to a sequence of feedback phases only, representing perfectly coupled sensorimotor experience. Network model shown in Figure 1. Connections from VIP to PV neurons are non-plastic. (b–c) Model in which an excess of dendritic inhibition does not affect the soma of PCs. Connection strength from VIP to PV neurons fixed to −0.3. (b) Response () of all PCs in feedback, mismatch and playback phase, sorted by amplitude of mismatch response. All PCs increase their activity during mismatch phase but decrease their firing rate during playback phase. The decrease of PC activity during playback is a result of an excess of somatic inhibition mediated by PV neurons. (c) Population responses of PV, SOM and VIP neurons in all phases. Responses normalized between −1 and 1 such that baseline is zero. (d–e) Model in which an excess of dendritic inhibition is forwarded to the soma of PCs. Connection strength from VIP to PV neurons fixed to −0.18. External excitatory input onto the dendrites is set to 2.8 to mitigate an excess of dendritic inhibition during baseline. (d) Same as in (b). The decrease of PC activity during playback is a result of an excess of dendritic inhibition mediated by SOM neurons. (e) Same as in (c). PV neurons are less active during the playback phase than during the feedback phase. During learning, we exposed the network to sensory inputs and motor-related predictions designed to reflect coupled sensorimotor experience. To account for changes in the external world that do not arise from the animal’s own movements, we included ‘playback’ phases in which the visual input is stronger than predicted by the motor-related input. Consistent with the experimental setup of Attinger et al., 2017, we deliberately excluded feedback mismatch phases. In the model, the stimuli experienced during learning have a strong impact on the response structure of the PCs, because the learning rules aim to keep the PCs at a given baseline rate at all times. The inclusion of feedback and playback phases during learning therefore leads to neurons that remain at their baseline during those phases, in line with nPE neurons. In mouse V1, nPE neurons exhibit an average rate decrease during playback when the animals were only exposed to perfectly coupled sensorimotor experience (Attinger et al., 2017). When our network was trained in the same way, we also observed that PCs reduced their firing rate during playback phases (Figure 5d and Figure 5—figure supplement 1). This can be a result of an excess of somatic inhibition, dendritic inhibition or both. The model hence predicts that the rate reduction during playback phases observed by Attinger et al., 2017 vanishes when playback phases are included during training.
Figure 5—figure supplement 1.

Coupled-trained networks can produce nPE neurons that decrease their activity in playback phase.

(a) During plasticity, the network is exposed to a sequence of feedback phases only, representing perfectly coupled sensorimotor experience. Network model shown in Figure 1. Connections from VIP to PV neurons are non-plastic. (b–c) Model in which an excess of dendritic inhibition does not affect the soma of PCs. Connection strength from VIP to PV neurons fixed to −0.3. (b) Response () of all PCs in feedback, mismatch and playback phase, sorted by amplitude of mismatch response. All PCs increase their activity during mismatch phase but decrease their firing rate during playback phase. The decrease of PC activity during playback is a result of an excess of somatic inhibition mediated by PV neurons. (c) Population responses of PV, SOM and VIP neurons in all phases. Responses normalized between −1 and 1 such that baseline is zero. (d–e) Model in which an excess of dendritic inhibition is forwarded to the soma of PCs. Connection strength from VIP to PV neurons fixed to −0.18. External excitatory input onto the dendrites is set to 2.8 to mitigate an excess of dendritic inhibition during baseline. (d) Same as in (b). The decrease of PC activity during playback is a result of an excess of dendritic inhibition mediated by SOM neurons. (e) Same as in (c). PV neurons are less active during the playback phase than during the feedback phase.

nPE circuits can also be learned by biologically plausible learning rules

In our model, nPE neurons developed through inhibitory plasticity that establishes an excitation-inhibition (E/I) balance in PCs. So far, we used learning rules that approximate a backpropagation of error (Rumelhart et al., 1986), which changed SOM→PV and VIP→PV connections such as to minimize the difference between the PC firing rate and a baseline rate (see Equation 16 in Materials and methods). The biological plausibility of such backpropagation rules, which are broadly used in artificial intelligence, is still debated, because they rely on information that is not locally available at the synapse in question (Crick, 1989; Richards and Lillicrap, 2019). We therefore wondered whether prediction-error circuits can also be established by biologically plausible local learning rules. We found that nPE neurons also emerged when the backpropagation rules were replaced by a form of plasticity that changes SOM→PV and VIP→PV synapses in proportion to the difference between the excitatory recurrent drive onto PV neurons and a target value (see Mackwood et al., 2020, and Equations 17 and 18 in Materials and methods). This local form of learning was also able to balance excitation and inhibition sufficiently (Figure 6b,c and Figure 1—figure supplement 1c) so that all PCs developed into nPE neurons (Figure 6c).
Figure 6.

Learning nPE neurons by biologically plausible learning rules.

(a) Left: Network model as in Figure 1. Connections marked with symbols undergo experience-dependent plasticity. Connections onto PCs follow an inhibitory plasticity rule akin to Vogels et al., 2011 (triangle). SOM→PV and VIP→PV synapses change in proportion to the difference between the excitatory recurrent drive onto PV neurons and a target value (square). Right: During plasticity, the network is exposed to a sequence of feedback (coupled sensorimotor experience) and playback phases (black square, visual input not predicted by motor commands). Stimuli last for 1 s and are alternated with baseline phases (absence of visual input and motor predictions). (b) Left: Before plasticity, somatic excitation (light red) and inhibition (light blue) in PCs are not balanced. Excitatory and inhibitory currents shifted by ±20 pA for visualization. The varying net excitatory current (gray) causes the PC population rate to deviate from baseline. Right: Response relative to baseline () of all PCs in feedback, mismatch and playback phases, sorted by amplitude of mismatch response. None of the PCs are classified as nPE neurons (indicated by gray shading to the right). (c) Same as in (b) after plasticity. Somatic excitation and inhibition are balanced. PC population rate remains at baseline. All PCs classified as nPE neurons (also indicated by black shading to the right).

(a) Network model as in Figure 1. Connections marked with symbols undergo experience-dependent plasticity. Inhibitory connections onto PCs and PV neurons follow an inhibitory plasticity rule akin to Vogels et al., 2011 (triangle). Synapses from PCs onto PV neurons follow an anti-Hebbian plasticity rule (inverted triangle). (b) Left: Before plasticity, somatic excitation (light red) and inhibition (light blue) at PCs are not balanced. Excitatory and inhibitory currents are shifted by ±20 pA for visualization. The varying net excitatory current (gray) causes the PC population rate to deviate from baseline. Right: Response relative to baseline () of all PCs in feedback, mismatch and playback phase, sorted by amplitude of mismatch response. None of the PCs are classified as nPE neurons (indicated by gray shading to the right). (c) Same as in (b) after plasticity. Somatic excitation and inhibition are balanced. PC population rate remains at baseline. All PCs classified as nPE neurons (also indicated by black shading to the right).

Learning nPE neurons by biologically plausible learning rules.

(a) Left: Network model as in Figure 1. Connections marked with symbols undergo experience-dependent plasticity. Connections onto PCs follow an inhibitory plasticity rule akin to Vogels et al., 2011 (triangle). SOM→PV and VIP→PV synapses change in proportion to the difference between the excitatory recurrent drive onto PV neurons and a target value (square). Right: During plasticity, the network is exposed to a sequence of feedback (coupled sensorimotor experience) and playback phases (black square, visual input not predicted by motor commands). Stimuli last for 1 s and are alternated with baseline phases (absence of visual input and motor predictions). (b) Left: Before plasticity, somatic excitation (light red) and inhibition (light blue) in PCs are not balanced. Excitatory and inhibitory currents shifted by ±20 pA for visualization. The varying net excitatory current (gray) causes the PC population rate to deviate from baseline. Right: Response relative to baseline () of all PCs in feedback, mismatch and playback phases, sorted by amplitude of mismatch response. None of the PCs are classified as nPE neurons (indicated by gray shading to the right). (c) Same as in (b) after plasticity. Somatic excitation and inhibition are balanced. PC population rate remains at baseline. All PCs classified as nPE neurons (also indicated by black shading to the right).

Learning nPE neurons by biologically plausible learning rules in networks without visual input at the soma of PCs.

(a) Network model as in Figure 1. Connections marked with symbols undergo experience-dependent plasticity. Inhibitory connections onto PCs and PV neurons follow an inhibitory plasticity rule akin to Vogels et al., 2011 (triangle). Synapses from PCs onto PV neurons follow an anti-Hebbian plasticity rule (inverted triangle). (b) Left: Before plasticity, somatic excitation (light red) and inhibition (light blue) at PCs are not balanced. Excitatory and inhibitory currents are shifted by ±20 pA for visualization. The varying net excitatory current (gray) causes the PC population rate to deviate from baseline. Right: Response relative to baseline () of all PCs in feedback, mismatch and playback phase, sorted by amplitude of mismatch response. None of the PCs are classified as nPE neurons (indicated by gray shading to the right). (c) Same as in (b) after plasticity. Somatic excitation and inhibition are balanced. PC population rate remains at baseline. All PCs classified as nPE neurons (also indicated by black shading to the right). The plasticity rules can be further simplified when PCs do not receive visual information. In this case, PV neurons also remain at their baseline firing rate in feedback and playback phases (Figure 2e–f, right). Hence, the strength of SOM→PV and VIP→PV synapses can be learned according to a homeostatic rule (Vogels et al., 2011) that aims to sustain a target rate in the PV neurons (Figure 6—figure supplement 1 and Figure 1—figure supplement 1d, Equations 19 and 20 in Materials and methods). In summary, the backpropagation-like learning rules for the synapses onto PV neurons can be approximated by biologically plausible rules that exploit local information available at the respective synapses.
Figure 6—figure supplement 1.

Learning nPE neurons by biologically plausible learning rules in networks without visual input at the soma of PCs.

(a) Network model as in Figure 1. Connections marked with symbols undergo experience-dependent plasticity. Inhibitory connections onto PCs and PV neurons follow an inhibitory plasticity rule akin to Vogels et al., 2011 (triangle). Synapses from PCs onto PV neurons follow an anti-Hebbian plasticity rule (inverted triangle). (b) Left: Before plasticity, somatic excitation (light red) and inhibition (light blue) at PCs are not balanced. Excitatory and inhibitory currents are shifted by ±20 pA for visualization. The varying net excitatory current (gray) causes the PC population rate to deviate from baseline. Right: Response relative to baseline () of all PCs in feedback, mismatch and playback phase, sorted by amplitude of mismatch response. None of the PCs are classified as nPE neurons (indicated by gray shading to the right). (c) Same as in (b) after plasticity. Somatic excitation and inhibition are balanced. PC population rate remains at baseline. All PCs classified as nPE neurons (also indicated by black shading to the right).

Discussion

How the nervous system disentangles self-generated and external sensory stimuli is a long-standing question (Bell, 1981; Franklin and Wolpert, 2011; Keller and Mrsic-Flogel, 2018). Here, we investigated the circuit level mechanisms that underlie the computation of negative prediction errors and how different types of inhibitory neurons shape these prediction circuits. We used computational modeling to show that nPE neurons can be learned by balancing excitation and inhibition in cortical microcircuits with three types of interneurons. We show that the required E/I balance can be achieved by biologically plausible forms of synaptic plasticity. Furthermore, the experience-dependence of the circuit is similar to that of nPE circuits in mouse V1 (Attinger et al., 2017). Our model makes a number of predictions. Firstly, the multi-pathway balance of excitation and inhibition suggests that the input configuration of the prediction circuit could be disambiguated using cell type-specific modulations of neural activity. This could be achieved by optogenetic or pharmacogenetic manipulations, or by exploiting the differential sensitivity of interneuron classes to neuromodulators. The precarious nature of an exact multi-pathway balance also suggests that nPE neurons might change their response characteristics in a context-dependent way, for example by neuromodulatory effects. Secondly, the central assumption of the model is that nPE neurons emerge by a self-organized E/I balance during sensorimotor experience. It therefore predicts that (i) sensorimotor experience an animal is habituated to should lead to balanced excitation and inhibition in PCs, (ii) E/I balance should break for sensorimotor experience the animal has rarely encountered, for example for mismatches of sensory stimuli and motor predictions and (iii) during altered sensorimotor experience in a virtual reality setting or when the excitability of specific interneuron types is altered, interneuron circuits should gradually reconfigure to reestablish the E/I balance. PCs in L2/3 of V1 have very low spontaneous firing rates (Polack et al., 2013; Xue et al., 2014). A potential rate decrease during feedback and playback could hence be hard to detect. Whether the low response of nPE neurons during feedback and playback phases are due to an E/I balance – as suggested here – or due to an excess of inhibition may hence be difficult to decide, and could for example be resolved by intracellular recordings (Jordan and Keller, 2020). We used a mathematical analysis to derive constraints imposed on an interneuron circuit by the presence of nPE neurons. In particular, the equations unveiled the relation between the strength of a number of inhibitory synapses, describing a multi-pathway E/I balance in a network comprising PV, SOM and VIP neurons (see Materials and methods, Equations 8, 9). However, we also performed an extensive analysis of different subnetworks, to elucidate under which conditions nPE neurons can emerge (see Appendix 1). By comparing nPE circuits with less cell types, a set of common principles can be extracted (see Appendix 1 for a detailed description): (I) SOM neurons must be present to balance feedback predictions at the dendrites of PCs. (II) SOM neurons must receive visual input unless both PV and VIP neurons are present as well. (III) The connections onto the dendrites must undergo experience-dependent plasticity. (IV) PV neurons must be present when PCs receive visual input in their somatic compartment. (V) Dendritic non-linearities are necessary except for a small set of networks, in which all interneuron types are present and specific constraints for the input configuration apply. While a minimal model that allows nPE neurons to develop comprises SOM neurons and PCs (Attinger et al., 2017), the network with three inhibitory neuron types appears the most likely nPE circuit given what is currently known about rodent V1. The interneuron circuit in our model is motivated by the canonical circuit found in a variety of brain regions (Pfeffer et al., 2013; Lee et al., 2013; Jiang et al., 2015). In addition to the connections between interneuron classes that are frequently reported as strong and numerous, we included VIP→PV synapses in the circuit, because a mathematical analysis reveals that they are required for a perfect E/I balance during both feedback and playback phases (see Appendix 1). While VIP→PV synapses have been found in visual (Pfeffer et al., 2013), auditory (Pi et al., 2013), somatosensory (Hioki et al., 2013; Lee et al., 2013) and medial prefrontal cortex (Pi et al., 2013), as well as amygdala (Krabbe et al., 2019), they are less prominent and often weaker than SOM→PV connections (but see Krabbe et al., 2019). VIP→PV synapses can be excluded when the conditions for nPE neurons during feedback and playback phases are mildly relaxed (Keller and Hahnloser, 2009; Keller et al., 2012; Attinger et al., 2017) and when PV neurons receive visual, but not motor inputs (Figure 1—figure supplement 2).
Figure 1—figure supplement 2.

VIP→PV synapses are not required for the formation of nPE neurons.

(a) Network model as in Figure 1 but without VIP→PV synapses. PV neurons receive visual input. (b) Population response () of PCs in feedback (dark gray) and playback phase (light gray) for varying SOM→PV (top), SOM→VIP (middle) and VIP→SOM (bottom) connections. For all values tested, firing rate during feedback and playback deviates from baseline. (c) Response () of all PCs in feedback, mismatch and playback phase, sorted by amplitude of mismatch response. Most PCs change their firing rate only mildly in feedback and/or playback phase. As indicated by the gray/black shading to the right, many of the PCs are classified as nPE neurons. (d–f) Same as in (a) but PV neurons receive motor predictions. (f) All PCs change their firing rate in response to all stimulation patterns. None of the PCs are classified as nPE neurons (indicated by gray shading to the right). .

Cortical circuits are complex and contain a large variety of interneuron classes (Rudy et al., 2011; Jiang et al., 2015; Tremblay et al., 2016). We restricted the model to three of these classes: PV, SOM and VIP neurons. It is conceivable that several other interneuron types can play a pivotal role in prediction-error circuits. The dendrites of layer 2/3 neurons reach out to layer 1, the major target for feedback connections (Felleman and Van Essen, 1991; Cauller, 1995; Larkum, 2013a) and home to a number of distinct interneuron types (Larkum, 2013b; Schuman et al., 2019), which may contribute to associative learning (Abs et al., 2018). In particular, NDNF neurons unspecifically inhibit apical dendrites located in the superficial layers, and at the same time receive strong inhibition from SOM neurons (Abs et al., 2018). Hence, it is possible that these interneurons also shape the processing of feedback information, including the computation of prediction errors. Our analysis revealed a number of synapses in the circuit that undergo experience-dependent changes. While the synapses from PV neurons onto PCs established a baseline firing rate in the absence of visual input and motor predictions, the synergy between the SOM→PV, VIP→PV and SOM→PC synapses guaranteed that the baseline is retained in feedback and playback phase. The multi-pathway balance of excitation and inhibition could also be achieved by synaptic plasticity in other inhibitory synapses – for example the mutual inhibition between SOM and VIP neurons. However, the assumption that mainly the inhibitory synapses onto PV neurons are plastic is supported by the observation that PV neuron activity – in contrast to SOM and VIP neuron activity – is experience-dependent (Attinger et al., 2017). In our circuit, the bottom-up and top-down connections conveying actual and predicted visual input, respectively, were non-plastic. However, this modeling choice is not a pre-requisite for the formation of nPE neurons and can be relaxed. As a matter of fact, nPE neurons can also develop in a network, in which the excitatory top-down and bottom-up connections undergo experience-dependent plasticity that balances excitation and inhibition in somatic and dendritic compartments of PCs. For instance, nPE neurons can also be learned by endowing the top-down and bottom-up connections onto PCs and PV neurons with similar plasticity rules described here. Restricting plasticity to the excitatory connections onto PCs would, however, require all inhibitory interneurons to exclusively receive visual input, suggesting that excitatory bottom-up/top-down connections onto interneurons must also change in an activity-dependent manner. In the model, the plastic inhibitory synapses onto PV neurons change according to non-local information that might not be directly available at the synapse. These synapses therefore implement an approximation of a backpropagation of error, the biological plausibility of which is debated (Crick, 1989). We showed that this plasticity rule can be approximated by biologically plausible variants of the plasticity rules. If PCs do not receive direct visual input (Figure 6—figure supplement 1), the backpropagation-like algorithm can be replaced by a simple homeostatic Hebbian plasticity rule in the synapses onto the PV interneurons. Given that PCs in V1 are known to receive substantial visual drive (Yang et al., 2013; Xue et al., 2014), this assumption is unlikely to be valid. We therefore propose an alternative form of plasticity that changes SOM→PV and VIP→PV synapses in proportion to the difference between the excitatory recurrent drive onto PV neurons and a target value (Mackwood et al., 2020, see Figure 6). The underlying mechanism is similar to feedback alignment (Lillicrap et al., 2016) and requires sufficient overlap between the set of postsynaptic PCs a PV neuron inhibits and the set of presynaptic PCs the same PV neuron receives excitation from. This is likely, given the high connection probability between PCs and PV neurons (Pfeffer et al., 2013; Pala and Petersen, 2015; Jiang et al., 2015). Given that the main goal of the present paper was to show that PE circuits can be learned by balancing excitation and inhibition, we used the plasticity rule implementing a backpropagation of error, to ensure maximal generality. We modeled the apical dendrite of PCs as a single compartment that integrates excitatory and inhibitory input currents and has the potential to produce calcium spike-like events (Yuste et al., 1994; Larkum et al., 1999; Murayama et al., 2009; Hertäg and Sprekeler, 2019). Moreover, we assumed that an overshoot of inhibition decouples the apical tuft of the PCs from their soma, by including a rectifying non-linearity that precludes an excess of dendritic inhibition to influence somatic activity. However, the presence or nature of these dendritic nonlinearities has a minor influence on the development of nPE neurons (Figure 1—figure supplement 3). When we allowed dendritic inhibition to influence the soma, inhibitory plasticity still established nPE neurons, although the learned interneuron circuit differs with respect to the synaptic strengths. The additional dendritic inhibition reduces the required amount of somatic, PV-mediated inhibition. This is primarily the case during playback phases, when the excitatory motor input to the apical dendrite is absent. PV neurons are therefore less active during the playback phase than during the feedback phase (Figure 1—figure supplement 3), consistent with recordings in mouse V1 (Attinger et al., 2017).
Figure 1—figure supplement 3.

Balancing excitation, somatic and dendritic inhibition gives rise to nPE neurons in a model in which an excess of dendritic inhibition is forwarded to the soma.

Network model, its inputs and the training set are shown in Figure 1. Model setup modified to enable the formation of nPE neurons while abiding to Dale’s law: PCs receive 0.5 x visual input. External excitatory input onto the dendrites is set to 2.8 to mitigate an excess of dendritic inhibition during baseline. Additional non-linearity for synapses from SOM neurons onto the apical dendrites of PCs: , where denotes the total dendritic activity and is a sigmoid function given by . (a) Before plasticity, somatic excitation (light red) and inhibition (light blue) in PCs are not balanced. Excitatory and inhibitory currents are shifted by ±20 pA for visualization. The varying net excitatory current (gray) causes the PC population rate to deviate from baseline. (b) Left: Response () of all PCs in feedback, mismatch and playback phase, sorted by amplitude of mismatch response. All PCs change their firing rate in response to all stimulation patterns. None of the PCs are classified as nPE neurons (indicated by gray shading to the right). Right: Population responses of PV, SOM and VIP neurons in all phases. Responses are normalized between −1 and 1 such that baseline is zero. (c) Same as in (a) after plasticity. Somatic excitation and inhibition are balanced. PC population rate remains at baseline. (d) Same as in (b) after plasticity. Almost all PCs classified as nPE neurons (indicated by black/gray shading to the right). PV neurons are less active during the playback phase than during the feedback phase.

By modeling the apical dendrite as a single compartment, we also neglected the possibility that dendritic branches process distinct information. However, we expect that the suggested framework of generating predictive signals by a compartment-specific E/I balance generalizes to more complex dendritic configurations, in which local inhibition could contribute by gating different dendritic inputs (Yang et al., 2016). A hallmark of neurons in sensory areas is their pronounced feature selectivity (Cardin et al., 2007; Niell and Stryker, 2008; Harris and Mrsic-Flogel, 2013). This selectivity is also present in nPE neurons in layer 2/3 of rodent V1 which preferentially signal mismatches in a particular location of the visual field (Zmarz and Keller, 2016). Here, we did not include feature selectivity, but only modeled one-dimensional input signals representing actual or expected visual input. However, we expect that nPE neurons can also develop in networks in which excitatory neurons are equipped with feature selectivity and receive multi-dimensional inputs, by the same plasticity rules described here. We conjecture that the presence of feature selectivity imposes further constraints on the network, for instance, regarding feature topography or interneuron tuning properties. For future work, it would be interesting to study how the presence of feature-selective PE neurons constrains the feature selectivity in interneurons that tend to be more broadly tuned than excitatory neurons (Sohya et al., 2007; Cardin et al., 2007; Kerlin et al., 2010; Atallah et al., 2012). In our model, the excitatory recurrent connections target the apical dendrites of PCs, but given that PCs comprise a homogeneous population, they serve no specific computational purpose in the present context. We expect that this would change if the neurons in the circuit were endowed with stimulus selectivity. For instance, the predictive coding model by Boerlin et al., 2013 assumes separate recurrent loops for coding and computation (see also Denève and Machens, 2016). In this model, the membrane potential represents a prediction error and occasional spiking serves the purpose of reducing a potential mismatch by initiating fast inhibition. The excitatory neurons receive feedforward inputs, fast feedback inhibition and slow excitatory recurrent connections. While the fast inhibitory loop balances the excitatory feedforward and the slow feedback inputs, the slower loop – combined with dendritic nonlinearities – allows for nonlinear computations of the delayed represented variable (Denève and Machens, 2016). It will be interesting to study how this line of work is related to the PE circuit model we studied here, but it would require to extend the present model to perform richer computations, for example by endowing it with stimulus selectivity. Here, we have mainly focused on the development of nPE neurons because those have been studied extensively in layer 2/3 of rodent V1, which allowed us to qualitatively compare our model with experimental findings. In contrast, to the best of our knowledge, less is known for pPE neurons in the visual system. Moreover, as we assume that excitatory neurons aim to establish an E/I balance for all stimuli they are regularly exposed to, and as animals experience episodes, in which the change of visual input is only caused by external factors (playback phases), excitatory neurons are more likely to develop into nPE than pPE neurons in the sensorimotor paradigm used here. However, it can be assumed that under different circumstances pPE neurons do play an equally important role in the processing of information. We expect that the same principles and approaches described here also hold for the formation of pPE neurons. Indeed, when a network, in which SOM neurons receive motor-related input and VIP neurons receive visual input, is exposed to baseline, feedback and mismatch phases, pPE neurons develop (see Appendix 2—figure 2). The inhibitory plasticity establishes pPE neurons independent of the input configuration onto PCs and PV neurons as long as various excitatory, inhibitory, disinhibitory and dis-disinhibitory pathways can be balanced (see Appendix 2—figure 1, Equations 50 and 51). In the present work, we derived the constraints for separate nPE and pPE neurons and did not study the parallel development of both in the same neural network. While the formation of nPE neurons requires SOM neurons to receive visual input, the formation of pPE neurons requires SOM neurons to receive a motor-related prediction thereof. Given that SOM neurons constitute a heterogeneous population (Jiang et al., 2015; Tremblay et al., 2016; Urban-Ciecko and Barth, 2016), it is conceivable that separate sub-circuits enable the parallel existence of nPE and pPE neurons. However, we expect that the presence of both PE types requires refined constraints on the interneuron circuit and plasticity rules. For instance, the formation of nPE and pPE neurons that only increase their activity in mismatch and playback phases, respectively, while remaining at baseline otherwise, introduces constraints for all three phases. Hence, the network must be exposed to all input phases during learning. In the present framework, this would most likely produce excitatory neurons that remain at their baseline in all phases and hence do not encode prediction errors at all. Hence, the plasticity rules must be modified such that they incorporate gating signals that restrict learning to a subset of input phases or a subset of synapses, for example by controlling the learning rates. It has been argued that specific neuromodulators that are linked to self-motion may guide plasticity in prediction-error circuits (Keller and Mrsic-Flogel, 2018). For example, neuromodulators could restrict learning to feedback phases. In this case, excitatory neurons would show deviations from baseline during both playback and mismatch phases, that is essentially all neurons would encode both positive and negative prediction errors. A dichotomy of nPE and pPE neurons could result from low baseline firing rates. A thorough investigation of these scenarios for the simultaneous development of nPE and pPE neurons is, however, beyond the scope of the present study. Our model suggests a well-orchestrated division of labor of PV, SOM and VIP interneurons that is shaped by experience: While PV neurons balance the sensory input at the somatic compartment of PCs, SOM neurons cancel feedback signals at the apical dendrites. VIP neurons ensure sufficiently large mismatch responses by amplifying small differences between feedforward and feedback inputs (Attinger et al., 2017; Hertäg and Sprekeler, 2019). Given the relative uniformity of cortex in its appearance, structure and cell types (Douglas et al., 1989; Mountcastle, 1997), it is conceivable that the same principles also hold for other regions of the cortex beyond V1. Shedding light on the mechanisms that constitute the predictive power of neuronal circuits may in the long run contribute to an understanding of psychiatric disorders that have long been associated with a malfunction of the brain’s prediction machinery (Fletcher and Frith, 2009; Corlett et al., 2009; Sinha et al., 2014; Lawson et al., 2017) and specific types of interneurons (Marín, 2012; Hattori et al., 2017; Batista-Brito et al., 2018).

Materials and methods

Network model

We simulated a rate-based network model of excitatory pyramidal cells ( = 70) and inhibitory PV, SOM and VIP neurons ( = 10). All neurons are randomly connected with connection strengths and probabilities given below (see ‘Connectivity’). The excitatory pyramidal cells are described by a two-compartment rate model that was introduced by Murayama et al., 2009. The dynamics of the firing rate of the somatic compartment of neuron i obeyswhere denotes the excitatory rate time constant ( = 60 ms), terms the rheobase of the neuron (). Firing rates are rectified to ensure positivity. is the total somatic input generated by somatic and dendritic synaptic events and potential dendritic calcium spikes: Here, the function is a rectifying nonlinearity that prohibits an excess of inhibition at the apical dendrite to reach the soma. and are the total synaptic inputs into dendrite and soma, respectively, and denotes a dendritic calcium event. and are the fractions of ‘currents’ leaking away from dendrites and soma, respectively (=0.27, =0.31). The synaptic input to the soma is given by the sum of external sensory inputs and PV neuron-induced (P) inhibition, The dendritic input is the sum of motor-related predictions , the recurrent, excitatory connections from other PCs and SOM neuron-induced (S) inhibition: The weight matrices , and denote the strength of connection between PV neurons and the soma of PCs (), SOM neurons and the dendrites of PCs () and the recurrence between PCs (), respectively. The input generated by a calcium spike is given bywhere c scales the amount of current produced (), is the Heaviside step function, represents a threshold that describes the minimal input needed to produce a Ca2+-spike () and denotes the total, synaptically generated input in the dendrites, Note that we incorporated the gain factor present in Murayama et al., 2009 into the parameters to achieve unit consistency for all neuron types (when we compared excitatory/inhibitory currents, the respective activities were divided by this gain factor, (pA· s)−1). The firing rate dynamics of each interneuron is modeled by a rectified, linear differential equation (Wilson and Cowan, 1972),where denotes the firing rate of neuron i from neuron type () and represents external inputs. The weight matrices denote the strength of connection between the presynaptic neuron population Y and the postsynaptic neuron population X. The rate time constant was chosen to resemble a fast GABAA time constant, and set to 2 ms for all interneuron types included.

Negative prediction-error neurons

We define PCs as nPE neurons when they exclusively increase their firing rate during feedback mismatch (visual input smaller than predicted), while remaining at their baseline during feedback and playback phases. In a linearized, homogeneous network and under the assumption that the apical dendrites are sufficiently inhibited during feedback and playback phase, this definition is equivalent to two constraints on the interneuron network (see Appendix 1 for a detailed analysis and derivation): The parameters indicate whether neuron type X receives visual and motor-related inputs, respectively, and control the different input configurations. In addition to the conditions Equations 8 and 9, the synapses from SOM neurons onto the apical dendrites must be sufficiently strong to cancel potential excitatory inputs during feedback and playback phase. In practice, we classify PCs as nPE neurons when is larger than 20% in the mismatch phase and less than ±10% elsewhere (, : baseline firing rate). Tolerating small deviations in feedback and playback phase is more in line with experimental approaches. The results do not rely on the precise thresholds used for the classification.

Connectivity

All neurons are randomly connected with connection probabilities motivated by the experimental literature (Fino and Yuste, 2011; Packer and Yuste, 2011; Pfeffer et al., 2013; Lee et al., 2013; Pi et al., 2013; Jiang et al., 2015; Jouhanneau et al., 2015; Pala and Petersen, 2015), All cells of the same neuron type have the same number of incoming connections. The mean connection strengths are given bywhere the symbol * denotes weights that vary between simulations (e.g., subject to plasticity or computed from the Equations 8 and 9). For non-plastic networks, these synaptic strengths are given by , , , (if PCs receive visual input) or (if PCs receive no visual input), and are computed from the Equations 8 and 9. For plastic networks, the initial connections between neurons are drawn from uniform distributions where w denotes the mean connection strengths given in (Equation 11) and , , (if PCs receive visual input) or (if PCs receive no visual input), (if PCs receive visual input) or (if PCs receive no visual input), and . Please note that the system is robust to the choice of connections strengths. The connection strengths are merely chosen such that the solutions of Equations 8 and 9 comply with Dale’s principle. All weights are scaled in proportion to the number of existing connections (i.e., the product of the number of presynaptic neurons and the connection probability), so that the results are independent of the population size.

Inputs

All neurons receive constant, external background input that ensures reasonable baseline firing rates in the absence of visual and motor-related input. In the case of non-plastic networks, these inputs were set such that the baseline firing rates are , , and . In the case of plastic networks, we set the external inputs to , , , and (if not stated otherwise). In addition to the external background inputs, the neurons receive either visual input (v), a motor-related prediction thereof (m) or both. In line with the experimental setup of Attinger et al., 2017, we distinguish between baseline (), feedback (), feedback mismatch () and playback () phases. During training, the network is exposed to feedback and playback phases with stimuli drawn from a uniform distribution from the interval . After learning, the strength of stimuli is set to (plastic networks) or (non-plastic networks).

Plasticity

In plastic networks, a number of connections between neurons are subject to experience-dependent changes in order to establish an E/I balance for PCs. PV→PC and the PC→PV synapses establish the target firing rates for PCs and PV neurons, respectively. VIP→PV and SOM→PV synapses and the synapses from SOM neurons onto the apical dendrites of PCs ensure that PCs remain at their baseline during feedback and playback phase. The corresponding plasticity rules are of the form

Connections onto PCs

In detail, the connections from PV and SOM neurons onto the soma and the apical dendrites, respectively, obey inhibitory Hebbian plasticity rules akin to Vogels et al., 2011 The parameter denotes the baseline firing rate of the postsynaptic PC, and the dendritic activity is given by the rectified synaptic events at the dendrites The small ‘correction’ term eases the effect of strong onset responses (here, we used ).

Connections onto PV neurons - non-local learning

The connections from both SOM and VIP neurons onto PV neurons implement an approximation of a backpropagation of error denotes the set of postsynaptic PCs a particular PV neuron is connected to, and is the number of excitatory neurons in .

Connections onto PV neurons - local approximation to backpropagation of error

When the connection probability between PCs and PV neurons is large, this backpropagation of error can be replaced by a biologically plausible learning rule that only relies on local information available in the PV neurons (Figure 6),where denotes the difference between the excitatory recurrent drive onto PV neuron i and a target value denotes the set of presynaptic PCs a particular PV neuron receives excitation from.

Connections onto PV neurons - learning with a homeostatic firing rate for PV neurons

When nPE neurons do not receive direct visual input, the backpropagation rules can be simplified even further (Figure 6—figure supplement 1). The synapses onto PV neurons can be learned according to a Hebbian inhibitory plasticity rule (Vogels et al., 2011) that aims to sustain a baseline rate in the PV neuronswith . This baseline rate is established by modifying the connections from PCs onto PV neurons according to an anti-Hebbian plasticity rule

Simulation and code availability

All simulations were performed in customized Python code written by LH. Differential equations were numerically integrated using a 2nd-order Runge-Kutta method with time steps between 0.05 and 2 ms. Neurons were initialized with . Source code and data for all figures will be available after publication at Hertäg, 2020 (https://github.com/sprekelerlab/SourceCode_Hertaeg20, copy archived at https://github.com/elifesciences-publications/SourceCode_Hertaeg20). In the interests of transparency, eLife publishes the most substantive revision requests and the accompanying author responses. Acceptance summary: This manuscript explores an excitatory-inhibitory network model to determine the circuit and plasticity mechanisms underlying the generation of neurons coding for prediction errors in the visual cortex. The manuscript demonstrates that negative prediction errors arise naturally if synaptic plasticity acts to produce homeostatic excitation-inhibition balance when the circuit receives both visual inputs and internal predictions thereof. Remarkably, the emergence of prediction error neurons depends on experience of the circuit with coupled visuomotor input, reflecting recent experimental results, and the model provides direction for future optogenetic experiments to disentangle which cell types are targeted by visual or motor input. Decision letter after peer review: Thank you for submitting your article "Learning prediction error neurons in a canonical interneuron circuit" for consideration by eLife. Your article has been reviewed by three peer reviewers, including Srdjan Ostojic as the Reviewing Editor, and the evaluation has been overseen by Richard Ivry as the Senior Editor. The reviewers have discussed the reviews with one another and the Reviewing Editor has drafted this decision to help you prepare a revised submission. This manuscript models a canonical cortical microcircuit in order to investigate the circuit configurations and plasticity underlying one type of sensorimotor mismatch response identified in visual cortex. The authors show that negative prediction error neurons emerge from the network via plasticity in a set of inhibitory synapses that acts to minimize evoked firing of pyramidal cells. Moreover, the emergence of these response properties is robust to the particular cellular targets of visual and motor input (for PVs and PCs), while the type of prediction error neuron that emerges (positive versus negative) is dependent on how motor and visual inputs target VIPs and SOMs. Very nicely, the emergence of prediction error neurons depends on experience of the circuit with coupled visuomotor input (reflecting recent experimental results), and the model provides direction for future optogenetic experiments to disentangle which cell types are targeted by visual or motor input. The general topic of microcircuits for predictive coding is very interesting, in direct relationship with experimental work. The manuscript is well written and thorough, and the topic is timely. All reviewers were generally supportive, but some concerns were raised and need to be addressed in the revision. Essential revisions: 1) The paper focuses on a very homogeneous coding of negative prediction errors, while positive prediction errors appear only in one figure. The authors don't justify their focus on the former over the latter. It would therefore be important to include better the variety of prediction error coding in the model, and discuss its functional implications. Wouldn't it be possible and interesting to compare the distribution of different types of neurons (nPE, pPE, others) between the model and the data? The manuscript states that "nPE neurons represent only a small fraction of neurons in mouse V1". This undercuts somewhat the previous sections, and calls for a more quantitative comparison, in particular with respect to a distribution that would be obtained from random wiring for instance. The predictions for optogenetic inactivations in Figure 3 are very nice, but how do they extend to the heterogeneous case with pPE and other neuron classes (Figure 4) ? Can Equations 8-9 be extended to pPE neurons? 2) In the model, every neuron receives one or two scalar signals representing a one-dimensional visual input and corresponding motor efference copy. The authors should describe how their results would generalize to a situation in which neurons have different selectivities. In particular, there are observations that receptive-field size differs across cell types. More generally, could the circuit the authors describe generalize to neurons with "mixed selectivity" to multiple variables? 3) The paper heavily focuses on a 4 cell-type motif (Pyr, PV, SOM, VIP), but does not provide a clear message on what the functional importance of this motif is. Could PE and nPE neurons emerge in circuits with only 2 or 3 cell types? If so, what computational benefits does the presence of other cell types provide? Along similar lines, how stringent are the conditions on synaptic weights to generate nPE (or pPE) neurons ? A comparison between the number of constraints and the number of variables would be interesting to determine how large the space of solutions is. 4) What is the role of recurrent connections In the model? The argument in terms of pathways (Figure 2) does not take loops into account. The approach taken here should be compared with predictive coding of Deneve and Machens, which relies on recurrent connection. 5) A key point of the paper is the plasticity rule operative at SOM->PV synapses. In the first part of the paper, the authors use a non-local, backpropagation-like rule, and then later show that under certain circumstances this assumption can be relaxed. The fact that the initial learning rule is based on backpropagation becomes apparent only in the first paragraph of the Results, and the reader needs to dig through the Materials and methods and appendix to understand the details of the plasticity rule that is being used throughout the main text. The learning rules need to be explained earlier in the manuscript, e.g. by including, Equations 16, 17, and 19 of the Materials and methods in the main text. We also suggest reorganising the corresponding part of the Materials and methods to clearly separate the two different types of learning used at different points in the Results. More generally, during the discussion of biological plausible plasticity, the reader is left asking why the authors didn't restrict to such plasticity from the beginning. Are the results worsened when using more realistic rules? Essential revisions: 1) The paper focuses on a very homogeneous coding of negative prediction errors, while positive prediction errors appear only in one figure. The authors don't justify their focus on the former over the latter. It would therefore be important to include better the variety of prediction error coding in the model, and discuss its functional implications. Wouldn't it be possible and interesting to compare the distribution of different types of neurons (nPE, pPE, others) between the model and the data? The manuscript states that "nPE neurons represent only a small fraction of neurons in mouse V1". This undercuts somewhat the previous sections, and calls for a more quantitative comparison, in particular with respect to a distribution that would be obtained from random wiring for instance. The predictions for optogenetic inactivations in Figure 3 are very nice, but how do they extend to the heterogeneous case with pPE and other neuron classes (Figure 4) ? Can Equations 8-9 be extended to pPE neurons? “The paper focuses on a very homogeneous coding of negative prediction errors, while positive prediction errors appear only in one figure. The authors don't justify their focus on the former over the latter.” We do agree that our focus on nPE neurons should be justified more clearly in the main text. There are several reasons why we decided to mainly look at nPE neurons: First of all, the study was motivated and inspired by experimental work on mismatch neurons (aka nPE neurons) that we wanted to compare our results with. Secondly, the vast majority of experimental findings on PE neurons is for nPE neurons (to the best of our knowledge). Thirdly, we assume that in the given context of visuomotor experience, nPE neurons should be the primary PE neurons. This is because we assume that an animal naturally learns over time that when it moves forward the world moves backwards (visual flow changes). At the same time the visual input can also change when the animal doesn’t move (e.g. caused by the movements of other animals in their environment). In contrast, phases in which self-movement does not cause changes in visual input are true mismatches and PE neurons should indicate this. As we assume that the experience a network is exposed to determines the nature of the PE neuron, this would indicate that the majority of PE neurons are indeed nPE neurons (in this simplified setting we studied). To emphasize our choice, we have changed the beginning of the Results section: “We studied a rate-based network model of layer 2/3 of rodent V1 to investigate how prediction-error (PE) neurons develop. In the following, we will focus primarily on negative prediction-error (nPE) neurons. […] However, the same approaches and principles derived for nPE neurons can also be applied to positive prediction-error (pPE) neurons.” and the Discussion: “Here, we have mainly focused on the development of nPE neurons because those have been studied extensively in layer 2/3 of rodent V1, which allowed us to qualitatively compare our model with experimental findings. […] Moreover, as we assume that excitatory neurons aim to establish an E/I balance for all stimuli they are regularly exposed to, and as animals experience episodes, in which the change of visual input is only caused by external factors (playback phases), excitatory neurons are more likely to develop into nPE than pPE neurons in the sensorimotor paradigm used here.” “Wouldn't it be possible and interesting to compare the distribution of different types of neurons (nPE, pPE, others) between the model and the data? Can Equations 8-9 be extended to pPE neurons?” Yes, it would indeed be interesting to compare the constraints for nPE and pPE neurons. Even though we assume that nPE neurons should play the major role in the visuomotor paradigm we study, the existence of pPE neurons in other contexts (e.g., reward coding) is undenied. Therefore, it would be interesting to formally derive and compare constraints on the interneuron circuit. We therefore now include mathematical constraints for pPE neurons (similar to Equation 8 and 9, see Appendix 2) and first simulation results for different pPE circuits (analog to Figure 2, see Appendix 2—figure 1). We also show how those can be learned with the same learning rules used for nPE neurons (similar to Figure 1, see Appendix 2—figure 2). Note that these results are for a homogeneous network of pPE neurons. The results, which are shown in the Appendix, are referred to in the Discussion: “We expect that the same principles and approaches described here also hold for the formation of pPE neurons. […] The inhibitory plasticity establishes pPE neurons independent of the input configuration onto PCs and PV neurons as long as various excitatory, inhibitory, disinhibitory and dis-disinhibitory pathways can be balanced (see Appendix 2—figure 1, Equations 50 and 51)”. A thorough and complete investigation of nPE and pPE neurons developing in the same network is beyond the scope of this study and currently an ongoing project in the lab. To understand why the formation of both PE neurons in the same network is not a trivial task in the framework we use, please consider these two cases: i) Perfect PE neurons, that is, nPE neurons remain at baseline during feedback and playback, while pPE neurons remain at baseline during feedback and mismatch phase: For perfect nPE neurons to develop, the network must be exposed to feedback and playback phases. For perfect pPE neurons to develop, the network must be exposed to feedback and mismatch phases during training. Thus, if both PE neurons should emerge within the very same network, the network must be exposed to all phases. However, given the current plasticity rules we use in the framework, this would most likely produce neurons that remain at their baseline during all phases. To force the network to generate nPE and pPE neurons, the plasticity rules must incorporate gating signals that restrict learning to a subset of these phases. This is beyond the current framework. ii) Imperfect PE neurons, that is, nPE neurons remain at baseline during feedback, but are allowed to decrease activity during playback, and similarly pPE neurons remain at baseline during feedback but are allowed to decrease activity during mismatch: In such a scenario, the network must only be exposed to feedback phases, to avoid that the plasticity establishes an E/I balance also during playback and mismatch phases. nPE and pPE neurons can then develop within the same network, but the outcome will depend on many parameters, including the network initialisation, the number of plastic synapses and the distribution of actual and expected signals. The number of degrees of freedom in the model is then very large. A complete characterisation of all parameter boundaries is subject to ongoing research in the lab and beyond the scope of the present paper. We have now included a section on this topic in the Discussion: “In the present work, we derived the constraints for separate nPE and pPE neurons and did not study the parallel development of both in the same neural network. […] A thorough investigation of these scenarios for the simultaneous development of nPE and pPE neurons is, however, beyond the scope of the present study.” “The predictions for optogenetic inactivations in Figure 3 are very nice, but how do they extend to the heterogeneous case with pPE and other neuron classes (Figure 4)?” We agree that it would be great to extend the simulated optogenetic experience to heterogeneous networks. However, as discussed above, a thorough understanding of heterogeneous networks with nPE and pPE neurons is beyond the scope of the present work. In fact, Loreen Hertäg is currently preparing a grant proposal, in which she aims to extend on this work, including an extensive study of heterogeneous networks. 2) In the model, every neuron receives one or two scalar signals representing a one-dimensional visual input and corresponding motor efference copy. The authors should describe how their results would generalize to a situation in which neurons have different selectivities. In particular, there are observations that receptive-field size differs across cell types. More generally, could the circuit the authors describe generalize to neurons with "mixed selectivity" to multiple variables? That is a very good point and one of the foci of the above mentioned grant proposal. Indeed, it has been found that layer 2/3 mismatch neurons in rodent V1 exhibit receptive fields for mismatches, which co-align with their visual receptive fields (Zmarz and Keller, 2016). These sensorimotor mismatch responses are confined to local regions of the visual field. It is therefore very interesting to equip excitatory neurons with feature selectivity and study how the constraints on the IN circuit change and how this affects the selectivity in IN types, which have been shown to exhibit a broad spectrum of tuning properties (Sohya et al., 2007; Cardin et al., 2007; Kerlin et al., 2010; Atallah et al., 2012). We expect that nPE (and also pPE neurons) can still emerge in these networks, but further constraints will need to be satisfied. In a first approach, feature selectivity can be studied in independent columns (see e.g. Wacongne et al., 2012) with weak cross-coupling that is increased systematically. We intend to study this in detail but would prefer to leave this for future work because it will open up a completely new chapter. Nevertheless, we have included a section on this topic in the Discussion: “A hallmark of neurons in sensory areas is their pronounced feature selectivity (Cardin et al., 2007; Niell and Stryker, 2008; Harris and Mrsic-Flogel, 2013). […] For future work, it would be interesting to study how the presence of feature-selective PE neurons constrains the feature selectivity in interneurons that tend to be more broadly tuned than excitatory neurons (Sohya et al., 2007; Cardin et al., 2007; Kerlin et al., 2010; Atallah et al., 2012).” 3) The paper heavily focuses on a 4 cell-type motif (Pyr, PV, SOM, VIP), but does not provide a clear message on what the functional importance of this motif is. Could PE and nPE neurons emerge in circuits with only 2 or 3 cell types? If so, what computational benefits does the presence of other cell types provide? Along similar lines, how stringent are the conditions on synaptic weights to generate nPE (or pPE) neurons ? A comparison between the number of constraints and the number of variables would be interesting to determine how large the space of solutions is. Thanks for this suggestion. We decided to focus on these three IN types because they form a canonical motif that has been observed frequently. However, we agree that a more extensive discussion further strengthens the paper. We therefore now include an exhaustive list of nPE circuits and the corresponding constraints in the Appendix that hopefully clarifies this matter. Moreover, we adapted the text to account for this: “We used a mathematical analysis to derive constraints imposed on an interneuron circuit by the presence of nPE neurons. […] While a minimal model that allows nPE neurons to develop comprises SOM neurons and PCs (Attinger et al., 2017), the network with three inhibitory neuron types appears the most likely nPE circuit given what is currently known about rodent V1.” 4) What is the role of recurrent connections In the model? The argument in terms of pathways (Figure 2) does not take loops into account. The approach taken here should be compared with predictive coding of Deneve and Machens, which relies on recurrent connection. Thanks for this comment. It highlights that we have not sufficiently emphasized that the model in fact includes recurrent connections, not only between the PCs and the INs, but also within the PC population. To keep the figures clean, excitatory connections are not includes in the network schemata. To highlight that those are included, we updated the corresponding captions. Note that during feedback and playback, these connections are not relevant because the E-E synapses target the dendrites, which are over-inhibited in those phases. Please note also that the generality of the approach does not rely on the recurrent connections targeting the dendrites. Thanks for highlighting that the related work of Deneve and Machens should be discussed in the paper. We have now added a section in the Discussion: “In our model, the excitatory recurrent connections target the apical dendrites of PCs, but given that PCs comprise a homogeneous population, they serve no specific computational purpose in the present context. […] It will be interesting to study how this line of work is related to the PE circuit model we studied here, but it would require to extend the present model to perform richer computations, e.g., by endowing it with stimulus selectivity.” 5) A key point of the paper is the plasticity rule operative at SOM->PV synapses. In the first part of the paper, the authors use a non-local, backpropagation-like rule, and then later show that under certain circumstances this assumption can be relaxed. The fact that the initial learning rule is based on backpropagation becomes apparent only in the first paragraph of the Results, and the reader needs to dig through the Materials and methods and appendix to understand the details of the plasticity rule that is being used throughout the main text. The learning rules need to be explained earlier in the manuscript, e.g. by including, Equations 16, 17, and 19 of the Materials and methods in the main text. We also suggest reorganising the corresponding part of the Materials and methods to clearly separate the two different types of learning used at different points in the Results. More generally, during the discussion of biological plausible plasticity, the reader is left asking why the authors didn't restrict to such plasticity from the beginning. Are the results worsened when using more realistic rules? Thanks for pointing out that the plasticity rules used are not obvious to the reader and should therefore be mentioned earlier. We now describe the plasticity rules in the Results (second paragraph and subsection “nPE circuits can also be learned by biologically plausible learning rules” and reference to the respective equations in the Materials and methods more carefully. The Materials and methods now includes subsections that hopefully make the distinction between the two different rules more visible (subsections “Connections onto PCs”, “Connections onto PV neurons - non-local learning” and “Connections onto PV neurons - local approximation to backpropagation of error”). Regarding the performance of the local approximation of the backprop rule: The main goal of the present paper was to show that PE circuits can be learned by balancing excitation and inhibition, rather than on the specific synaptic learning rules that achieve an E/I balance. In our setting, the local, more biological learning rule does a very good job (see Figure 6). We also found that this local rule effectively establishes an E/I balance in networks with rich stimulus selectivity (Mackwood et al., 2020). Of course, we expect that the approximation will break down at some point, e.g., when the connection probability is too low. An extensive evaluation of the conditions under which the approximation works would in our view distract from the main message of the paper and should be done elsewhere. To justify our choice, we added a sentence in the Discussion: “Given that the main goal of the present paper was to show that PE circuits can be learned by balancing excitation and inhibition, we used the plasticity rule implementing a backpropagation of error, to ensure maximal generality.”
  78 in total

1.  A new cellular mechanism for coupling inputs arriving at different cortical layers.

Authors:  M E Larkum; J J Zhu; B Sakmann
Journal:  Nature       Date:  1999-03-25       Impact factor: 49.962

2.  Predictive coding as a model of biased competition in visual attention.

Authors:  M W Spratling
Journal:  Vision Res       Date:  2008-04-28       Impact factor: 1.886

Review 3.  Modulation of cortical circuits by top-down processing and arousal state in health and disease.

Authors:  Renata Batista-Brito; Edward Zagha; Jacob M Ratliff; Martin Vinck
Journal:  Curr Opin Neurobiol       Date:  2018-07-29       Impact factor: 6.627

4.  A cellular mechanism for cortical associations: an organizing principle for the cerebral cortex.

Authors:  Matthew Larkum
Journal:  Trends Neurosci       Date:  2012-12-25       Impact factor: 13.837

5.  Principles of connectivity among morphologically defined cell types in adult neocortex.

Authors:  Xiaolong Jiang; Shan Shen; Cathryn R Cadwell; Philipp Berens; Fabian Sinz; Alexander S Ecker; Saumil Patel; Andreas S Tolias
Journal:  Science       Date:  2015-11-27       Impact factor: 47.728

6.  Cell type-specific inhibitory inputs to dendritic and somatic compartments of parvalbumin-expressing neocortical interneuron.

Authors:  Hiroyuki Hioki; Shinichiro Okamoto; Michiteru Konno; Hiroshi Kameda; Jaerin Sohn; Eriko Kuramoto; Fumino Fujiyama; Takeshi Kaneko
Journal:  J Neurosci       Date:  2013-01-09       Impact factor: 6.167

7.  Four Unique Interneuron Populations Reside in Neocortical Layer 1.

Authors:  Benjamin Schuman; Robert P Machold; Yoshiko Hashikawa; János Fuzik; Gord J Fishell; Bernardo Rudy
Journal:  J Neurosci       Date:  2018-11-09       Impact factor: 6.709

8.  In vivo measurement of cell-type-specific synaptic connectivity and synaptic transmission in layer 2/3 mouse barrel cortex.

Authors:  Aurélie Pala; Carl C H Petersen
Journal:  Neuron       Date:  2014-12-24       Impact factor: 17.173

9.  Adults with autism overestimate the volatility of the sensory environment.

Authors:  Rebecca P Lawson; Christoph Mathys; Geraint Rees
Journal:  Nat Neurosci       Date:  2017-07-31       Impact factor: 24.884

10.  Learning-Related Plasticity in Dendrite-Targeting Layer 1 Interneurons.

Authors:  Elisabeth Abs; Rogier B Poorthuis; Daniella Apelblat; Karzan Muhammad; M Belen Pardi; Leona Enke; Dahlia Kushinsky; De-Lin Pu; Max Ferdinand Eizinger; Karl-Klaus Conzelmann; Ivo Spiegel; Johannes J Letzkus
Journal:  Neuron       Date:  2018-09-27       Impact factor: 17.173

View more
  6 in total

1.  Self-organization of a doubly asynchronous irregular network state for spikes and bursts.

Authors:  Filip Vercruysse; Richard Naud; Henning Sprekeler
Journal:  PLoS Comput Biol       Date:  2021-11-08       Impact factor: 4.475

2.  On the relationship between predictive coding and backpropagation.

Authors:  Robert Rosenbaum
Journal:  PLoS One       Date:  2022-03-31       Impact factor: 3.240

3.  NMDA receptors in visual cortex are necessary for normal visuomotor integration and skill learning.

Authors:  Felix C Widmer; Sean M O'Toole; Georg B Keller
Journal:  Elife       Date:  2022-02-16       Impact factor: 8.140

4.  Rate and oscillatory switching dynamics of a multilayer visual microcircuit model.

Authors:  Gerald Hahn; Arvind Kumar; Helmut Schmidt; Thomas R Knösche; Gustavo Deco
Journal:  Elife       Date:  2022-08-22       Impact factor: 8.713

5.  Opposing Influence of Top-down and Bottom-up Input on Excitatory Layer 2/3 Neurons in Mouse Primary Visual Cortex.

Authors:  Rebecca Jordan; Georg B Keller
Journal:  Neuron       Date:  2020-10-21       Impact factor: 17.173

6.  Prediction-error neurons in circuits with multiple neuron types: Formation, refinement, and functional implications.

Authors:  Loreen Hertäg; Claudia Clopath
Journal:  Proc Natl Acad Sci U S A       Date:  2022-03-23       Impact factor: 12.779

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.