Eyal Gruntman1, Sandro Romani1, Michael B Reiser2. 1. Janelia Research Campus, Howard Hughes Medical Institute, Ashburn, VA, USA. 2. Janelia Research Campus, Howard Hughes Medical Institute, Ashburn, VA, USA. reiserm@janelia.hhmi.org.
Abstract
A neuron that extracts directionally selective motion information from upstream signals lacking this selectivity must compare visual responses from spatially offset inputs. Distinguishing among prevailing algorithmic models for this computation requires measuring fast neuronal activity and inhibition. In the Drosophila melanogaster visual system, a fourth-order neuron-T4-is the first cell type in the ON pathway to exhibit directionally selective signals. Here we use in vivo whole-cell recordings of T4 to show that directional selectivity originates from simple integration of spatially offset fast excitatory and slow inhibitory inputs, resulting in a suppression of responses to the nonpreferred motion direction. We constructed a passive, conductance-based model of a T4 cell that accurately predicts the neuron's response to moving stimuli. These results connect the known circuit anatomy of the motion pathway to the algorithmic mechanism by which the direction of motion is computed.
A neuron that extracts directionally selective motion information from upstream signals lacking this selectivity must compare visual responses from spatially offset inputs. Distinguishing among prevailing algorithmic models for this computation requires measuring fast neuronal activity and inhibition. In the Drosophila melanogaster visual system, a fourth-order neuron-T4-is the first cell type in the ON pathway to exhibit directionally selective signals. Here we use in vivo whole-cell recordings of T4 to show that directional selectivity originates from simple integration of spatially offset fast excitatory and slow inhibitory inputs, resulting in a suppression of responses to the nonpreferred motion direction. We constructed a passive, conductance-based model of a T4 cell that accurately predicts the neuron's response to moving stimuli. These results connect the known circuit anatomy of the motion pathway to the algorithmic mechanism by which the direction of motion is computed.
The computation of directional selectivity has been studied for decades in both
vertebrate and invertebrate visual systems and given rise to competing algorithmic
models[1]. The
Hassenstein-Reichardt (HR) detector uses a synergistic combination of offset excitatory
inputs[2] to enhance responses to
motion in the preferred direction, whereas the Barlow-Levick (BL) detector uses
inhibitory input to “veto” an offset excitatory input[3], suppressing motion in the non-preferred, or null
direction.The HR detector, which was originally formulated to account for motion detection
in insects, has become a canonical example of a neuronal computation[1,4]. This
model has endured since its elegant mechanism accounted for a wide array of behavioral
results[2,5,6] and the
detailed response properties of large motion sensitive output neurons of the fly visual
system[1,7,8]. Recent
progress on the visual circuits upstream of these motion sensitive neurons in
Drosophila has revealed that stimuli are processed through ON and
OFF pathways[9-14]. In the ON pathway, the columnar neurons T4, are
the first cell type to exhibit directionally selective signals[10,11,14,15]; four T4 subtypes are each directionally selective in one of four
cardinal directions[9,11,16].A prevalent hypothesis for how T4 cells could implement the HR detector was
proposed based on different input cell types providing the temporally[14] and spatially[9] offset inputs, however, further analysis of the
circuit[15,17] has ruled out this circuit mechanism. The other
recent approach to identifying the mechanism responsible for directional selectivity in
the T4 circuit has been to use calcium imaging to measure T4 responses to structured
visual stimuli. Several studies found evidence for enhanced responses to preferred
direction motion[18,19], while several others found both preferred
direction enhancement and null direction suppression[20-22]. A number of
groups have recently proposed that motion detection in flies may be implemented as a
hybrid mechanism featuring elements of both the HR and BL algorithms[15,21-23]. However,
direct evidence for either mechanism has been elusive, in large part because these
studies rely on calcium imaging. Calcium indicator responses are insensitive to fast
events, obscuring the small timing differences required by the HR detector, and are also
insensitive to hyperpolarization, thereby preventing the direct measurements of
inhibition, a defining feature of the BL model.
Results
To directly measure the physiological properties of T4 neurons, we used
targeted in-vivo whole-cell electrophysiology. We confirmed the
identity of GFP-labeled T4 neurons by measuring reliable depolarizations in response
to small ON flashes (Fig. 1a and Supplementary Fig. 1). We
used on-line stimulus generation and analysis to localize the center (within
~2°) of the receptive field (RF) of each recorded neuron (Supplementary Fig. 1b,c). To
measure directional selectivity in single T4 cells, we presented a narrow
(~2° wide) ON bar moving in 8 different directions through the
mapped RF center (Fig. 1b,c). The T4 membrane
potential showed small depolarizations in response to several directions of motion
(Fig. 1b), but large responses to movement
in a ~90° range, centered around the preferred direction (PD;
opposite to the null direction, ND), in agreement with the tuning width of measured
with calcium imaging[11]. Our moving
bar stimulus generates apparent motion: the bar appears to move in a series of
discrete (~2°) steps, where the speed of movement is determined by
the duration the bar remains at each position. To examine the computation of
directional selectivity across a behaviorally relevant range, we presented 4
movement speeds. The responses to the moving stimuli were averaged across individual
recordings once aligned to the PD (Fig. 1c). We
quantified the strength of directional selectivity using a directional selectivity
index (DSI, defined in Methods, analysis section) and saw significant tuning at all
stimulus speeds (p< 0.05, one sided unpaired t-test), with stronger
directional selectivity at the two slower speeds (p< 0.05, one sided paired
t-test, Fig. 1d). At slow speeds, the membrane
potential exhibited prominent hyperpolarizing responses, both following the response
to PD motion, and preceding the depolarizing response to ND motion (arrowheads in
Fig. 1c, detailed in Supplementary Fig. 2). The sequence of
depolarizing and hyperpolarizing signals depended on the direction of motion,
suggesting that inhibitory and excitatory inputs to T4 may be spatially offset in
agreement with recent anatomical findings[17].
Figure 1
Whole cell recordings of T4 cells show directionally selective
responses
(a) Top: schematic of Drosophila visual system with
example T4 cell; whole-cell recordings are targeted to cell body. Bottom: a
moving bar (1 × 9 LEDs) is presented as a sequence of discrete steps.
(b) Responses to a bright bar moving in 8 directions (indicated
by arrows). Individual trials in grey (n=3 trials), mean in maroon.
Center: response to each direction at the time of maximal directional
selectivity. (c) Baseline-subtracted responses (n=17 cells)
to bar motion, aligned to the PD of each cell (mean ± SEM). Arrowheads
indicate hyperpolarization following (preceding) response to PD (ND) motion.
Colored horizontal bars indicate stimulus presentation. (d)
Directional Selectivity Index for bar motion responses (n=17 cells).
Crosses represent outliers, and triangles denote data points outside of the plot
(see Methods for boxplot conventions).
Mapping the T4 receptive field reveals a spatiotemporal asymmetry
In order to characterize the fine structure of T4’s RF, we first
localized the RF center and identified the PD-ND axis. We decomposed the moving bar
stimulus into its elementary components—single position bar flashes with a
fixed duration—and presented them at all positions, randomly interleaved,
along the PD-ND axis. Aligning the single position flash responses (SPFRs) based on
the position of peak depolarization (Fig. 2a)
reveals two key properties. (1) The center and leading side of the T4 RF exhibit
excitatory responses. These responses are larger and last longer for longer flash
stimuli (corresponding to slower speeds). Additionally, longer flashes presented on
the leading side of the RF reveal depolarizing responses that are not observed for
short flashes. (2) The inputs along this axis are spatially asymmetrical;
hyperpolarizing inputs only appear on the trailing side of the RF (most clearly seen
for the longer duration flashes). Overlaying voltage traces from leading and
trailing RF positions exposes a temporal asymmetry in the inputs (Fig. 2b,d). Responses on the trailing side decay faster
than responses at equivalent positions on the leading side. This temporal sharpening
can be seen across speeds, including the shortest duration flash where
hyperpolarization was not directly recorded (arrowheads in Fig. 2b).
Figure 2
T4 cells receive spatially, but not temporally, offset excitatory and
inhibitory inputs
(a) Averaged baseline-subtracted responses (mean ±SEM) to
single position bar flash stimuli (top) along the PD-ND axis of each cell
(n=17 cells). (b) Mean responses from indicated positions
in a aligned to stimulus presentation (grey rectangles). Arrowheads
emphasize differences in decay times. (c) Maximum depolarizing and
hyperpolarizing responses (mean ± SEM) by stimulus position for 80ms
flashes (n=17 cells). (d) Response onset time and decay
time to flashes near the RF center (median and quartiles). Cell number varies
for each position and duration combination (see Methods Onset time calculation
for details). (e) Slope of the linear regression of onset time
against position and the slope of the decay time against position, calculated
separately for each cell and each duration (n=17, 17, 14, 9 cells;
* slope < 0; p < 0.01, one sided unpaired t-test; grey
crosses represent outliers; see Methods for boxplot conventions).
For the longer duration flash stimuli our RF mapping (Fig. 2c and Supplementary Fig. 3), shows a
spatially offset distribution of depolarizing and hyperpolarizing inputs to T4, with
the hyperpolarizing input on the trailing side of RF. Remarkably the offset between
the peaks of excitatory and inhibitory inputs (~6° of visual angle;
Supplementary Fig. 3b)
corresponds to the approximate angular separation between adjacent ommatidia on the
fly eye, matching the predicted structure for the inputs of the
Drosophila motion detector[5]. Because we do not measure excitatory and inhibitory
currents directly, the underlying overlap between excitation and inhibition is
expected to extended beyond positions where we can robustly measure hyperpolarizing
input.If T4 implements an HR-like mechanism, we expect that SPFRs on the leading
side of the RF would be time delayed relative to trailing side SPFRs[2,14]. A simple expectation for relative time delays between offset
visual stimulation generated through differential filtering by either upstream cells
and/or synaptic transmission is that the responses would occur with a relative
temporal offset. However, the onset time of the flash response is constant across
the RF (Fig. 2d,e). Nevertheless, a significant
position effect is seen in the decay time (Fig.
2d,e), which we attribute to temporal sharpening (arrowheads in Fig. 2b) due to overlapping inhibitory inputs on
the trailing side of the RF (Fig. 2c). Rather
than a relative time delay between offset inputs, we find that the temporal
differences in the SPFRs can be explained by fast excitatory inputs on the leading
side combined with slower inhibitory inputs on the trailing side of the RF.
Directional selectivity results from ND suppression
The classical models for motion detection nonlinearly combine offset inputs,
resulting in either an enhancement of PD responses (HR) or a suppression of ND
responses (BL). A simple test for directional selectivity is to deliver flash
stimuli at two positions along T4’s RF and see whether the response to
sequential stimulation in one direction is larger than the response to the opposite
sequence of stimulation. We implemented this two-step apparent motion stimulus using
single position flashes (160 ms duration) for pairs of positions around the RF
center (Fig. 3a). To assess the effect of
direction, we compared each two-step response to the superposition of the
corresponding two single position responses (offset in time and summed; Fig. 3b). This comparison reveals no enhanced
response to the second bar appearing towards the PD (left column), but a reduction
in the second bar response when it appears towards the ND (arrow, right column).
Flashes separated by ~10° (bottom row) do not appear to interact,
showing neither enhancement nor suppression. Furthermore, the suppression is most
prominent for bar pairs ‘moving’ in the ND that begin on the
trailing side, consistent with our finding that the distribution of inhibitory
inputs is skewed to this side of the RF. By comparing the apparent motion responses
to the same bar pairs, but in opposite directions (Fig. 3c), we see that the leading side of the RF—corresponding
to the primarily excitatory domain of the SPFRs (Fig.
2)—is motion blind (see also Supplementary Fig. 4). Strong
directional selectivity is only seen for the bar pairs that include the trailing
side of the RF, and is only produced by ND suppression.
Figure 3
T4 responses to two-step apparent motion show null direction suppression, but
no preferred direction enhancement
(a) Averaged baseline-subtracted responses (mean ±SEM) to
160 ms flash stimuli presented along the PD-ND axis (n=3 cells). Single
position responses are shown along the diagonal, PD motion (red) and ND motion
(blue) are shown in the lower and upper triangle, respectively. Stimulus
positions indicated correspond to those in Figure
2. Apparent motion is presented as two flash stimuli timed to match
their appearance during 14°/sec motion. Space-time plots of the stimuli
are depicted in each inset. (b) Comparison of mean measured
responses to two-step stimulation to the superposition (time-aligned sum) of the
single position responses. Each row shows responses to stimuli presented at the
same positions but in the reversed temporal order. (c) Mean PD and
ND responses from the corresponding rows in b are compared, with
the Directional Selectivity Index indicated. The DSIs are averaged from 3 cells
(individual values for top panel: 0.02, 0.06, 0.1; for middle panel: 0.47, 0.5,
0.16).
The superposition of stationary responses generates directional
selectivity
The two-step apparent motion responses (Fig.
3) suggest that it is the integration of offset excitatory and inhibitory
inputs that generates directionally selectivity. However, is this mechanism
sufficient to explain the directionally selective responses to moving bars? To
address this question, we compared the responses to moving bar stimuli, comprised of
an ordered sequence of single position bar flashes (Fig. 4a,b), with the superposition of SPFRs (the sum after appropriate
temporal alignment; Fig. 4c). Importantly, the
only difference between the summed PD and the summed ND responses was the order in
which the SPFRs were aligned. This comparison is conceptually similar to our
treatment of the two-step apparent motion responses, but now many interactions
across the entire receptive field are being probed. We find that the simple sum of
SPFRs (Fig. 4c) captures the essential dynamics
of the moving bar response (Fig. 4c,d) and
shows significant directional selectivity at all four speeds (Fig. 4e).
Figure 4
Ordered summation of single position flash responses qualitatively reproduces
moving bar responses
(a) Schematized T4 dendrites overlaid on 9 bar stimulus positions
(colors as in Figure 2). (b)
Mean PD and ND motion responses from an example cell to the 28°/s bar
motion. (c) Top: Single Position Flash Responses (SPFRs) from the
example cell in b colored by position, temporally aligned to
account for moving bar position. Bottom: Summed SPFRs for PD and ND motion.
(d) Comparison of mean measured moving bar responses and mean
summed SPFRs (n=31 trajectories, 16 cells). (e) Boxplots
for Directional Selectivity Index for measured and summed responses across
speeds (n=31 trials, 16 cells; see Methods for boxplot conventions).
Crosses represent outlier, triangle denotes a point outside the scope of the
plot (* DSI > 0, p < 0.01 one sided unpaired t-test).
(f) Same data as in d, juxtaposing measured
responses and summed SPFRs for PD and ND motion separately.
The summed responses approximated the measured responses without any
nonlinear integration, a surprising result since both the HR and BL models of motion
detection require a nonlinear interaction to compute directional selectivity. To be
clear, we are not claiming that directional selectivity emerges from linear
operations (for example, the DSI computation is nonlinear), but rather emphasize
that these directionally asymmetric responses expose a significant discrepancy
between the T4 mechanism and the classical models. To gain further insight, we
examined features of the SPFRs for contributions to directional selectivity in the
summed responses, and found that it is the position dependent differences (such as
the response shape, sign, and magnitude) that account for the DSI of the summed
responses. (Supplementary Fig.
5). However, the summation of aligned SPFRs only accounts for
approximately half of the measured T4 DSI (Fig.
4e). Summing negative and positive potentials will undercount the effect
of inhibition. For example, in shunting inhibition a small inhibitory current can
eliminate a much larger excitatory current. The effect of this undercounting is most
prominently seen in the overestimated ND responses (Fig. 4f). Does this mismatch suggest that T4 implements a nonlinear step
like the classical models, or could biophysically realistic integration of
inhibitory inputs explain the additional measured DSI?
A conductance-based simulation quantitatively predicts the T4 motion
response
To answer this question, we built a conductance-based model of a T4 neuron
reconstructed from electron microscopy data (Fig.
5a). In our model, we randomly distributed excitatory and inhibitory
synapses along the dendritic arbor of the cell, but fit the synaptic strengths, the
time course of the excitatory and inhibitory conductance changes, and membrane and
axial resistivity, using an optimization procedure that minimizes the error between
simulated responses to bar flashes and our measured SPFRs (from all the positions
and all the speeds; Fig. 5a, Supplementary Fig. 6a,b,c). We then
used the resulting model to simulate T4’s response to a moving bar (Fig. 5b). Importantly, we fit our model’s
parameters using responses to stationary stimuli (the SPFRs), but used the model to
predict the responses to an independent motion stimulus (the moving bar responses),
thus testing the generalization of this model.
Figure 5
Asymmetric inputs in a conductance based model predict directionally
selective responses
(a) Anatomical reconstruction of a T4 cell used in the simulation,
with expanded dendrite showing positions of modeled excitatory (dark green) and
inhibitory (light green) synapses. Enlarged markers indicate active synapses for
an example stimulus position. Spatial filters, which determine synaptic weight,
are shown normalized and aligned to the dendrite, and temporal filters are shown
in inset. (b) Mean measured motion responses (from Figure 4) compared to the simulated predictions.
(c) Peak PD and ND responses measured in T4s, compared to the
Summed SPFRs (from Figure 4) and simulation
results. (d) Direction Selectivity Index for the mean measured
responses compared to: (Left) model simulation results and summed responses;
(Center) simulation results without inhibition; (Right) simulation results with
all synaptic inputs placed in same location, and a separate single-compartment
simulation (detailed in Methods).
In comparison to the summed responses (Fig.
4d), the simulated responses more accurately reproduced the measured
responses to both PD and ND motion (Fig. 5b,c).
Furthermore, the same simulation qualitatively reproduced the dynamics of T4
responses (Fig. 5b) and quantitatively
reproduced the magnitude of the directionally selective responses (Fig. 5c,d and Supplementary Fig. 6d,e,f) at all
tested speeds, suggesting that the same mechanism can account for directional
selectivity, even at the fastest speeds tested where hyperpolarization was not
directly measured (Fig. 2).One classical neuronal computation of directional selectivity based on
passive cable properties[24], is
that sequential depolarization directed towards the (thicker) axon generates
stronger responses than activation in the opposite direction. To test whether this
mechanism could contribute to directional selectivity in T4, we zeroed all
inhibitory conductance changes in our model. When the simulation was repeated, only
the excitatory synapses were activated as the “stimulus” swept in
the PD or ND. We found that in the absence of inhibition, the DSI of the simulated
T4 is abolished at all speeds (Fig. 5d,
middle), indicating that depolarizing inputs alone cannot produce directional
selectivity. This suggests that inhibitory inputs to T4, in addition to providing
slow hyperpolarization that characterizes responses to slow motion (Fig. 1c), also establish the temporal sharpening of
trailing side responses, critical for directional selectivity at faster speeds
(Supplementary Fig.
5b).The striking retinotopic alignment of T4 dendrites with the PD-ND axis was
an important anatomical clue in their identification as directionally selective
cells[9,25]. We used our simulation to test whether the
neuron’s morphology substantially contributes to directional selectivity. By
positioning all 154 synaptic inputs (inhibitory and excitatory) at the base of the
dendrite (all other parameters held constant), we found that these simulation
results were almost indistinguishable from the unmodified simulation (Fig. 5d, right). As a consequence of this result, we
expected that a single -compartment neuron simulation should also be able to capture
the response dynamics of T4. Indeed, this simpler model reproduces the moving bar
responses of T4 with only negligible differences to the multi-compartment simulation
results (Fig. 5d, right). These results suggest
that the critical role of T4’s elaborate and directional dendrite is limited
to collecting different input signals across a small region of the fly’s
eye. Taken together, these modelling results corroborate our central finding that
inhibitory inputs sculpt the essential asymmetry for directionally selective
responses in T4 cells—no additional mechanisms are required.
Discussion
We used intracellular recordings to probe the mechanism of directionally
selectivity for ON motion in T4 neurons. Using on-line stimulus generation, we
finely mapped the receptive field of T4 neurons (Fig.
2). Remarkably, this receptive field structure, comprised of responses to
stationary ON bars, can be used to predict the response of T4 to moving bars (Fig. 4). We then improved these predictions with
a parsimonious, biophysical model integrating offset excitatory and inhibitory
conductances (Fig. 5).In the present study, we show that directional selectivity in T4 neurons
arises solely from ND suppression. Recent studies that used calcium imaging of T4
neuron responses to a two-step apparent motion stimulus found evidence for PD
enhancement[18,19] or for both PD enhancement and ND
suppression[20,22]. These discrepancies may arise from
differences in the stimulus design, differences between single neuron and population
recordings, differences between measurements of membrane potential versus calcium
indicator fluorescence, or from combinations of these factors. Since the
transformation between membrane potential and calcium indicator fluorescence is
likely to be superlinear[21],
measured calcium responses may appear enhanced when compared to their summed
components, even though the underlying voltage response does not show this
enhancement (Supplementary Fig.
7). However, as Fisher, et al.[18] have controlled for this possibility, we believe that the
difference is most likely explained by differences in stimulus design (centered bars
versus edges[18]; centered narrow
bars versus centered wider spots[20]). Haag, et al.[20]
found PD enhancement in T4 cells, but only for stationary stimuli wider than those
used here. These larger stimuli, in addition to strongly depolarizing the measured
T4 cells, are also effectively stimulating neighboring T4 cells. But since T4
neurons synapse onto other T4 cells with exquisite precision—only cells with
the same directional tuning are connected, and only in the direction of
motion[17]—this
circuit mechanism should produce a superlinear response in one direction (along the
PD), but not in the opposite direction. This enhancement of PD motion over a
slightly larger spatial scale appears to be evidence for directed facilitation from
neighboring T4 cells, but not for an HR-like mechanism responsible for directional
selectivity (since blocking synaptic transmission in T4 neurons does not
substantially reduce directionally selective responses in T4 cells[15,22]). This factor may partially explain the measurement of PD
enhancement in Fisher et al.[18],
but future experiments will be required to clarify these discrepancies.Anatomical studies have shown that dendrites of T4 cells are aligned with
the PD[9,25] and receive spatially offset synaptic inputs from distinct
columnar neuron types[17]. The
arbors span several (3-4) retinotopic columns in the medulla[16], which given the ~5° visual
angle per column, corresponds well to our measured RF width of ~20°
(Fig. 2). The cholinergic columnar neurons
Mi1 and Tm3 preferentially synapse onto T4s in the central to distal part of the
arbor, while the GABAergic neurons Mi4, C3, and CT1, synapse mainly onto the base of
the dendrite[15,17,26,27]. This anatomical arrangement
agrees remarkably well with the functional organization of our measured T4 receptive
field, with leading-side excitation and trailing-side inhibition.A recent study from our lab used circuit perturbations to also propose a
hybrid mechanism for T4 motion responses[15]. Although we showed nonlinear integration of excitatory
inputs by T4, we did not show that this nonlinearity contributes to the computation
of directional selectivity, so these results do not conflict with the current
findings. In fact, EM-based circuit analysis[9,17], as well as a
number of functional studies[14,15,23,26,28,29],
suggests that multiple cell types (at least Mi1 and Tm3) contribute to the
excitatory ON component of our measured RF, without evidence for a spatial offset
required for an HR-like mechanism[17]. In our previous study, we also showed that silencing one of
the main inhibitory inputs to T4 (Mi4) does not reduce directional
selectivity[15]. Since our
current results strongly suggest that inhibition is necessary for computing
directional selectivity, it is entirely possible that the critical inhibitory inputs
to T4 are not simply contributed by a single cell type (other candidates include
CT1, C3, and TmY15[17]).
Synthesis of experimental results with classical computational models for motion
detection
Historically, the computation of directional selectivity has been
conceptualized with algorithmic models that have outlined the general properties
that transform non-selective inputs into a directionally selective output. Here we
compare our results with the implications of the classical models, and present an
algorithmic summary of our findings. Motion detector models typically describe a
‘fully-opponent’ computation, whereby local motion estimates in
opposing directions are subtracted to minimize non-motion signals[1]. Since T4 is now understood to reside one
step before this opponent subtraction[30], we only consider the subunits of each model type before this
subtractions stage. The classical model for insect motion detection is the
Hassenstein-Reichardt Detector[2].
This detector generates directional selectivity by delaying leading side excitatory
input and combining it with trailing side excitatory input with a nonlinear
amplification, thus enhancing the preferred direction response (Fig. 6a). If T4s directional selectivity was based on an
HR-like mechanism, we would expect to find: (1) excitatory inputs spanning the
receptive field, (2) differing temporal filtering of inputs resulting in a delayed
leading side, and (3) enhanced responses to PD motion (Fig. 6a). We did not find evidence for any of these properties: (1) T4
receives offset excitatory and inhibitory inputs (Fig.
2c), (2) leading side responses are not temporally delayed (Fig. 2d,e and Supplementary Fig. 8), and
(3) PD motion is not enhanced (Fig. 3,4,5).
Furthermore, the excitatory region of the receptive field is motion blind (Supplementary Fig. 4).
Figure 6
Comparison the T4 mechanism to classical computational models for directional
selectivity
(a) The Hassenstein-Reichardt detector[2] uses slower filtering on the leading side
arm and a multiplicative nonlinearity to produce coincidence detection of
excitatory signals. The time constants are labeled as
E-Lead and
E-Trail, where
‘E’ indicates filtering of an excitatory signal. The semicircles
represent two neighboring inputs (arms) to the motion detectors.
(b) The Barlow-Levick detector[3] combines a slow trailing side
‘inhibitory’ arm with a faster leading side
‘excitatory’ arm through an ‘AND-NOT’ operation
that vetoes motion in the null direction. (c) The Adelson-Bergen
detector[31], uses
oriented spatiotemporal filters, created from offset spatial and fast and slow
temporal filters, to produce directional selectivity. (d) A summary
of the T4 mechanism formatted for comparison to the algorithmic models, based on
the Torre-Poggio model[35]. The
simple RF structure of T4 (Supplementary Fig. 8), does not require the more complex spatial and
temporal filters used in the AB model. Instead we use filters fit to T4
measurements (Figure 5), that produce a
fast, excitatory leading signal and a delayed, inhibitory trailing signal. As in
the AB model, directionally selective responses can be obtained through a linear
combination of these signals (grey lines). The dynamic nonlinearity operates on
the magnitude of excitatory (E) and inhibitory (I) conductances (detailed in
Methods) to produce a directionally selective membrane potential response.
An alternative classical model is the Barlow-Levick detector[3] that integrates fast leading-side
excitation and slower trailing-side inhibition through the nonlinear ‘AND
NOT’ operation (Fig. 6b). Directional
selectivity is computed since motion is signaled when excitatory input precedes the
inhibitory input; in the reverse order ND motion is suppressed. We find several
properties in our T4 recordings that agree with this framework: (1) leading-side
excitatory responses and trailing-side inhibitory responses (Fig. 2), (2) longer temporal filtering on the trailing
side, such that inhibition outlasts excitation (Fig.
2, 5 and Supplementary Fig. 8), and
(3) clear evidence for ND suppression with both two-step and moving bar apparent
motion (Fig. 3c, 4f).While the organization of the T4 RF agrees with the algorithmic principle of
the BL model, we find an important difference in the integration mechanism. The BL
model uses a strong nonlinearity, and so does not produce directionally selective
responses from the superposition of the responses to stationary inputs (which our T4
recordings show, Fig. 3,4). This seemingly technical distinction is in fact quite
consequential, since in both the HR and BL models, the nonlinearity is a defining
characteristic—without it there are no directionally selective responses. In
contrast, our T4 results show that directional selectivity is already present in the
superposition of the stationary stimulus responses.A 3rd class of motion detectors—the Adelson-Bergen (AB)
motion energy model[31]—generates directional selectivity by using filters that are
oriented in space-time. These linear filters are combined spatial and temporal
filters that feature a prominent space-time tilt indicating the preferred direction.
In Fig. 6c we present only the relevant subunit
of the full AB model for comparison with our data. This subunit has (1) different
spatial filters on each of its arms, (2) excitatory and inhibitory inputs, (3) a
linear integration step that produces a directionally selective response from the
superposition of stationary responses (as seen in simple cells of the cat visual
cortex[32]), and (4) a
static amplifying nonlinearity. This flexible model, whose commonalities with
variants of the classical motion detection models have been broadly
discussed[1,31,33],
provides a useful framework for representing the T4 computation.In light of the commonalities and discrepancies between classical
algorithmic models and our results, we next provide a summary of our understanding
of T4 in a format that is directly comparable to these models (Fig. 6d), with an excitatory and an inhibitory arm. The
more complex spatial and temporal filters of the AB model, selected to match
cortical recordings, are replaced with the filters describing T4 responses (Fig. 5 and Methods). The spatial filters create
the leading vs. trailing offset, and the temporal filters create the fast vs. slow
difference, between the excitatory and inhibitory arms. In contrast to the
‘point sampling’ inputs of the classical HR and BL models, T4 shows
spatially extended responses to visual input (Fig.
2b), which has previously been shown to improve the accuracy of motion
detection[34]. While the sum
of the linear filter outputs produces asymmetric responses to PD and ND motion
(Fig. 6d), T4 neurons further enhance this
selectivity through nonlinear integration. While the AB model employs a static,
amplifying nonlinearity, we measure a suppressing nonlinearity, which is well
approximated with a simplified passive, biophysical model (similar to a previous
formulation[35]; Fig. 5d, 6d,
and Methods).
Spatiotemporal receptive fields of T4 cells
Representing the response properties of neurons using spatiotemporal
receptive fields, equivalent to the response of the spatiotemporal filters in the AB
model, has provided significant insight into transformation along the visual pathway
of mammals[36], and has recently
also been applied to the responses of directionally selective neurons in
flies[19,21]. We have replotted the averaged SPFRs to
approximate the spatiotemporal receptive field of a T4 neuron (Supplementary Fig. 8, left).
As expected for a directionally selective neuron, we find a distinctive tilt in this
receptive field, indicating the preferred direction. This RF has both excitatory and
inhibitory lobes, which both appear tilted, an organization that may suggest PD
enhancing and ND-suppressing mechanism[19,21]. To explore the
basis of the RF tilt, we used our multi-compartment T4 model, and replotted the
simulated SPFRs (Supplementary
Fig. 6c) as a spatiotemporal RF (Supplementary Fig. 8, middle). Since the model was fit to the
SPFR data, the qualitative agreement between these RFs is not surprising. The
simulated RF also features a tilted excitatory lobe. When we removed inhibition in
our model (Fig. 5d) and reproduced the RF
(Supplementary Fig. 8,
right), we found that the excitatory lobe was no longer tilted, consistent with our
inference that inhibition sharpens trailing side responses (Fig. 2b). This simulation shows that a tilted, bi-lobed
spatiotemporal RF does not necessarily support a hybrid mechanism, and reiterates
the lack of evidence in our data for PD enhancement.It is worth noting that the receptive fields of T5 cells, the OFF
directionally selective neurons, look qualitatively similar to those estimated for
T4 neurons[19-21]. Therefore, it is possible that T5 neurons
compute directional selectivity using the same simple mechanism that we have
established here for T4 neurons. Interestingly, no small field inhibitory
inputs[37,38] to T5 have thus far been identified, raising
the possibility of different implementations of the same motion computation in ON
and OFF pathways[39,40].
Methods
Histology
To visualize the expression pattern of the T4 driver line (SS02344),
brains of female flies were immunolabeled and imaged as described[41]. anti-Brp was used as a stain
for the neuropil marker (anti-nc82 1:30, Developmental Studies Hybridoma Bank)
and pJFRC225-5XUAS-IVS-myr::smFLAG (rat anti-FLAG 1:100, Novus Biologicals) in
VK00005[42]
was used as the reporter for GAL4 expression. For Supplementary Fig. 1a, the image
shown was generated from a confocal stack imaged on a Zeiss LSM 710 microscope
with a 63× objective, and resampled using Vaa3D[43].
Electrophysiology
Experiments were performed on 1-2 day old female Drosophila
melanogaster (flies were reared under constant light conditions at
24°C, some flies experienced periods of darkness, including overnight,
prior to dissection). To target T4 cells, a single genotype was used:
pJFRC28-10XUAS-IVS-GFP-p10[44] in attP2 crossed to stable split-GAL4 SS02344
(VT015785-p65ADZp (attP40); R42F06-ZpGdbd (attP2)) generously provided by
Aljoscha Nern in Gerry Rubin’s lab (line details with expression data
available from http://splitgal4.janelia.org/). Flies were briefly anesthetized
on ice and transferred to a special chilled vacuum holder where they were
mounted, with the head tilted down, to a customized platform machined from PEEK
using UV-cured glue (Loctite 3972). CAD files for the platform and vacuum holder
are available upon request. To reduce brain motion the proboscis was fixed to
the head with a small amount of the same glue. The posterior part of the cuticle
was removed using syringe needles and fine forceps. The perineural sheath was
peeled using fine forceps and, if needed, further removed with a suction pipette
under the microscope. To further reduce brain motion, muscle 16[45] was removed from between the
antenna.The brain was continuously perfused with an extracellular saline
containing (in mM): 103 NaCl, 3 KCl, 1.5 CaCl2 2H2O, 4
MgCl2 6H2O, 1 NaH2PO4 H2O, 26
NaHCO3, 5 N-Tris (hydroxymethyl) methyl-2- aminoethane-sulfonic
acid, 10 Glucose, and 10 Trehalose[46]. Osmolarity was adjusted to 275 mOsm, and saline was
bubbled with 95% O2/5% CO2 during the
experiment to reach a final pH of 7.3. Pressure-polished patch-clamp electrodes
were pulled for a resistance of 9.5-10.5 MΩ and filled with an
intracellular saline containing (in mM): 140 KAsp, 10 HEPES, 1.1 EGTA, 0.1
CaCl2, 4 MgATP, 0.5 NaGTP, and 5 Glutathione[46]. 250μM Alexa 594 Hydrazide was added to the
intracellular saline prior to each experiment, to reach a final osmolarity of
265 mOsm, with a pH of 7.3.The mounted, dissected flies were positioned on a rigid platform mounted
on an air table. Recordings were obtained from labeled T4 cell bodies under
visual control using a Sutter SOM microscope with a 60× water-immersion
objective. To visualize the GFP labeled cells, a monochrome, IR-sensitive CCD
camera (ThorLabs 1500M-GE) was mounted to the microscope, an IR LED provided
oblique illumination (ThorLabs M850F2), and a 460 nm LED provided GFP excitation
(Sutter TLED source). Images were acquired using Micro-Manager[47], to allow for automatic
contrast adjustment.All recordings were obtained from the left side of the brain. Current
clamp recordings were low-pass filtered at 10KHz using Axon multiClamp 700B
amplifier, and were sampled at 20KHz (National Instrument PCIe-7842R LX50
Multifunction RIO board) using custom LabView (2013 v.13.0.1f2; National
Instruments) and MATLAB (2015a; Mathworks) software. The membrane potential of
recorded cells was set around −65mV (uncorrected for liquid junction
potential), which required injecting a small, hyperpolarizing current (0-3 pA),
which after initial adjustment was maintained at a constant value throughout the
recording. To verify recording quality, current step injections were performed
intermittently, throughout the experiment. Recordings from cells in which either
visual or current step responses diminished noticeably were terminated.
Visual stimuli
The display was constructed from an updated version of the LED panels
previously described[48]. The
arena covered slightly more than one half of a cylinder (216° in azimuth
and ~72° in elevation) of the fly’s visual field, with
each pixel subtending an angle of ~2.25° on the fly eye. Green
LEDs (emission peak: 565 nm) were used, bright stimuli were ~72
cd/m2, and were presented on an intermediate intensity background
of ~31 cd/m2.Visual stimuli were generated using custom written MATLAB code that
allowed rapid generation of stimuli based on individual cell responses. In
contrast to the published stimulus control system[48], we have now implemented an FPGA-based
panel display controller, using the same PCIe card (National Instrument
PCIe-7842R LX50 Multifunction RIO board) that also acquired the
electrophysiology data. This new control system (implemented in LabView) streams
pattern data directly from PC file storage, allowing for on-line stimulus
generation. Furthermore, this new control system featured high precision (to 10
μs) timing and logging of all events, enabling reliable alignment of
electrophysiology data with visual stimuli.To map the receptive field (RF) center of each recorded cell, three
grids of flashing bright squares (on an intermediate intensity background) were
presented at increasing resolution. Each flash stimulus was presented for 140
ms. First, a 6 × 7 grid of non-overlapping 5 × 5 LEDs
(~11°×~11°) bright squares was
presented. If a response was detected, a denser 3 × 3 grid with
50%-overlapping 5 × 5 LEDs
(~11°×~11°) bright and dark squares (to
further verify these were T4 Cells) was presented at the estimated position of
the RF center (see Supplementary Fig. 1). If a recorded cell was consistently
responsive to the first two mapping stimuli, a third one was presented to
identify the RF center. A 5 × 5 grid of 3×3 LED bright squares
separated by 1 pixel-shifts was presented at the estimated center of the second
grid stimulus. The location of the peak response to this stimulus was used as
the RF center in subsequent experiments. Once the RF center was identified, the
moving bar stimulus was presented in 8 directions and 4 step durations. The bar
was 1 × 9 pixels. When moving in the cardinal directions, the motion
spanned 9 pixels. In the diagonal directions bar motion included more steps to
cover the same distance (9 steps vs. 13 steps). Once the preferred direction had
been estimated, bright bar (also 1 × 9 pixels) flashes were presented on
the relevant axis. To verify full coverage of RF, this stimulus was presented
over an area larger than the original motion window (at least 13 positions). In
addition to these stimuli, most cells were also presented with additional
stimuli following this procedure. All stimuli were presented in a pseudorandom
order within stimulus blocks. All stimuli were presented 3 times, except for
single bar flashes which were repeated 5 times. The inter-stimulus interval was
500ms for moving stimuli and 800ms for single bar flashes (to minimize the
effect of ongoing inhibition on the responses to subsequent stimuli).
Analysis
All data analysis was performed in MATLAB using custom written code.
Since the T4 baseline was typically stable, we included only trials in which the
mean pre-stimulus baseline did not differ from the overall pre-stimulus mean for
that group of stimuli by more than 10 mV. We also verified that the pre-stimulus
mean and overall mean for that trial did not differ by more than 15 mV (or 25 mV
for slow moving bars, due to their strong responses). This was designed to
identify those rare trials in which the trace became unstable only after the
stimulus was presented. Responses were later aligned to the appearance of the
bar stimulus and averaged (or the appearance of the bar in the central position
in case of the 8-orientation moving bar). T4 cells are expected to signal using
graded synapses. Consistent with this expectation, we find that T4 recordings
only occasionally feature very weak, fast transients (~1-2 mV in size)
that could not be verified as spikes. Therefore, we have focused our analysis on
the graded (sub-threshold) components of T4’s responses.
Determining PD
First, 8 direction responses were aligned to the center position for
each cell. Second, for the duration of bar presentation, the mean vector
response was calculated for each time point as . With θk being the
direction of motion (in 45° intervals), and
R(θk,t) the response at that direction at time t. Supplementary Fig. 2
shows the result of this procedure for the normalized vector magnitude and . This procedure can be thought of as
mimicking a downstream neuron, receiving input from T4s with the same RF
center but different PDs. for the cell was selected at the time point
when the mean vector magnitude was maximal (repeated for 4 step durations
and averaged). In the polar plot of Fig.
1b, this time point is used for the response to each bar
direction. The PD was determined as the θk with the
minimal difference to . For Fig.
1c, responses were then circularly shifted to align the PD with
rightward motions for plotting and averaging purposes.
DSI calculation
Direction selectivity index was defined as , with each response defined as the 0.995
quantile (a robust estimate of the max) within the stimulus presentation
window. DSI for the model and model variations was calculated in the same
manner.
Single Position Flash Response – depolarization
Responses were defined as the 0.995 quantile (a robust estimate of
the max) of the response during the time between bar appearance and flash
duration + 75ms. If this number did not exceed 3 standard deviations
of the pre-stimulus baseline (for all bar flashes for that cell), the
response was defined as zero.
Single Position Flash Response – hyperpolarization
Same as above only the time window was until end of trial (due to
slower time course for inhibition) and the threshold was 2 SDs (due to lower
magnitude of hyperpolarization). These calculations were used for Fig. 2c and Supplementary Fig. 3.
Depolarization (Hyperpolarizatio) normalization
All detected averaged SPFRs for a given duration were normalized to
the maximal depolarizing (hyperpolarizing) absolute response for that
duration and that cell. Fig. 2c shows
the average of these normalized responses for a single duration. Supplementary Fig. 3a
shows the sums for each cell, for 4 normalized depolarizing responses, and
the 2 slowest hyperpolarizing responses (since hyperpolarization was hard to
detect for brief flashes). A position which showed the maximal response for
all durations, will therefore, have a value of 4 for depolarizations and 2
for hyperpolarizations.
Onset time calculation
Data during a window of 200ms before stimulus presentation from all
the single bar flashes presented to a cell was used for a per-cell estimate
of the standard deviation of the baseline. Only presentations in which the
average SPFR was detected as depolarizing were used for onset time
calculation. Onset time was defined as the time from stimulus presentation
(after it was corrected for a small display latency) in which the response
crossed 0.5 × baseline S.D. This threshold value did not exceed
0.5mV. This calculation is used in Fig. 2d and
e. Because positions in the center of the RF feature a mixture of
excitation and inhibition, the peak time is ‘contaminated’
by the inhibitory contribution and could not be used as an independent,
reliable measure of the properties of the excitatory input.
Decay time calculation
The decay time (from 80% to 20% of maximal response)
was calculated for all the cells and all the positions in which a
depolarizing response was detected. If 20% of max response was not
attained by the time recording ended, that data point was excluded. This
calculation is used in Fig. 2d and e.
Number of cells which passed the above threshold and were included in figure 2d is: 160ms – 17, 17, 17,
17, 17, 15, 9; 80ms – 15, 16, 17, 17, 16, 15, 6; 40ms – 11,
13, 16, 16, 16, 10, 8; 20ms – 1, 8, 11, 17, 12, 8, 4.
Slope calculation (onset and decay times)
To calculate the slope in Fig
2e, onset (decay) time values from all the positions of an
individual cell were fit with a linear regression. Fits were performed only
when more than 4 positions showed responses, to account for cases in which
not all positions showed detectable depolarizing responses (especially due
to fast flashes).
Summed SPFRs (superposition)
Single bar flash responses were aligned to the time of the
corresponding position appearance in the moving bar stimulus. Responses were
padded with zeros (since all were baselines subtracted) to extend brief
single bar responses to the timescale of a moving bar. This procedure was
used both for the two-step apparent motion stimuli and the moving bar
stimuli (Figs. 3 and 4, Supplementary Figs. 4, 5 and
7). For moving bar analysis, responses in which hyperpolarization did
not return to zero by the end of the recording were padded with a linear fit
to the last 250ms that was extended to zero (to avoid abrupt changes) Fig. 4: For this analysis one cell was
not included, since the dataset for this recording was incomplete.
Rectified SPFR sums
Same as for standard sum, but now all negative values in individual
SPFRs were set to zeros (Supplementary Fig. 5b).
Scaled center response SPFR sums
Individual SPFRs were replaced with a central response trace after
rectification scaled to the amplitude of the original positional response.
This was done to eliminate the positional change in response width (Supplementary Fig.
5b).
Average traces in Fig. 4d and f and
Supplementary Fig.
5a
Positions from all cells were aligned to the center zero position
before averaging. All trajectories that were longer than 8 positions were
included in this analysis. Some were extracted from the original 8 direction
stimuli (Fig. 1), which was used to
determine PD. And some from additional trajectories that were presented just
along the PD axis.
Squared sum in Supplementary Fig. 7
The averaged measured data (presented in Supplementary Fig.
7a) was squared (after baseline subtraction) for both the individual
presentations and the two-step positions. To generate the expected sum,
individual squared responses were temporally aligned and summed.
Spatiotemporal maps in Supplementary Fig. 8
Averaged SPFRs were aligned temporally to stimulus appearance. A
2-dimensional Gaussian was used to smooth the data both temporally
(σ = 250ms) and spatially (σ = 1 position)
using MATLAB function imgaussfilt. Contour plots were generated using MATLAB
function contourf, with a fixed ‘level list’ that was
manually generated in order to include smaller steps for the hyperpolarized
range. An identical procedure was performed on the simulated SPFRs and
simulated SPFRs without inhibition (see Conductance model for details).
Statistics
To determine statistically significant differences, one sided unpaired
Student’s t test were used for comparing groups (Fig. 2e and 4e, and Supplemental Fig. 5b). Data distribution was assumed to be normal
but this was not formally tested. No statistical methods were used to
pre-determine sample sizes, however our sample sizes are similar to those
reported in previous publications[49-51]. Data
collection and analysis were not performed blind to the conditions of the
experiments.
Data Plotting Conventions
All boxplots presented (apart for Fig.
2d) were plotted with MATLAB conventions. Box represents quartile,
line represents median, and whiskers represent the farthest point within the
q3 + IQR (q1 – IQR range). Boxplots
for Fig. 2d depict only median and
quartiles due to spatial constraints. For Fig.
1: triangles denote data points outside of the plot. For 20ms:
−0.19 and −1.14; for 40ms: −0.09 and −0.7. For
Fig. 4e: triangle denotes data point
outside of plot for measured 40ms: −0.7. Most plots used the cbrewer
color library from the MathWorks file exchange.
Conductance model
A model T4 neuron was implemented using the morphology of a single T4
cell, reconstructed from Electron Microscopy data. The neuron morphology was
generously shared by Kazunori Shonomiya and Janelia’s FlyEM project
team. The FlyEM team collected a data set containing approximately half of a
Drosophila optic lobe, that was imaged with isotropic 8-nm
voxels by focused ion-beam milling scanning electron microscopy (FIB-SEM). The
sample was prepared from the head of a female fly as previously reported, using
high-pressure freezing followed by freeze-substituted embedding[9,52]. A 153 × 85 × 180 μm volume
containing connected regions of the lamina, medulla, lobula, and lobula plate
was imaged. The imaged volume was segmented automatically based on an algorithm
similar to one previously described[53]. NeuTu-EM (https://github.com/janelia-flyem/NeuTu/tree/flyem_release) was
then used to proofread the segmented volume, where segmented fragments are
neurons are merged and split to form the complete morphology of neuron. A
reconstructed T4 neurons was identified based on its distinctive morphology,
with dendritic compartments spanning ~20 μm in the medulla and
an axon projecting to a distinct layer of the Lobula Plate. The reconstructed
neuron morphology contained 344 sections.To correct for a small number of inconsistencies in the reconstructed
morphology, the diameter of each dendritic section was smoothed by performing a
moving average of the diameter of five adjacent sections (using the TREES
toolbox[54]). The
simulation was implemented and run using NEURON v. 7.4 (http://www.neuron.yale.edu/neuron/). Analog synapses were placed
randomly throughout the dendritic arbor (after this compartment, containing 235
sections, was defined manually), and a recording electrode was attached to the
soma. After identifying the primary dendritic axis (used to simulate stimulation
along the PD-ND axis), we assigned each dendritic section a value corresponding
to its projection on this axis (x*, between 0-1). We subdivided the axis
into M=11 intervals, where each interval contains an equal number of
dendritic sections (21 sections), to map onto the 11 stimulated positions
required to enclose the T4 receptive field. We used 11 intervals since the
average traces in Fig. 4d, which were the
reference for the simulation, spanned positions −5 to 5 (see Analysis
subsection). Within an interval, we randomly selected a fixed number of
dendritic sections Ne = 9 (Ni = 5) where excitatory (inhibitory)
graded synapses were placed. The reversal potential for excitatory synapses was
set to 0 mV and to −70 mV for inhibitory synapses. The resting membrane
potential was set to −65 mV. To simulate a visual input, mimicking the
appearance of the bar at one position, all the synapses in the region were
activated with conductance dynamics that were uniform for all E and different,
but uniform for all I synapses. Our model simplifies the transformation from
visual input to a synaptic conductance change, so while we are not explicitly
modelling the optical and neuronal pre-filtering that is upstream of T4, these
details are implicitly incorporated into the model, since these elements
contribute to the SPFRs, which the model was fit to reproduce. To simulate an
analog synapse, we used the single electrode voltage clamp (SEVC) point process
and injected the inverse of our calculated conductance (see below) into the SEVC
Rs (zero conductance was change to 1e9 resistance).We computed the E and I conductance time course at each synapse ( , ) according to the following:
where C = {E, I}, and , denote the rise (decay) time constants of the
synapse, is the synaptic input, and scales the conductance based on the location of
the synapse on the PD-ND axis. We modeled the amplitude as a Gaussian profile
with an overall amplitude parameter , a peak location along the dendrite , and a width . We chose a Gaussian profile since it
reasonably approximates the spatial profile of E and I inputs measured for the
SPFRs (Fig. 2c, Supplementary Fig. 3a), and to
reduce the number of parameters needed to describe the inputs to the simulated
T4 neuron. The synaptic input, , was modeled as a pulse of unit amplitude with
duration T, where T = 20,40,80,160ms, and was identical for all the
synapses located within one of the M intervals.Three additional parameters of the neuronal model are the membrane
resistivity (Rm), membrane capacitance (Cm) and axial
resistivity (Ra). For all simulations, Cm was fixed at
Cm = 1 μF/cm2. We optimized the
remaining model parameters, by performing non-linear least square minimization
between the numerical simulation results combining all stimulus positions
(M=11 intervals) and durations (T) resulting in 44 simulated SPFR
responses, and the corresponding measured SPFRs (Supplementary Fig. 6c). The
minimization used lsqcurvefit() function from MATLAB’s Optimization
Toolbox. Having fit the model parameters using the single position flash
stimuli, we used the same optimized parameters to simulate the dynamics of the
model in response to moving bars (apparent motion stimulus). The moving bar
stimulus was implemented by sequentially activating the synaptic inputs along
the dendritic axis, with each interval being active for a duration T.
Model manipulations
(1) We moved all synaptic inputs (99 excitatory and 55 inhibitory
synapses) to the first dendritic section (Fig.
5d) (2) We removed all conductance changes mediated by the
inhibitory synapses, while keeping all other settings of the model fixed.
This was accomplished by setting the duration of all inhibitory synaptic
inputs to zero (Fig. 5d).
Single compartment conductance model
We modeled the time course of the membrane potential of a T4 neuron, , according to the single compartment dynamics
where is the membrane capacitance, is the leak conductance (resulting in the
integration time constant of the neuron , are the resting, excitatory and inhibitory
reversal potentials respectively (values in Table 1). The excitatory and
inhibitory conductances, , were modeled as a Gaussian weighted linear
combination of stimulus location-specific conductances, where, as for the model described in the
previous section, ., the conductance change elicited by the
presentation of a stimulus in one of locations follows the dynamics
Stimuli had characteristics identical to those
described in the previous section. All the parameters (with the exception of the
reversal potentials) were optimized to minimize the mean squared error between
model responses to single bar inputs and average T4 SPFRs.As with our original model, solutions to the single compartment model
conformed to one of the two solution clusters. The solutions either had fast
(< 10 ms) time constants for E and slow membrane time constant (solution
cluster 1), or slower E time constants and negligible membrane time constant
(solution cluster 2).When the time constant of the neuron is negligible compared to the time
scale of the stimuli and conductances ( , the relative change of membrane potential with
respect to the resting potential can be written as where , , and . This is the expression used in the model
schematics depicted in Figure 6.Note that if one combination of excitatory and inhibitory conductances
elicit an hyperpolarizing response, while a second one produces a depolarizing
response, under some conditions it is possible to obtain a
supralinear response to the superposition of the two sets of conductances
For instance, when , supralinearity is obtained if .
Data Availability
The data used to generate the primary results of this study are
available here: https://doi.org/10.25378/janelia.5576101The split-GAL4 driver used to target T4 cells (SS02344: VT015785-p65ADZp
(attP40); R42F06-ZpGdbd (attP2)) generously provided by Aljoscha Nern in Gerry
Rubin’s lab (line details with expression data available from http://splitgal4.janelia.org/).A Life Science Reporting Summary is included.
Code Availability
The essential code used to generate the primary results and conduct the
simulations for this study are available here: https://figshare.com/projects/Gruntman_et_al_2017_Data/26347Supplementary Figure 1: Mapping T4 Receptive Field center.
(a) Top: Expression pattern of the driver line used to
target T4 cells (SS02344; reoriented substack of a confocal image volume).
Similar images were obtained from at least 5 samples. Bottom: schematic of
experimental setup. Blue square represents the full mapping stimulus grid
from c. (b) Example single cell responses to ON
square flashes (~11° × ~11°),
presented for 140ms at each location on the indicated stimulus grid (inset).
Individual trials in grey (n=3 trials), mean in red. Inset: stimulus
grid of non-overlapping squares, subtending 78.75° ×
67.5° of the visual space. (c) Responses from example
cell in b to bright and dark square flashes (n=3 trials
each). Note stimulus grid contains overlapping position to map RF center
more precisely. Inset: stimulus grid highlighting two of the nine stimulus
positions with darker squares. This mapping procedure was performed for all
the cells in the dataset.Supplementary Figure 2: Dynamic characterization of the moving bar
responses. (a) Circular mean of preferred direction for all
recorded neurons (n=17 cells). Lines connect data from the same
cell, dot color depicts different speeds, and dot size the normalized mean
response vector magnitude (scale in inset). (b) Histogram of
the data in a, showing three distinct peaks corresponding to
three of the four cardinal directions. (c) Responses from an
example cell to eight different directions (rows) at four different speeds
(columns). Responses were temporally aligned to the time the bar crossed the
RF center. (d,e) Mean normalized vector calculated
for each time point for responses in c, decomposed into
magnitude (d) and angle (e). Vector was calculated
from temporally aligned responses presented in c. This
procedure can be considered as mimicking a downstream cell’s readout
of differently tuned T4 cells with RFs centered at the same spatial
position. Note, that for the two slower speeds the instantaneous PD flips by
180° after the bar passes the center (seen clearly in 10 out of 17
cells), since PD response is now hyperpolarizing while ND response is
depolarizing (denoted with grey lines).Supplementary Figure 3: Peak depolarization/hyperpolarization of T4
receptive field. (a) normalized peak depolarization (red) and
peak hyperpolarization (blue) for all recorded neurons. PD is always from
left to right, with peak depolarization aligned to position zero. A value of
4 indicates that a particular position gave the strongest depolarizing
response at all four speeds, and a 2 indicates the largest hyperpolarizing
response (since only the two slowest speeds were considered).
(b) angular distance between peaks of the hyperpolarization
and depolarization curves in a. Black line indicates the
mean.Supplementary Figure 4: T4 responses to two-step apparent motion
show null direction suppression, but no preferred direction enhancement.
Averaged baseline-subtracted responses (mean ±SEM) to 160 ms flash
stimuli presented along the PD-ND axis (n=17 cells; expanded version
of stimulus protocol shown in Figure 3,
applied to all recorded T4 neurons, here the five positions sampled are
shifted by one position towards the leading side). Stimulus positions
indicated correspond to those in Figure
2. Apparent motion is presented as two flash stimuli timed to
match their appearance during 14°/sec motion. PD motion indicated in
red and ND motion in blue. Single position responses are shown in grey along
the diagonal. Grey traces above and below the diagonal are time-aligned sums
(computed as the superposition) of the diagonal SPFRs. Note no enhancement
for second bar responses on the lower triangle, and clear suppression on
most responses in the upper triangle.Supplementary Figure 5: Removing features of the SPFRs eliminates
directional selectivity. (a) Comparison of PD and ND responses
(mean ±SEM, n=31 trials, 16 cells) for measured, summed,
rectified (summed after hyperpolarization removal), and scaled (rectified
and central position response scaled to the magnitude of each position)
traces. Data presented in measured and summed is reproduced from Figure 4d. (b) Boxplots for
Directional Selectivity Index for the same data as in a by
manipulation and speed, grey crosses represent outliers (* DSI
> 0, p < 0.01 one sided unpaired t-test). The response
reconstructed from amplitude-scaled versions of the SPFR at the central
position (which lack the temporal sharpening of the depolarized component of
trailing-edge SPFRs) do not show directional selectivity (see Methods for
boxplot conventions). (c) Schematic for procedures applied in
rectified and scaled versions of the SPFR sums in a.Supplementary Figure 6: detailed simulation results.
(a) Summary of 1000 independent nonlinear least squares fits of
simulation parameters (from randomized initial conditions) optimized to
reproduce the measured SPFRs. Plot of sorted sum of squared errors (SSE)
between each optimization result and the original SPFRs summed over all
positions and all speeds. The smallest error solutions are found in distinct
clusters that are color-coded and are further examined in b and
c. (b) Spatial and temporal filters for
excitatory and inhibitory conductances. Top: normalized conductance weights
for excitatory (thicker line) and inhibitory synapses in three optimization
solution-clusters. Each subplot shows resulting Gaussian distribution from
10, nearly indistinguishable, solutions (chosen at random) from each
solution-cluster. Gaussian amplitudes were scaled by RM (inverse
of the leak conductance), and the excitatory amplitude was further scaled by
10 for comparison. In solutions 1 and 2, the excitatory distribution is
broader, and shifted towards the leading side (0 marks the leading end of
the dendrite). Bottom: excitatory and inhibitory conductance pulse for 160ms
flash. In solutions 1 and 2 the excitatory conductance is much faster. Figure 5 data is based on Solution 1,
example parameter values in Table 1. (c) Comparison of measured
SPFR data used for fitting and the simulation results. Only 160ms flashes
are shown, but the model was fit to SPFRs at all 4 speeds. Each row shows a
single solution from each solution-cluster labeled in a. The
3rd solution does not account for hyperpolarization on the
trailing side and was therefore excluded from further analysis.
(d) Comparison of mean measured responses to simulated
predictions with parameters from solution-cluster 2 (Table 1) to moving bars
in the PD and ND. (e) Peak PD and ND responses measured in T4s,
compared to the solution-cluster 2 simulation results from d.
(f) DSI for the mean measured responses compared to the
simulation results from d. Note, Figure 5 b,c,d show these comparisons using solution 1
parameters.Supplementary Figure 7: The voltage to calcium transformation may
produce enhancement artifacts. (a) Averaged baseline-subtracted
responses (mean ±SEM) to 40 ms flash stimuli presented along the
PD-ND axis (n=3, same cells as in Figure 3) in black. Single position responses are shown along
the diagonal. Stimulus positions indicated correspond to those
in Figure 2. The time-aligned sum of
the single position responses is shown in grey. Lines demarcate the maximal
response to second bar in 3 example positions. Arrows point from summed
prediction to measured response. (b) As a proxy for possible
effect of calcium imaging[21], the same data as in a are plotted after
squaring the measured responses (in green). Grey traces represent the sum of
squared single position responses. Note that in example i, the
slight suppression in a, becomes an enhanced response in
b, in example ii minimal enhancement becomes
strong enhancement, and in example iii suppression is
maintained.Supplementary Figure 8: The space-time tilt in the T4 spatiotemporal
receptive fields likely originates from the interaction of excitatory and
inhibitory inputs. Left: SPFRs from Figure
2a (for 160 ms flashes) were replotted to approximate the
spatiotemporal RF of T4 cells. Dashed line indicates stimulus onset. The RF
features an excitatory lobe peaking in the center (by construction) and an
inhibitory lobe following in time and offset spatially towards the trailing
side. In this representation, the leading side responses appear to rise more
slowly than central responses, which could conform with the temporal delay
requirement of an HR-like mechanism. Center: the spatiotemporal RF plotted
from simulated SPFRs generated from the (Solution 1) model parameters. Lines
correspond to response traces shown in the inset. The apparent difference in
onset times for different positions is the result of different response
magnitudes, since the time constants at different positions in the model are
identical by construction. Right: similar to center, but from simulations
where the inhibitory inputs were removed. Note that in the model, removing
inhibition completely removes the inseparable property of the filter,
showing that the contribution of inhibition is sufficient to account for the
tilt in the excitatory lobe.
Authors: Matthew S Maisak; Juergen Haag; Georg Ammer; Etienne Serbe; Matthias Meier; Aljoscha Leonhardt; Tabea Schilling; Armin Bahl; Gerald M Rubin; Aljoscha Nern; Barry J Dickson; Dierk F Reiff; Elisabeth Hopp; Alexander Borst Journal: Nature Date: 2013-08-08 Impact factor: 49.962
Authors: Helen H Yang; François St-Pierre; Xulu Sun; Xiaozhe Ding; Michael Z Lin; Thomas R Clandinin Journal: Cell Date: 2016-06-02 Impact factor: 41.582
Authors: Sebastian Molina-Obando; Juan Felipe Vargas-Fique; Miriam Henning; Burak Gür; T Moritz Schladt; Junaid Akhtar; Thomas K Berger; Marion Silies Journal: Elife Date: 2019-09-19 Impact factor: 8.140
Authors: Margarida Agrochao; Ryosuke Tanaka; Emilio Salazar-Gatzimas; Damon A Clark Journal: Proc Natl Acad Sci U S A Date: 2020-08-24 Impact factor: 11.205