Literature DB >> 29311742

Simple integration of fast excitation and offset, delayed inhibition computes directional selectivity in Drosophila.

Eyal Gruntman¹, Sandro Romani¹, Michael B Reiser².

Abstract

A neuron that extracts directionally selective motion information from upstream signals lacking this selectivity must compare visual responses from spatially offset inputs. Distinguishing among prevailing algorithmic models for this computation requires measuring fast neuronal activity and inhibition. In the Drosophila melanogaster visual system, a fourth-order neuron-T4-is the first cell type in the ON pathway to exhibit directionally selective signals. Here we use in vivo whole-cell recordings of T4 to show that directional selectivity originates from simple integration of spatially offset fast excitatory and slow inhibitory inputs, resulting in a suppression of responses to the nonpreferred motion direction. We constructed a passive, conductance-based model of a T4 cell that accurately predicts the neuron's response to moving stimuli. These results connect the known circuit anatomy of the motion pathway to the algorithmic mechanism by which the direction of motion is computed.

Entities: Chemical

Mesh：

Substances：

Year: 2018 PMID： 29311742 PMCID： PMC5967973 DOI： 10.1038/s41593-017-0046-4

Source DB: PubMed Journal: Nat Neurosci ISSN： 1097-6256 Impact factor: 24.884

The computation of directional selectivity has been studied for decades in both vertebrate and invertebrate visual systems and given rise to competing algorithmic models[1]. The Hassenstein-Reichardt (HR) detector uses a synergistic combination of offset excitatory inputs[2] to enhance responses to motion in the preferred direction, whereas the Barlow-Levick (BL) detector uses inhibitory input to “veto” an offset excitatory input[3], suppressing motion in the non-preferred, or null direction. The HR detector, which was originally formulated to account for motion detection in insects, has become a canonical example of a neuronal computation[1,4]. This model has endured since its elegant mechanism accounted for a wide array of behavioral results[2,5,6] and the detailed response properties of large motion sensitive output neurons of the fly visual system[1,7,8]. Recent progress on the visual circuits upstream of these motion sensitive neurons in Drosophila has revealed that stimuli are processed through ON and OFF pathways[9-14]. In the ON pathway, the columnar neurons T4, are the first cell type to exhibit directionally selective signals[10,11,14,15]; four T4 subtypes are each directionally selective in one of four cardinal directions[9,11,16]. A prevalent hypothesis for how T4 cells could implement the HR detector was proposed based on different input cell types providing the temporally[14] and spatially[9] offset inputs, however, further analysis of the circuit[15,17] has ruled out this circuit mechanism. The other recent approach to identifying the mechanism responsible for directional selectivity in the T4 circuit has been to use calcium imaging to measure T4 responses to structured visual stimuli. Several studies found evidence for enhanced responses to preferred direction motion[18,19], while several others found both preferred direction enhancement and null direction suppression[20-22]. A number of groups have recently proposed that motion detection in flies may be implemented as a hybrid mechanism featuring elements of both the HR and BL algorithms[15,21-23]. However, direct evidence for either mechanism has been elusive, in large part because these studies rely on calcium imaging. Calcium indicator responses are insensitive to fast events, obscuring the small timing differences required by the HR detector, and are also insensitive to hyperpolarization, thereby preventing the direct measurements of inhibition, a defining feature of the BL model.

Results

To directly measure the physiological properties of T4 neurons, we used targeted in-vivo whole-cell electrophysiology. We confirmed the identity of GFP-labeled T4 neurons by measuring reliable depolarizations in response to small ON flashes (Fig. 1a and Supplementary Fig. 1). We used on-line stimulus generation and analysis to localize the center (within ~2°) of the receptive field (RF) of each recorded neuron (Supplementary Fig. 1b,c). To measure directional selectivity in single T4 cells, we presented a narrow (~2° wide) ON bar moving in 8 different directions through the mapped RF center (Fig. 1b,c). The T4 membrane potential showed small depolarizations in response to several directions of motion (Fig. 1b), but large responses to movement in a ~90° range, centered around the preferred direction (PD; opposite to the null direction, ND), in agreement with the tuning width of measured with calcium imaging[11]. Our moving bar stimulus generates apparent motion: the bar appears to move in a series of discrete (~2°) steps, where the speed of movement is determined by the duration the bar remains at each position. To examine the computation of directional selectivity across a behaviorally relevant range, we presented 4 movement speeds. The responses to the moving stimuli were averaged across individual recordings once aligned to the PD (Fig. 1c). We quantified the strength of directional selectivity using a directional selectivity index (DSI, defined in Methods, analysis section) and saw significant tuning at all stimulus speeds (p< 0.05, one sided unpaired t-test), with stronger directional selectivity at the two slower speeds (p< 0.05, one sided paired t-test, Fig. 1d). At slow speeds, the membrane potential exhibited prominent hyperpolarizing responses, both following the response to PD motion, and preceding the depolarizing response to ND motion (arrowheads in Fig. 1c, detailed in Supplementary Fig. 2). The sequence of depolarizing and hyperpolarizing signals depended on the direction of motion, suggesting that inhibitory and excitatory inputs to T4 may be spatially offset in agreement with recent anatomical findings[17].

Figure 1

Whole cell recordings of T4 cells show directionally selective responses

(a) Top: schematic of Drosophila visual system with example T4 cell; whole-cell recordings are targeted to cell body. Bottom: a moving bar (1 × 9 LEDs) is presented as a sequence of discrete steps. (b) Responses to a bright bar moving in 8 directions (indicated by arrows). Individual trials in grey (n=3 trials), mean in maroon. Center: response to each direction at the time of maximal directional selectivity. (c) Baseline-subtracted responses (n=17 cells) to bar motion, aligned to the PD of each cell (mean ± SEM). Arrowheads indicate hyperpolarization following (preceding) response to PD (ND) motion. Colored horizontal bars indicate stimulus presentation. (d) Directional Selectivity Index for bar motion responses (n=17 cells). Crosses represent outliers, and triangles denote data points outside of the plot (see Methods for boxplot conventions).

Mapping the T4 receptive field reveals a spatiotemporal asymmetry

In order to characterize the fine structure of T4’s RF, we first localized the RF center and identified the PD-ND axis. We decomposed the moving bar stimulus into its elementary components—single position bar flashes with a fixed duration—and presented them at all positions, randomly interleaved, along the PD-ND axis. Aligning the single position flash responses (SPFRs) based on the position of peak depolarization (Fig. 2a) reveals two key properties. (1) The center and leading side of the T4 RF exhibit excitatory responses. These responses are larger and last longer for longer flash stimuli (corresponding to slower speeds). Additionally, longer flashes presented on the leading side of the RF reveal depolarizing responses that are not observed for short flashes. (2) The inputs along this axis are spatially asymmetrical; hyperpolarizing inputs only appear on the trailing side of the RF (most clearly seen for the longer duration flashes). Overlaying voltage traces from leading and trailing RF positions exposes a temporal asymmetry in the inputs (Fig. 2b,d). Responses on the trailing side decay faster than responses at equivalent positions on the leading side. This temporal sharpening can be seen across speeds, including the shortest duration flash where hyperpolarization was not directly recorded (arrowheads in Fig. 2b).

Figure 2

T4 cells receive spatially, but not temporally, offset excitatory and inhibitory inputs

(a) Averaged baseline-subtracted responses (mean ±SEM) to single position bar flash stimuli (top) along the PD-ND axis of each cell (n=17 cells). (b) Mean responses from indicated positions in a aligned to stimulus presentation (grey rectangles). Arrowheads emphasize differences in decay times. (c) Maximum depolarizing and hyperpolarizing responses (mean ± SEM) by stimulus position for 80ms flashes (n=17 cells). (d) Response onset time and decay time to flashes near the RF center (median and quartiles). Cell number varies for each position and duration combination (see Methods Onset time calculation for details). (e) Slope of the linear regression of onset time against position and the slope of the decay time against position, calculated separately for each cell and each duration (n=17, 17, 14, 9 cells; * slope < 0; p < 0.01, one sided unpaired t-test; grey crosses represent outliers; see Methods for boxplot conventions).

For the longer duration flash stimuli our RF mapping (Fig. 2c and Supplementary Fig. 3), shows a spatially offset distribution of depolarizing and hyperpolarizing inputs to T4, with the hyperpolarizing input on the trailing side of RF. Remarkably the offset between the peaks of excitatory and inhibitory inputs (~6° of visual angle; Supplementary Fig. 3b) corresponds to the approximate angular separation between adjacent ommatidia on the fly eye, matching the predicted structure for the inputs of the Drosophila motion detector[5]. Because we do not measure excitatory and inhibitory currents directly, the underlying overlap between excitation and inhibition is expected to extended beyond positions where we can robustly measure hyperpolarizing input. If T4 implements an HR-like mechanism, we expect that SPFRs on the leading side of the RF would be time delayed relative to trailing side SPFRs[2,14]. A simple expectation for relative time delays between offset visual stimulation generated through differential filtering by either upstream cells and/or synaptic transmission is that the responses would occur with a relative temporal offset. However, the onset time of the flash response is constant across the RF (Fig. 2d,e). Nevertheless, a significant position effect is seen in the decay time (Fig. 2d,e), which we attribute to temporal sharpening (arrowheads in Fig. 2b) due to overlapping inhibitory inputs on the trailing side of the RF (Fig. 2c). Rather than a relative time delay between offset inputs, we find that the temporal differences in the SPFRs can be explained by fast excitatory inputs on the leading side combined with slower inhibitory inputs on the trailing side of the RF.

Directional selectivity results from ND suppression

The classical models for motion detection nonlinearly combine offset inputs, resulting in either an enhancement of PD responses (HR) or a suppression of ND responses (BL). A simple test for directional selectivity is to deliver flash stimuli at two positions along T4’s RF and see whether the response to sequential stimulation in one direction is larger than the response to the opposite sequence of stimulation. We implemented this two-step apparent motion stimulus using single position flashes (160 ms duration) for pairs of positions around the RF center (Fig. 3a). To assess the effect of direction, we compared each two-step response to the superposition of the corresponding two single position responses (offset in time and summed; Fig. 3b). This comparison reveals no enhanced response to the second bar appearing towards the PD (left column), but a reduction in the second bar response when it appears towards the ND (arrow, right column). Flashes separated by ~10° (bottom row) do not appear to interact, showing neither enhancement nor suppression. Furthermore, the suppression is most prominent for bar pairs ‘moving’ in the ND that begin on the trailing side, consistent with our finding that the distribution of inhibitory inputs is skewed to this side of the RF. By comparing the apparent motion responses to the same bar pairs, but in opposite directions (Fig. 3c), we see that the leading side of the RF—corresponding to the primarily excitatory domain of the SPFRs (Fig. 2)—is motion blind (see also Supplementary Fig. 4). Strong directional selectivity is only seen for the bar pairs that include the trailing side of the RF, and is only produced by ND suppression.

Figure 3

T4 responses to two-step apparent motion show null direction suppression, but no preferred direction enhancement

(a) Averaged baseline-subtracted responses (mean ±SEM) to 160 ms flash stimuli presented along the PD-ND axis (n=3 cells). Single position responses are shown along the diagonal, PD motion (red) and ND motion (blue) are shown in the lower and upper triangle, respectively. Stimulus positions indicated correspond to those in Figure 2. Apparent motion is presented as two flash stimuli timed to match their appearance during 14°/sec motion. Space-time plots of the stimuli are depicted in each inset. (b) Comparison of mean measured responses to two-step stimulation to the superposition (time-aligned sum) of the single position responses. Each row shows responses to stimuli presented at the same positions but in the reversed temporal order. (c) Mean PD and ND responses from the corresponding rows in b are compared, with the Directional Selectivity Index indicated. The DSIs are averaged from 3 cells (individual values for top panel: 0.02, 0.06, 0.1; for middle panel: 0.47, 0.5, 0.16).

The superposition of stationary responses generates directional selectivity

The two-step apparent motion responses (Fig. 3) suggest that it is the integration of offset excitatory and inhibitory inputs that generates directionally selectivity. However, is this mechanism sufficient to explain the directionally selective responses to moving bars? To address this question, we compared the responses to moving bar stimuli, comprised of an ordered sequence of single position bar flashes (Fig. 4a,b), with the superposition of SPFRs (the sum after appropriate temporal alignment; Fig. 4c). Importantly, the only difference between the summed PD and the summed ND responses was the order in which the SPFRs were aligned. This comparison is conceptually similar to our treatment of the two-step apparent motion responses, but now many interactions across the entire receptive field are being probed. We find that the simple sum of SPFRs (Fig. 4c) captures the essential dynamics of the moving bar response (Fig. 4c,d) and shows significant directional selectivity at all four speeds (Fig. 4e).

Figure 4

Ordered summation of single position flash responses qualitatively reproduces moving bar responses

(a) Schematized T4 dendrites overlaid on 9 bar stimulus positions (colors as in Figure 2). (b) Mean PD and ND motion responses from an example cell to the 28°/s bar motion. (c) Top: Single Position Flash Responses (SPFRs) from the example cell in b colored by position, temporally aligned to account for moving bar position. Bottom: Summed SPFRs for PD and ND motion. (d) Comparison of mean measured moving bar responses and mean summed SPFRs (n=31 trajectories, 16 cells). (e) Boxplots for Directional Selectivity Index for measured and summed responses across speeds (n=31 trials, 16 cells; see Methods for boxplot conventions). Crosses represent outlier, triangle denotes a point outside the scope of the plot (* DSI > 0, p < 0.01 one sided unpaired t-test). (f) Same data as in d, juxtaposing measured responses and summed SPFRs for PD and ND motion separately.

The summed responses approximated the measured responses without any nonlinear integration, a surprising result since both the HR and BL models of motion detection require a nonlinear interaction to compute directional selectivity. To be clear, we are not claiming that directional selectivity emerges from linear operations (for example, the DSI computation is nonlinear), but rather emphasize that these directionally asymmetric responses expose a significant discrepancy between the T4 mechanism and the classical models. To gain further insight, we examined features of the SPFRs for contributions to directional selectivity in the summed responses, and found that it is the position dependent differences (such as the response shape, sign, and magnitude) that account for the DSI of the summed responses. (Supplementary Fig. 5). However, the summation of aligned SPFRs only accounts for approximately half of the measured T4 DSI (Fig. 4e). Summing negative and positive potentials will undercount the effect of inhibition. For example, in shunting inhibition a small inhibitory current can eliminate a much larger excitatory current. The effect of this undercounting is most prominently seen in the overestimated ND responses (Fig. 4f). Does this mismatch suggest that T4 implements a nonlinear step like the classical models, or could biophysically realistic integration of inhibitory inputs explain the additional measured DSI?

A conductance-based simulation quantitatively predicts the T4 motion response

To answer this question, we built a conductance-based model of a T4 neuron reconstructed from electron microscopy data (Fig. 5a). In our model, we randomly distributed excitatory and inhibitory synapses along the dendritic arbor of the cell, but fit the synaptic strengths, the time course of the excitatory and inhibitory conductance changes, and membrane and axial resistivity, using an optimization procedure that minimizes the error between simulated responses to bar flashes and our measured SPFRs (from all the positions and all the speeds; Fig. 5a, Supplementary Fig. 6a,b,c). We then used the resulting model to simulate T4’s response to a moving bar (Fig. 5b). Importantly, we fit our model’s parameters using responses to stationary stimuli (the SPFRs), but used the model to predict the responses to an independent motion stimulus (the moving bar responses), thus testing the generalization of this model.

Figure 5

Asymmetric inputs in a conductance based model predict directionally selective responses

(a) Anatomical reconstruction of a T4 cell used in the simulation, with expanded dendrite showing positions of modeled excitatory (dark green) and inhibitory (light green) synapses. Enlarged markers indicate active synapses for an example stimulus position. Spatial filters, which determine synaptic weight, are shown normalized and aligned to the dendrite, and temporal filters are shown in inset. (b) Mean measured motion responses (from Figure 4) compared to the simulated predictions. (c) Peak PD and ND responses measured in T4s, compared to the Summed SPFRs (from Figure 4) and simulation results. (d) Direction Selectivity Index for the mean measured responses compared to: (Left) model simulation results and summed responses; (Center) simulation results without inhibition; (Right) simulation results with all synaptic inputs placed in same location, and a separate single-compartment simulation (detailed in Methods).

In comparison to the summed responses (Fig. 4d), the simulated responses more accurately reproduced the measured responses to both PD and ND motion (Fig. 5b,c). Furthermore, the same simulation qualitatively reproduced the dynamics of T4 responses (Fig. 5b) and quantitatively reproduced the magnitude of the directionally selective responses (Fig. 5c,d and Supplementary Fig. 6d,e,f) at all tested speeds, suggesting that the same mechanism can account for directional selectivity, even at the fastest speeds tested where hyperpolarization was not directly measured (Fig. 2). One classical neuronal computation of directional selectivity based on passive cable properties[24], is that sequential depolarization directed towards the (thicker) axon generates stronger responses than activation in the opposite direction. To test whether this mechanism could contribute to directional selectivity in T4, we zeroed all inhibitory conductance changes in our model. When the simulation was repeated, only the excitatory synapses were activated as the “stimulus” swept in the PD or ND. We found that in the absence of inhibition, the DSI of the simulated T4 is abolished at all speeds (Fig. 5d, middle), indicating that depolarizing inputs alone cannot produce directional selectivity. This suggests that inhibitory inputs to T4, in addition to providing slow hyperpolarization that characterizes responses to slow motion (Fig. 1c), also establish the temporal sharpening of trailing side responses, critical for directional selectivity at faster speeds (Supplementary Fig. 5b). The striking retinotopic alignment of T4 dendrites with the PD-ND axis was an important anatomical clue in their identification as directionally selective cells[9,25]. We used our simulation to test whether the neuron’s morphology substantially contributes to directional selectivity. By positioning all 154 synaptic inputs (inhibitory and excitatory) at the base of the dendrite (all other parameters held constant), we found that these simulation results were almost indistinguishable from the unmodified simulation (Fig. 5d, right). As a consequence of this result, we expected that a single -compartment neuron simulation should also be able to capture the response dynamics of T4. Indeed, this simpler model reproduces the moving bar responses of T4 with only negligible differences to the multi-compartment simulation results (Fig. 5d, right). These results suggest that the critical role of T4’s elaborate and directional dendrite is limited to collecting different input signals across a small region of the fly’s eye. Taken together, these modelling results corroborate our central finding that inhibitory inputs sculpt the essential asymmetry for directionally selective responses in T4 cells—no additional mechanisms are required.

Discussion

We used intracellular recordings to probe the mechanism of directionally selectivity for ON motion in T4 neurons. Using on-line stimulus generation, we finely mapped the receptive field of T4 neurons (Fig. 2). Remarkably, this receptive field structure, comprised of responses to stationary ON bars, can be used to predict the response of T4 to moving bars (Fig. 4). We then improved these predictions with a parsimonious, biophysical model integrating offset excitatory and inhibitory conductances (Fig. 5). In the present study, we show that directional selectivity in T4 neurons arises solely from ND suppression. Recent studies that used calcium imaging of T4 neuron responses to a two-step apparent motion stimulus found evidence for PD enhancement[18,19] or for both PD enhancement and ND suppression[20,22]. These discrepancies may arise from differences in the stimulus design, differences between single neuron and population recordings, differences between measurements of membrane potential versus calcium indicator fluorescence, or from combinations of these factors. Since the transformation between membrane potential and calcium indicator fluorescence is likely to be superlinear[21], measured calcium responses may appear enhanced when compared to their summed components, even though the underlying voltage response does not show this enhancement (Supplementary Fig. 7). However, as Fisher, et al.[18] have controlled for this possibility, we believe that the difference is most likely explained by differences in stimulus design (centered bars versus edges[18]; centered narrow bars versus centered wider spots[20]). Haag, et al.[20] found PD enhancement in T4 cells, but only for stationary stimuli wider than those used here. These larger stimuli, in addition to strongly depolarizing the measured T4 cells, are also effectively stimulating neighboring T4 cells. But since T4 neurons synapse onto other T4 cells with exquisite precision—only cells with the same directional tuning are connected, and only in the direction of motion[17]—this circuit mechanism should produce a superlinear response in one direction (along the PD), but not in the opposite direction. This enhancement of PD motion over a slightly larger spatial scale appears to be evidence for directed facilitation from neighboring T4 cells, but not for an HR-like mechanism responsible for directional selectivity (since blocking synaptic transmission in T4 neurons does not substantially reduce directionally selective responses in T4 cells[15,22]). This factor may partially explain the measurement of PD enhancement in Fisher et al.[18], but future experiments will be required to clarify these discrepancies. Anatomical studies have shown that dendrites of T4 cells are aligned with the PD[9,25] and receive spatially offset synaptic inputs from distinct columnar neuron types[17]. The arbors span several (3-4) retinotopic columns in the medulla[16], which given the ~5° visual angle per column, corresponds well to our measured RF width of ~20° (Fig. 2). The cholinergic columnar neurons Mi1 and Tm3 preferentially synapse onto T4s in the central to distal part of the arbor, while the GABAergic neurons Mi4, C3, and CT1, synapse mainly onto the base of the dendrite[15,17,26,27]. This anatomical arrangement agrees remarkably well with the functional organization of our measured T4 receptive field, with leading-side excitation and trailing-side inhibition. A recent study from our lab used circuit perturbations to also propose a hybrid mechanism for T4 motion responses[15]. Although we showed nonlinear integration of excitatory inputs by T4, we did not show that this nonlinearity contributes to the computation of directional selectivity, so these results do not conflict with the current findings. In fact, EM-based circuit analysis[9,17], as well as a number of functional studies[14,15,23,26,28,29], suggests that multiple cell types (at least Mi1 and Tm3) contribute to the excitatory ON component of our measured RF, without evidence for a spatial offset required for an HR-like mechanism[17]. In our previous study, we also showed that silencing one of the main inhibitory inputs to T4 (Mi4) does not reduce directional selectivity[15]. Since our current results strongly suggest that inhibition is necessary for computing directional selectivity, it is entirely possible that the critical inhibitory inputs to T4 are not simply contributed by a single cell type (other candidates include CT1, C3, and TmY15[17]).

Synthesis of experimental results with classical computational models for motion detection

Historically, the computation of directional selectivity has been conceptualized with algorithmic models that have outlined the general properties that transform non-selective inputs into a directionally selective output. Here we compare our results with the implications of the classical models, and present an algorithmic summary of our findings. Motion detector models typically describe a ‘fully-opponent’ computation, whereby local motion estimates in opposing directions are subtracted to minimize non-motion signals[1]. Since T4 is now understood to reside one step before this opponent subtraction[30], we only consider the subunits of each model type before this subtractions stage. The classical model for insect motion detection is the Hassenstein-Reichardt Detector[2]. This detector generates directional selectivity by delaying leading side excitatory input and combining it with trailing side excitatory input with a nonlinear amplification, thus enhancing the preferred direction response (Fig. 6a). If T4s directional selectivity was based on an HR-like mechanism, we would expect to find: (1) excitatory inputs spanning the receptive field, (2) differing temporal filtering of inputs resulting in a delayed leading side, and (3) enhanced responses to PD motion (Fig. 6a). We did not find evidence for any of these properties: (1) T4 receives offset excitatory and inhibitory inputs (Fig. 2c), (2) leading side responses are not temporally delayed (Fig. 2d,e and Supplementary Fig. 8), and (3) PD motion is not enhanced (Fig. 3,4,5). Furthermore, the excitatory region of the receptive field is motion blind (Supplementary Fig. 4).

Figure 6

Comparison the T4 mechanism to classical computational models for directional selectivity

(a) The Hassenstein-Reichardt detector[2] uses slower filtering on the leading side arm and a multiplicative nonlinearity to produce coincidence detection of excitatory signals. The time constants are labeled as E-Lead and E-Trail, where ‘E’ indicates filtering of an excitatory signal. The semicircles represent two neighboring inputs (arms) to the motion detectors. (b) The Barlow-Levick detector[3] combines a slow trailing side ‘inhibitory’ arm with a faster leading side ‘excitatory’ arm through an ‘AND-NOT’ operation that vetoes motion in the null direction. (c) The Adelson-Bergen detector[31], uses oriented spatiotemporal filters, created from offset spatial and fast and slow temporal filters, to produce directional selectivity. (d) A summary of the T4 mechanism formatted for comparison to the algorithmic models, based on the Torre-Poggio model[35]. The simple RF structure of T4 (Supplementary Fig. 8), does not require the more complex spatial and temporal filters used in the AB model. Instead we use filters fit to T4 measurements (Figure 5), that produce a fast, excitatory leading signal and a delayed, inhibitory trailing signal. As in the AB model, directionally selective responses can be obtained through a linear combination of these signals (grey lines). The dynamic nonlinearity operates on the magnitude of excitatory (E) and inhibitory (I) conductances (detailed in Methods) to produce a directionally selective membrane potential response.

An alternative classical model is the Barlow-Levick detector[3] that integrates fast leading-side excitation and slower trailing-side inhibition through the nonlinear ‘AND NOT’ operation (Fig. 6b). Directional selectivity is computed since motion is signaled when excitatory input precedes the inhibitory input; in the reverse order ND motion is suppressed. We find several properties in our T4 recordings that agree with this framework: (1) leading-side excitatory responses and trailing-side inhibitory responses (Fig. 2), (2) longer temporal filtering on the trailing side, such that inhibition outlasts excitation (Fig. 2, 5 and Supplementary Fig. 8), and (3) clear evidence for ND suppression with both two-step and moving bar apparent motion (Fig. 3c, 4f). While the organization of the T4 RF agrees with the algorithmic principle of the BL model, we find an important difference in the integration mechanism. The BL model uses a strong nonlinearity, and so does not produce directionally selective responses from the superposition of the responses to stationary inputs (which our T4 recordings show, Fig. 3,4). This seemingly technical distinction is in fact quite consequential, since in both the HR and BL models, the nonlinearity is a defining characteristic—without it there are no directionally selective responses. In contrast, our T4 results show that directional selectivity is already present in the superposition of the stationary stimulus responses. A 3rd class of motion detectors—the Adelson-Bergen (AB) motion energy model[31]—generates directional selectivity by using filters that are oriented in space-time. These linear filters are combined spatial and temporal filters that feature a prominent space-time tilt indicating the preferred direction. In Fig. 6c we present only the relevant subunit of the full AB model for comparison with our data. This subunit has (1) different spatial filters on each of its arms, (2) excitatory and inhibitory inputs, (3) a linear integration step that produces a directionally selective response from the superposition of stationary responses (as seen in simple cells of the cat visual cortex[32]), and (4) a static amplifying nonlinearity. This flexible model, whose commonalities with variants of the classical motion detection models have been broadly discussed[1,31,33], provides a useful framework for representing the T4 computation. In light of the commonalities and discrepancies between classical algorithmic models and our results, we next provide a summary of our understanding of T4 in a format that is directly comparable to these models (Fig. 6d), with an excitatory and an inhibitory arm. The more complex spatial and temporal filters of the AB model, selected to match cortical recordings, are replaced with the filters describing T4 responses (Fig. 5 and Methods). The spatial filters create the leading vs. trailing offset, and the temporal filters create the fast vs. slow difference, between the excitatory and inhibitory arms. In contrast to the ‘point sampling’ inputs of the classical HR and BL models, T4 shows spatially extended responses to visual input (Fig. 2b), which has previously been shown to improve the accuracy of motion detection[34]. While the sum of the linear filter outputs produces asymmetric responses to PD and ND motion (Fig. 6d), T4 neurons further enhance this selectivity through nonlinear integration. While the AB model employs a static, amplifying nonlinearity, we measure a suppressing nonlinearity, which is well approximated with a simplified passive, biophysical model (similar to a previous formulation[35]; Fig. 5d, 6d, and Methods).

Spatiotemporal receptive fields of T4 cells

Representing the response properties of neurons using spatiotemporal receptive fields, equivalent to the response of the spatiotemporal filters in the AB model, has provided significant insight into transformation along the visual pathway of mammals[36], and has recently also been applied to the responses of directionally selective neurons in flies[19,21]. We have replotted the averaged SPFRs to approximate the spatiotemporal receptive field of a T4 neuron (Supplementary Fig. 8, left). As expected for a directionally selective neuron, we find a distinctive tilt in this receptive field, indicating the preferred direction. This RF has both excitatory and inhibitory lobes, which both appear tilted, an organization that may suggest PD enhancing and ND-suppressing mechanism[19,21]. To explore the basis of the RF tilt, we used our multi-compartment T4 model, and replotted the simulated SPFRs (Supplementary Fig. 6c) as a spatiotemporal RF (Supplementary Fig. 8, middle). Since the model was fit to the SPFR data, the qualitative agreement between these RFs is not surprising. The simulated RF also features a tilted excitatory lobe. When we removed inhibition in our model (Fig. 5d) and reproduced the RF (Supplementary Fig. 8, right), we found that the excitatory lobe was no longer tilted, consistent with our inference that inhibition sharpens trailing side responses (Fig. 2b). This simulation shows that a tilted, bi-lobed spatiotemporal RF does not necessarily support a hybrid mechanism, and reiterates the lack of evidence in our data for PD enhancement. It is worth noting that the receptive fields of T5 cells, the OFF directionally selective neurons, look qualitatively similar to those estimated for T4 neurons[19-21]. Therefore, it is possible that T5 neurons compute directional selectivity using the same simple mechanism that we have established here for T4 neurons. Interestingly, no small field inhibitory inputs[37,38] to T5 have thus far been identified, raising the possibility of different implementations of the same motion computation in ON and OFF pathways[39,40].

Methods

Histology

To visualize the expression pattern of the T4 driver line (SS02344), brains of female flies were immunolabeled and imaged as described[41]. anti-Brp was used as a stain for the neuropil marker (anti-nc82 1:30, Developmental Studies Hybridoma Bank) and pJFRC225-5XUAS-IVS-myr::smFLAG (rat anti-FLAG 1:100, Novus Biologicals) in VK00005[42] was used as the reporter for GAL4 expression. For Supplementary Fig. 1a, the image shown was generated from a confocal stack imaged on a Zeiss LSM 710 microscope with a 63× objective, and resampled using Vaa3D[43].

Electrophysiology

Experiments were performed on 1-2 day old female Drosophila melanogaster (flies were reared under constant light conditions at 24°C, some flies experienced periods of darkness, including overnight, prior to dissection). To target T4 cells, a single genotype was used: pJFRC28-10XUAS-IVS-GFP-p10[44] in attP2 crossed to stable split-GAL4 SS02344 (VT015785-p65ADZp (attP40); R42F06-ZpGdbd (attP2)) generously provided by Aljoscha Nern in Gerry Rubin’s lab (line details with expression data available from http://splitgal4.janelia.org/). Flies were briefly anesthetized on ice and transferred to a special chilled vacuum holder where they were mounted, with the head tilted down, to a customized platform machined from PEEK using UV-cured glue (Loctite 3972). CAD files for the platform and vacuum holder are available upon request. To reduce brain motion the proboscis was fixed to the head with a small amount of the same glue. The posterior part of the cuticle was removed using syringe needles and fine forceps. The perineural sheath was peeled using fine forceps and, if needed, further removed with a suction pipette under the microscope. To further reduce brain motion, muscle 16[45] was removed from between the antenna. The brain was continuously perfused with an extracellular saline containing (in mM): 103 NaCl, 3 KCl, 1.5 CaCl2 2H2O, 4 MgCl2 6H2O, 1 NaH2PO4 H2O, 26 NaHCO3, 5 N-Tris (hydroxymethyl) methyl-2- aminoethane-sulfonic acid, 10 Glucose, and 10 Trehalose[46]. Osmolarity was adjusted to 275 mOsm, and saline was bubbled with 95% O2/5% CO2 during the experiment to reach a final pH of 7.3. Pressure-polished patch-clamp electrodes were pulled for a resistance of 9.5-10.5 MΩ and filled with an intracellular saline containing (in mM): 140 KAsp, 10 HEPES, 1.1 EGTA, 0.1 CaCl2, 4 MgATP, 0.5 NaGTP, and 5 Glutathione[46]. 250μM Alexa 594 Hydrazide was added to the intracellular saline prior to each experiment, to reach a final osmolarity of 265 mOsm, with a pH of 7.3. The mounted, dissected flies were positioned on a rigid platform mounted on an air table. Recordings were obtained from labeled T4 cell bodies under visual control using a Sutter SOM microscope with a 60× water-immersion objective. To visualize the GFP labeled cells, a monochrome, IR-sensitive CCD camera (ThorLabs 1500M-GE) was mounted to the microscope, an IR LED provided oblique illumination (ThorLabs M850F2), and a 460 nm LED provided GFP excitation (Sutter TLED source). Images were acquired using Micro-Manager[47], to allow for automatic contrast adjustment. All recordings were obtained from the left side of the brain. Current clamp recordings were low-pass filtered at 10KHz using Axon multiClamp 700B amplifier, and were sampled at 20KHz (National Instrument PCIe-7842R LX50 Multifunction RIO board) using custom LabView (2013 v.13.0.1f2; National Instruments) and MATLAB (2015a; Mathworks) software. The membrane potential of recorded cells was set around −65mV (uncorrected for liquid junction potential), which required injecting a small, hyperpolarizing current (0-3 pA), which after initial adjustment was maintained at a constant value throughout the recording. To verify recording quality, current step injections were performed intermittently, throughout the experiment. Recordings from cells in which either visual or current step responses diminished noticeably were terminated.

Visual stimuli

The display was constructed from an updated version of the LED panels previously described[48]. The arena covered slightly more than one half of a cylinder (216° in azimuth and ~72° in elevation) of the fly’s visual field, with each pixel subtending an angle of ~2.25° on the fly eye. Green LEDs (emission peak: 565 nm) were used, bright stimuli were ~72 cd/m2, and were presented on an intermediate intensity background of ~31 cd/m2. Visual stimuli were generated using custom written MATLAB code that allowed rapid generation of stimuli based on individual cell responses. In contrast to the published stimulus control system[48], we have now implemented an FPGA-based panel display controller, using the same PCIe card (National Instrument PCIe-7842R LX50 Multifunction RIO board) that also acquired the electrophysiology data. This new control system (implemented in LabView) streams pattern data directly from PC file storage, allowing for on-line stimulus generation. Furthermore, this new control system featured high precision (to 10 μs) timing and logging of all events, enabling reliable alignment of electrophysiology data with visual stimuli. To map the receptive field (RF) center of each recorded cell, three grids of flashing bright squares (on an intermediate intensity background) were presented at increasing resolution. Each flash stimulus was presented for 140 ms. First, a 6 × 7 grid of non-overlapping 5 × 5 LEDs (~11°×~11°) bright squares was presented. If a response was detected, a denser 3 × 3 grid with 50%-overlapping 5 × 5 LEDs (~11°×~11°) bright and dark squares (to further verify these were T4 Cells) was presented at the estimated position of the RF center (see Supplementary Fig. 1). If a recorded cell was consistently responsive to the first two mapping stimuli, a third one was presented to identify the RF center. A 5 × 5 grid of 3×3 LED bright squares separated by 1 pixel-shifts was presented at the estimated center of the second grid stimulus. The location of the peak response to this stimulus was used as the RF center in subsequent experiments. Once the RF center was identified, the moving bar stimulus was presented in 8 directions and 4 step durations. The bar was 1 × 9 pixels. When moving in the cardinal directions, the motion spanned 9 pixels. In the diagonal directions bar motion included more steps to cover the same distance (9 steps vs. 13 steps). Once the preferred direction had been estimated, bright bar (also 1 × 9 pixels) flashes were presented on the relevant axis. To verify full coverage of RF, this stimulus was presented over an area larger than the original motion window (at least 13 positions). In addition to these stimuli, most cells were also presented with additional stimuli following this procedure. All stimuli were presented in a pseudorandom order within stimulus blocks. All stimuli were presented 3 times, except for single bar flashes which were repeated 5 times. The inter-stimulus interval was 500ms for moving stimuli and 800ms for single bar flashes (to minimize the effect of ongoing inhibition on the responses to subsequent stimuli).

Analysis

All data analysis was performed in MATLAB using custom written code. Since the T4 baseline was typically stable, we included only trials in which the mean pre-stimulus baseline did not differ from the overall pre-stimulus mean for that group of stimuli by more than 10 mV. We also verified that the pre-stimulus mean and overall mean for that trial did not differ by more than 15 mV (or 25 mV for slow moving bars, due to their strong responses). This was designed to identify those rare trials in which the trace became unstable only after the stimulus was presented. Responses were later aligned to the appearance of the bar stimulus and averaged (or the appearance of the bar in the central position in case of the 8-orientation moving bar). T4 cells are expected to signal using graded synapses. Consistent with this expectation, we find that T4 recordings only occasionally feature very weak, fast transients (~1-2 mV in size) that could not be verified as spikes. Therefore, we have focused our analysis on the graded (sub-threshold) components of T4’s responses.

Determining PD

First, 8 direction responses were aligned to the center position for each cell. Second, for the duration of bar presentation, the mean vector response was calculated for each time point as . With θk being the direction of motion (in 45° intervals), and R(θk,t) the response at that direction at time t. Supplementary Fig. 2 shows the result of this procedure for the normalized vector magnitude and . This procedure can be thought of as mimicking a downstream neuron, receiving input from T4s with the same RF center but different PDs. for the cell was selected at the time point when the mean vector magnitude was maximal (repeated for 4 step durations and averaged). In the polar plot of Fig. 1b, this time point is used for the response to each bar direction. The PD was determined as the θk with the minimal difference to . For Fig. 1c, responses were then circularly shifted to align the PD with rightward motions for plotting and averaging purposes.

DSI calculation

Direction selectivity index was defined as , with each response defined as the 0.995 quantile (a robust estimate of the max) within the stimulus presentation window. DSI for the model and model variations was calculated in the same manner.

Single Position Flash Response – depolarization

Responses were defined as the 0.995 quantile (a robust estimate of the max) of the response during the time between bar appearance and flash duration + 75ms. If this number did not exceed 3 standard deviations of the pre-stimulus baseline (for all bar flashes for that cell), the response was defined as zero.

Single Position Flash Response – hyperpolarization

Same as above only the time window was until end of trial (due to slower time course for inhibition) and the threshold was 2 SDs (due to lower magnitude of hyperpolarization). These calculations were used for Fig. 2c and Supplementary Fig. 3.

Depolarization (Hyperpolarizatio) normalization

All detected averaged SPFRs for a given duration were normalized to the maximal depolarizing (hyperpolarizing) absolute response for that duration and that cell. Fig. 2c shows the average of these normalized responses for a single duration. Supplementary Fig. 3a shows the sums for each cell, for 4 normalized depolarizing responses, and the 2 slowest hyperpolarizing responses (since hyperpolarization was hard to detect for brief flashes). A position which showed the maximal response for all durations, will therefore, have a value of 4 for depolarizations and 2 for hyperpolarizations.

Onset time calculation

Data during a window of 200ms before stimulus presentation from all the single bar flashes presented to a cell was used for a per-cell estimate of the standard deviation of the baseline. Only presentations in which the average SPFR was detected as depolarizing were used for onset time calculation. Onset time was defined as the time from stimulus presentation (after it was corrected for a small display latency) in which the response crossed 0.5 × baseline S.D. This threshold value did not exceed 0.5mV. This calculation is used in Fig. 2d and e. Because positions in the center of the RF feature a mixture of excitation and inhibition, the peak time is ‘contaminated’ by the inhibitory contribution and could not be used as an independent, reliable measure of the properties of the excitatory input.

Decay time calculation

The decay time (from 80% to 20% of maximal response) was calculated for all the cells and all the positions in which a depolarizing response was detected. If 20% of max response was not attained by the time recording ended, that data point was excluded. This calculation is used in Fig. 2d and e. Number of cells which passed the above threshold and were included in figure 2d is: 160ms – 17, 17, 17, 17, 17, 15, 9; 80ms – 15, 16, 17, 17, 16, 15, 6; 40ms – 11, 13, 16, 16, 16, 10, 8; 20ms – 1, 8, 11, 17, 12, 8, 4.

Slope calculation (onset and decay times)

To calculate the slope in Fig 2e, onset (decay) time values from all the positions of an individual cell were fit with a linear regression. Fits were performed only when more than 4 positions showed responses, to account for cases in which not all positions showed detectable depolarizing responses (especially due to fast flashes).

Summed SPFRs (superposition)

Single bar flash responses were aligned to the time of the corresponding position appearance in the moving bar stimulus. Responses were padded with zeros (since all were baselines subtracted) to extend brief single bar responses to the timescale of a moving bar. This procedure was used both for the two-step apparent motion stimuli and the moving bar stimuli (Figs. 3 and 4, Supplementary Figs. 4, 5 and 7). For moving bar analysis, responses in which hyperpolarization did not return to zero by the end of the recording were padded with a linear fit to the last 250ms that was extended to zero (to avoid abrupt changes) Fig. 4: For this analysis one cell was not included, since the dataset for this recording was incomplete.

Rectified SPFR sums

Same as for standard sum, but now all negative values in individual SPFRs were set to zeros (Supplementary Fig. 5b).

Scaled center response SPFR sums

Individual SPFRs were replaced with a central response trace after rectification scaled to the amplitude of the original positional response. This was done to eliminate the positional change in response width (Supplementary Fig. 5b).

Average traces in Fig. 4d and f and Supplementary Fig. 5a

Positions from all cells were aligned to the center zero position before averaging. All trajectories that were longer than 8 positions were included in this analysis. Some were extracted from the original 8 direction stimuli (Fig. 1), which was used to determine PD. And some from additional trajectories that were presented just along the PD axis.

Squared sum in Supplementary Fig. 7

The averaged measured data (presented in Supplementary Fig. 7a) was squared (after baseline subtraction) for both the individual presentations and the two-step positions. To generate the expected sum, individual squared responses were temporally aligned and summed.

Spatiotemporal maps in Supplementary Fig. 8

Averaged SPFRs were aligned temporally to stimulus appearance. A 2-dimensional Gaussian was used to smooth the data both temporally (σ = 250ms) and spatially (σ = 1 position) using MATLAB function imgaussfilt. Contour plots were generated using MATLAB function contourf, with a fixed ‘level list’ that was manually generated in order to include smaller steps for the hyperpolarized range. An identical procedure was performed on the simulated SPFRs and simulated SPFRs without inhibition (see Conductance model for details).

Statistics

To determine statistically significant differences, one sided unpaired Student’s t test were used for comparing groups (Fig. 2e and 4e, and Supplemental Fig. 5b). Data distribution was assumed to be normal but this was not formally tested. No statistical methods were used to pre-determine sample sizes, however our sample sizes are similar to those reported in previous publications[49-51]. Data collection and analysis were not performed blind to the conditions of the experiments.

Data Plotting Conventions

All boxplots presented (apart for Fig. 2d) were plotted with MATLAB conventions. Box represents quartile, line represents median, and whiskers represent the farthest point within the q3 + IQR (q1 – IQR range). Boxplots for Fig. 2d depict only median and quartiles due to spatial constraints. For Fig. 1: triangles denote data points outside of the plot. For 20ms: −0.19 and −1.14; for 40ms: −0.09 and −0.7. For Fig. 4e: triangle denotes data point outside of plot for measured 40ms: −0.7. Most plots used the cbrewer color library from the MathWorks file exchange.

Conductance model

A model T4 neuron was implemented using the morphology of a single T4 cell, reconstructed from Electron Microscopy data. The neuron morphology was generously shared by Kazunori Shonomiya and Janelia’s FlyEM project team. The FlyEM team collected a data set containing approximately half of a Drosophila optic lobe, that was imaged with isotropic 8-nm voxels by focused ion-beam milling scanning electron microscopy (FIB-SEM). The sample was prepared from the head of a female fly as previously reported, using high-pressure freezing followed by freeze-substituted embedding[9,52]. A 153 × 85 × 180 μm volume containing connected regions of the lamina, medulla, lobula, and lobula plate was imaged. The imaged volume was segmented automatically based on an algorithm similar to one previously described[53]. NeuTu-EM (https://github.com/janelia-flyem/NeuTu/tree/flyem_release) was then used to proofread the segmented volume, where segmented fragments are neurons are merged and split to form the complete morphology of neuron. A reconstructed T4 neurons was identified based on its distinctive morphology, with dendritic compartments spanning ~20 μm in the medulla and an axon projecting to a distinct layer of the Lobula Plate. The reconstructed neuron morphology contained 344 sections. To correct for a small number of inconsistencies in the reconstructed morphology, the diameter of each dendritic section was smoothed by performing a moving average of the diameter of five adjacent sections (using the TREES toolbox[54]). The simulation was implemented and run using NEURON v. 7.4 (http://www.neuron.yale.edu/neuron/). Analog synapses were placed randomly throughout the dendritic arbor (after this compartment, containing 235 sections, was defined manually), and a recording electrode was attached to the soma. After identifying the primary dendritic axis (used to simulate stimulation along the PD-ND axis), we assigned each dendritic section a value corresponding to its projection on this axis (x*, between 0-1). We subdivided the axis into M=11 intervals, where each interval contains an equal number of dendritic sections (21 sections), to map onto the 11 stimulated positions required to enclose the T4 receptive field. We used 11 intervals since the average traces in Fig. 4d, which were the reference for the simulation, spanned positions −5 to 5 (see Analysis subsection). Within an interval, we randomly selected a fixed number of dendritic sections Ne = 9 (Ni = 5) where excitatory (inhibitory) graded synapses were placed. The reversal potential for excitatory synapses was set to 0 mV and to −70 mV for inhibitory synapses. The resting membrane potential was set to −65 mV. To simulate a visual input, mimicking the appearance of the bar at one position, all the synapses in the region were activated with conductance dynamics that were uniform for all E and different, but uniform for all I synapses. Our model simplifies the transformation from visual input to a synaptic conductance change, so while we are not explicitly modelling the optical and neuronal pre-filtering that is upstream of T4, these details are implicitly incorporated into the model, since these elements contribute to the SPFRs, which the model was fit to reproduce. To simulate an analog synapse, we used the single electrode voltage clamp (SEVC) point process and injected the inverse of our calculated conductance (see below) into the SEVC Rs (zero conductance was change to 1e9 resistance). We computed the E and I conductance time course at each synapse ( , ) according to the following: where C = {E, I}, and , denote the rise (decay) time constants of the synapse, is the synaptic input, and scales the conductance based on the location of the synapse on the PD-ND axis. We modeled the amplitude as a Gaussian profile with an overall amplitude parameter , a peak location along the dendrite , and a width . We chose a Gaussian profile since it reasonably approximates the spatial profile of E and I inputs measured for the SPFRs (Fig. 2c, Supplementary Fig. 3a), and to reduce the number of parameters needed to describe the inputs to the simulated T4 neuron. The synaptic input, , was modeled as a pulse of unit amplitude with duration T, where T = 20,40,80,160ms, and was identical for all the synapses located within one of the M intervals. Three additional parameters of the neuronal model are the membrane resistivity (Rm), membrane capacitance (Cm) and axial resistivity (Ra). For all simulations, Cm was fixed at Cm = 1 μF/cm2. We optimized the remaining model parameters, by performing non-linear least square minimization between the numerical simulation results combining all stimulus positions (M=11 intervals) and durations (T) resulting in 44 simulated SPFR responses, and the corresponding measured SPFRs (Supplementary Fig. 6c). The minimization used lsqcurvefit() function from MATLAB’s Optimization Toolbox. Having fit the model parameters using the single position flash stimuli, we used the same optimized parameters to simulate the dynamics of the model in response to moving bars (apparent motion stimulus). The moving bar stimulus was implemented by sequentially activating the synaptic inputs along the dendritic axis, with each interval being active for a duration T.

Model manipulations

(1) We moved all synaptic inputs (99 excitatory and 55 inhibitory synapses) to the first dendritic section (Fig. 5d) (2) We removed all conductance changes mediated by the inhibitory synapses, while keeping all other settings of the model fixed. This was accomplished by setting the duration of all inhibitory synaptic inputs to zero (Fig. 5d).

Single compartment conductance model

We modeled the time course of the membrane potential of a T4 neuron, , according to the single compartment dynamics where is the membrane capacitance, is the leak conductance (resulting in the integration time constant of the neuron , are the resting, excitatory and inhibitory reversal potentials respectively (values in Table 1). The excitatory and inhibitory conductances, , were modeled as a Gaussian weighted linear combination of stimulus location-specific conductances, where, as for the model described in the previous section, . , the conductance change elicited by the presentation of a stimulus in one of locations follows the dynamics Stimuli had characteristics identical to those described in the previous section. All the parameters (with the exception of the reversal potentials) were optimized to minimize the mean squared error between model responses to single bar inputs and average T4 SPFRs. As with our original model, solutions to the single compartment model conformed to one of the two solution clusters. The solutions either had fast (< 10 ms) time constants for E and slow membrane time constant (solution cluster 1), or slower E time constants and negligible membrane time constant (solution cluster 2). When the time constant of the neuron is negligible compared to the time scale of the stimuli and conductances ( , the relative change of membrane potential with respect to the resting potential can be written as where , , and . This is the expression used in the model schematics depicted in Figure 6. Note that if one combination of excitatory and inhibitory conductances elicit an hyperpolarizing response, while a second one produces a depolarizing response, under some conditions it is possible to obtain a supralinear response to the superposition of the two sets of conductances For instance, when , supralinearity is obtained if .

Data Availability

The data used to generate the primary results of this study are available here: https://doi.org/10.25378/janelia.5576101 The split-GAL4 driver used to target T4 cells (SS02344: VT015785-p65ADZp (attP40); R42F06-ZpGdbd (attP2)) generously provided by Aljoscha Nern in Gerry Rubin’s lab (line details with expression data available from http://splitgal4.janelia.org/). A Life Science Reporting Summary is included.

Code Availability

The essential code used to generate the primary results and conduct the simulations for this study are available here: https://figshare.com/projects/Gruntman_et_al_2017_Data/26347 Supplementary Figure 1: Mapping T4 Receptive Field center. (a) Top: Expression pattern of the driver line used to target T4 cells (SS02344; reoriented substack of a confocal image volume). Similar images were obtained from at least 5 samples. Bottom: schematic of experimental setup. Blue square represents the full mapping stimulus grid from c. (b) Example single cell responses to ON square flashes (~11° × ~11°), presented for 140ms at each location on the indicated stimulus grid (inset). Individual trials in grey (n=3 trials), mean in red. Inset: stimulus grid of non-overlapping squares, subtending 78.75° × 67.5° of the visual space. (c) Responses from example cell in b to bright and dark square flashes (n=3 trials each). Note stimulus grid contains overlapping position to map RF center more precisely. Inset: stimulus grid highlighting two of the nine stimulus positions with darker squares. This mapping procedure was performed for all the cells in the dataset. Supplementary Figure 2: Dynamic characterization of the moving bar responses. (a) Circular mean of preferred direction for all recorded neurons (n=17 cells). Lines connect data from the same cell, dot color depicts different speeds, and dot size the normalized mean response vector magnitude (scale in inset). (b) Histogram of the data in a, showing three distinct peaks corresponding to three of the four cardinal directions. (c) Responses from an example cell to eight different directions (rows) at four different speeds (columns). Responses were temporally aligned to the time the bar crossed the RF center. (d,e) Mean normalized vector calculated for each time point for responses in c, decomposed into magnitude (d) and angle (e). Vector was calculated from temporally aligned responses presented in c. This procedure can be considered as mimicking a downstream cell’s readout of differently tuned T4 cells with RFs centered at the same spatial position. Note, that for the two slower speeds the instantaneous PD flips by 180° after the bar passes the center (seen clearly in 10 out of 17 cells), since PD response is now hyperpolarizing while ND response is depolarizing (denoted with grey lines). Supplementary Figure 3: Peak depolarization/hyperpolarization of T4 receptive field. (a) normalized peak depolarization (red) and peak hyperpolarization (blue) for all recorded neurons. PD is always from left to right, with peak depolarization aligned to position zero. A value of 4 indicates that a particular position gave the strongest depolarizing response at all four speeds, and a 2 indicates the largest hyperpolarizing response (since only the two slowest speeds were considered). (b) angular distance between peaks of the hyperpolarization and depolarization curves in a. Black line indicates the mean. Supplementary Figure 4: T4 responses to two-step apparent motion show null direction suppression, but no preferred direction enhancement. Averaged baseline-subtracted responses (mean ±SEM) to 160 ms flash stimuli presented along the PD-ND axis (n=17 cells; expanded version of stimulus protocol shown in Figure 3, applied to all recorded T4 neurons, here the five positions sampled are shifted by one position towards the leading side). Stimulus positions indicated correspond to those in Figure 2. Apparent motion is presented as two flash stimuli timed to match their appearance during 14°/sec motion. PD motion indicated in red and ND motion in blue. Single position responses are shown in grey along the diagonal. Grey traces above and below the diagonal are time-aligned sums (computed as the superposition) of the diagonal SPFRs. Note no enhancement for second bar responses on the lower triangle, and clear suppression on most responses in the upper triangle. Supplementary Figure 5: Removing features of the SPFRs eliminates directional selectivity. (a) Comparison of PD and ND responses (mean ±SEM, n=31 trials, 16 cells) for measured, summed, rectified (summed after hyperpolarization removal), and scaled (rectified and central position response scaled to the magnitude of each position) traces. Data presented in measured and summed is reproduced from Figure 4d. (b) Boxplots for Directional Selectivity Index for the same data as in a by manipulation and speed, grey crosses represent outliers (* DSI > 0, p < 0.01 one sided unpaired t-test). The response reconstructed from amplitude-scaled versions of the SPFR at the central position (which lack the temporal sharpening of the depolarized component of trailing-edge SPFRs) do not show directional selectivity (see Methods for boxplot conventions). (c) Schematic for procedures applied in rectified and scaled versions of the SPFR sums in a. Supplementary Figure 6: detailed simulation results. (a) Summary of 1000 independent nonlinear least squares fits of simulation parameters (from randomized initial conditions) optimized to reproduce the measured SPFRs. Plot of sorted sum of squared errors (SSE) between each optimization result and the original SPFRs summed over all positions and all speeds. The smallest error solutions are found in distinct clusters that are color-coded and are further examined in b and c. (b) Spatial and temporal filters for excitatory and inhibitory conductances. Top: normalized conductance weights for excitatory (thicker line) and inhibitory synapses in three optimization solution-clusters. Each subplot shows resulting Gaussian distribution from 10, nearly indistinguishable, solutions (chosen at random) from each solution-cluster. Gaussian amplitudes were scaled by RM (inverse of the leak conductance), and the excitatory amplitude was further scaled by 10 for comparison. In solutions 1 and 2, the excitatory distribution is broader, and shifted towards the leading side (0 marks the leading end of the dendrite). Bottom: excitatory and inhibitory conductance pulse for 160ms flash. In solutions 1 and 2 the excitatory conductance is much faster. Figure 5 data is based on Solution 1, example parameter values in Table 1. (c) Comparison of measured SPFR data used for fitting and the simulation results. Only 160ms flashes are shown, but the model was fit to SPFRs at all 4 speeds. Each row shows a single solution from each solution-cluster labeled in a. The 3rd solution does not account for hyperpolarization on the trailing side and was therefore excluded from further analysis. (d) Comparison of mean measured responses to simulated predictions with parameters from solution-cluster 2 (Table 1) to moving bars in the PD and ND. (e) Peak PD and ND responses measured in T4s, compared to the solution-cluster 2 simulation results from d. (f) DSI for the mean measured responses compared to the simulation results from d. Note, Figure 5 b,c,d show these comparisons using solution 1 parameters. Supplementary Figure 7: The voltage to calcium transformation may produce enhancement artifacts. (a) Averaged baseline-subtracted responses (mean ±SEM) to 40 ms flash stimuli presented along the PD-ND axis (n=3, same cells as in Figure 3) in black. Single position responses are shown along the diagonal. Stimulus positions indicated correspond to those in Figure 2. The time-aligned sum of the single position responses is shown in grey. Lines demarcate the maximal response to second bar in 3 example positions. Arrows point from summed prediction to measured response. (b) As a proxy for possible effect of calcium imaging[21], the same data as in a are plotted after squaring the measured responses (in green). Grey traces represent the sum of squared single position responses. Note that in example i, the slight suppression in a, becomes an enhanced response in b, in example ii minimal enhancement becomes strong enhancement, and in example iii suppression is maintained. Supplementary Figure 8: The space-time tilt in the T4 spatiotemporal receptive fields likely originates from the interaction of excitatory and inhibitory inputs. Left: SPFRs from Figure 2a (for 160 ms flashes) were replotted to approximate the spatiotemporal RF of T4 cells. Dashed line indicates stimulus onset. The RF features an excitatory lobe peaking in the center (by construction) and an inhibitory lobe following in time and offset spatially towards the trailing side. In this representation, the leading side responses appear to rise more slowly than central responses, which could conform with the temporal delay requirement of an HR-like mechanism. Center: the spatiotemporal RF plotted from simulated SPFRs generated from the (Solution 1) model parameters. Lines correspond to response traces shown in the inset. The apparent difference in onset times for different positions is the result of different response magnitudes, since the time constants at different positions in the model are identical by construction. Right: similar to center, but from simulations where the inhibitory inputs were removed. Note that in the model, removing inhibition completely removes the inseparable property of the filter, showing that the contribution of inhibition is sufficient to account for the tilt in the excitatory lobe.

47 in total

1. Functional Specialization of Neural Input Elements to the Drosophila ON Motion Detector.

Authors: Georg Ammer; Aljoscha Leonhardt; Armin Bahl; Barry J Dickson; Alexander Borst
Journal: Curr Biol Date: 2015-07-30 Impact factor: 10.834

2. Fly motion vision is based on Reichardt detectors regardless of the signal-to-noise ratio.

Authors: J Haag; W Denk; A Borst
Journal: Proc Natl Acad Sci U S A Date: 2004-11-08 Impact factor: 11.205

3. Adaptation without parameter change: Dynamic gain control in motion detection.

Authors: Alexander Borst; Virginia L Flanagin; Haim Sompolinsky
Journal: Proc Natl Acad Sci U S A Date: 2005-04-15 Impact factor: 11.205

4. Neuronal basis for parallel visual processing in the fly.

Authors: N J Strausfeld; J K Lee
Journal: Vis Neurosci Date: 1991 Jul-Aug Impact factor: 3.241

5. A directional tuning map of Drosophila elementary motion detectors.

Authors: Matthew S Maisak; Juergen Haag; Georg Ammer; Etienne Serbe; Matthias Meier; Aljoscha Leonhardt; Tabea Schilling; Armin Bahl; Gerald M Rubin; Aljoscha Nern; Barry J Dickson; Dierk F Reiff; Elisabeth Hopp; Alexander Borst
Journal: Nature Date: 2013-08-08 Impact factor: 49.962

6. Optimized tools for multicolor stochastic labeling reveal diverse stereotyped cell arrangements in the fly visual system.

Authors: Aljoscha Nern; Barret D Pfeiffer; Gerald M Rubin
Journal: Proc Natl Acad Sci U S A Date: 2015-05-11 Impact factor: 11.205

7. Defining the computational structure of the motion detector in Drosophila.

Authors: Damon A Clark; Limor Bursztyn; Mark A Horowitz; Mark J Schnitzer; Thomas R Clandinin
Journal: Neuron Date: 2011-06-23 Impact factor: 17.173

8. Subcellular Imaging of Voltage and Calcium Signals Reveals Neural Processing In Vivo.

Authors: Helen H Yang; François St-Pierre; Xulu Sun; Xiaozhe Ding; Michael Z Lin; Thomas R Clandinin
Journal: Cell Date: 2016-06-02 Impact factor: 41.582

9. Neural Mechanisms for Drosophila Contrast Vision.

Authors: Armin Bahl; Etienne Serbe; Matthias Meier; Georg Ammer; Alexander Borst
Journal: Neuron Date: 2015-12-07 Impact factor: 17.173

10. Role of GABAergic inhibition in shaping odor-evoked spatiotemporal patterns in the Drosophila antennal lobe.

Authors: Rachel I Wilson; Gilles Laurent
Journal: J Neurosci Date: 2005-10-05 Impact factor: 6.709

27 in total

1. Asymmetric ON-OFF processing of visual motion cancels variability induced by the structure of natural scenes.

Authors: James E Fitzgerald; Damon A Clark; Juyue Chen; Holly B Mandel
Journal: Elife Date: 2019-10-15 Impact factor: 8.140

2. The Neuronal Basis of an Illusory Motion Percept Is Explained by Decorrelation of Parallel Motion Pathways.

Authors: Emilio Salazar-Gatzimas; Margarida Agrochao; James E Fitzgerald; Damon A Clark
Journal: Curr Biol Date: 2018-11-21 Impact factor: 10.834

3. ON selectivity in the Drosophila visual system is a multisynaptic process involving both glutamatergic and GABAergic inhibition.

Authors: Sebastian Molina-Obando; Juan Felipe Vargas-Fique; Miriam Henning; Burak Gür; T Moritz Schladt; Junaid Akhtar; Thomas K Berger; Marion Silies
Journal: Elife Date: 2019-09-19 Impact factor: 8.140