| Literature DB >> 31840585 |
Ryan Calmus1, Benjamin Wilson1, Yukiko Kikuchi1, Christopher I Petkov1.
Abstract
Understanding how the brain forms representations of structured information distributed in time is a challenging endeavour for the neuroscientific community, requiring computationally and neurobiologically informed approaches. The neural mechanisms for segmenting continuous streams of sensory input and establishing representations of dependencies remain largely unknown, as do the transformations and computations occurring between the brain regions involved in these aspects of sequence processing. We propose a blueprint for a neurobiologically informed and informing computational model of sequence processing (entitled: Vector-symbolic Sequencing of Binding INstantiating Dependencies, or VS-BIND). This model is designed to support the transformation of serially ordered elements in sensory sequences into structured representations of bound dependencies, readily operates on multiple timescales, and encodes or decodes sequences with respect to chunked items wherever dependencies occur in time. The model integrates established vector symbolic additive and conjunctive binding operators with neurobiologically plausible oscillatory dynamics, and is compatible with modern spiking neural network simulation methods. We show that the model is capable of simulating previous findings from structured sequence processing tasks that engage fronto-temporal regions, specifying mechanistic roles for regions such as prefrontal areas 44/45 and the frontal operculum during interactions with sensory representations in temporal cortex. Finally, we are able to make predictions based on the configuration of the model alone that underscore the importance of serial position information, which requires input from time-sensitive cells, known to reside in the hippocampus and dorsolateral prefrontal cortex. This article is part of the theme issue 'Towards mechanistic models of meaning composition'.Entities:
Keywords: binding; chunking; computational modelling; language; sequence learning; serial order
Mesh:
Year: 2019 PMID: 31840585 PMCID: PMC6939361 DOI: 10.1098/rstb.2019.0304
Source DB: PubMed Journal: Philos Trans R Soc Lond B Biol Sci ISSN: 0962-8436 Impact factor: 6.237
Figure 1.Neurobiologically informed heuristic model of structured sequence processing, by Wilson, Marslen-Wilson and Petkov. (a) Fronto-temporal regions involved in sequence processing, from [34]. DLPFC, dorsolateral prefrontal cortex; vFOC ventral frontal opercular cortex; ATL, anterior temporal lobe; STG, superior temporal gyrus. (Copyright © 2017 Benjamin Wilson, William D. Marslen-Wilson and Christopher I. Petkov, CC BY 4.0.) (b) Predicted combinatorial codes illustrated as neural patterns implemented by coordination between different regions. (Online version in colour.)
Figure 2.Spatial and temporal coding within a spiking model. Top row, left panel: vector symbolic architecture (VSA) operations using circular convolution to accomplish binding (A ⨂ B) [61]. Sparse, high-dimensional random vectors represent distinct symbols A and B (blue text). To aid visual comparison, vectors are shown reshaped into squares, meaning the dimensionality of each vector equals the number of ‘pixels’ in each box (here, 256-dimensional, plotted as 16 × 16). These vectors can also be considered directions in high-dimensional space (inset, projected down to 3D using principal components analysis). Results of the VSA operators are shown (upper left main panel, black text, clockwise from left: superposition, binding, unbinding and inversion; see text, §4), with arrows indicating the flow of operands. A noisy recovered vector (rightmost square) can be cleaned up with an autoassociative memory to produce A (top square). Top row, right panel (from [21]; copyright © 2014 Bekolay, Bergstra, Hunsberger, DeWolf, Stewart, Rasmussen, Choo, Voelker and Eliasmith, CC BY 3.0): Core properties of the Neural Engineering Framework (NEF) [22] as implemented in Nengo [21]. To the left are tuning curves of individual neurons (a, top plot). In vector terms, each neuron fires maximally to its own preferred direction. Nonlinear encoding of an input signal (a, middle plot) yields spike trains for each neuron (a, bottom plot). Decoding (b) is possible using linear methods. Combining decoding with encoding, one can determine synaptic weights representing transformations between populations (c). Here, VSA representations are simply high-dimensional signals encoded like any other. Operations like convolution can be learned by simulated spiking networks incorporating spike-timing-dependent plasticity (STDP) in Nengo [22]. Finally, dynamic signals can be represented (d), of relevance for understanding oscillatory mechanisms [26,28,29]. Nengo is agnostic about neural models, with many spiking models available [80–82]. Bottom row: since the NEF provides mechanisms for spatial (VSA operations) and temporal (dynamic) manipulation of representations, possibilities exceed that of a static system. Simple interactions between segregated populations (networks shown in blue boxes) lead to controlled functional relationships. Thresholded dynamic activity, e.g. arising from an oscillator (P1, leftmost panel) can trigger discrete combinatorial operations. These operations can be segregated over time (middle left panel) through control by interacting, antisynchronous oscillations (P1 and P2). Likewise, common driving signals can synchronously strengthen representations in disparate regions (middle right panel, P1 multiplicatively modulating P2 and P3) for downstream processing such as feature binding. Finally, the phase of an oscillator, rather than its amplitude (rightmost panel) can drive downstream encodings such as those of relative position. (Online version in colour.)
Figure 3.Neurobiologically informed vector symbolic encoding of sequence structure. Vector symbolic operators can account for the processing of a variety of sequencing dependencies. The solid arrows in these charts indicate the flow of information during encoding of a stimulus sequence only. These describe transformations of latent vector symbolic representations, as opposed to neural activation patterns. Representation strength is denoted by the shading and thickness of each box border. For clarity, representations are shown separated along the horizontal axis, though separate boxes do not necessarily imply separate neural populations are engaged, especially if describing identical computations, which could be undertaken by neurons of a single region. We suggest sensory representations (bottom row) are maintained within the supplementary motor area (not shown) and retrieved as needed. Operations unfold dynamically following principles outlined in figure 2. The final encoded sequence representation is found at the top of each diagram. Each is a reduced representation whose constituents can be inspected without serially unpacking all bindings; the superposed final result of the adjacent relationship encoding (leftmost diagram), for example, can be interrogated to recover its secondary element by simply binding it with ¬2°. Serial elements packaged into a single representation are considered to be chunked in the traditional sense (inset box), but identical operations can be applied to non-adjacent (middle diagram) or nested pairs of elements (rightmost diagram), using separate item (I) and chunk (C) position encodings. Selective fading along the vertical axis represents salience filtering in the non-adjacent example (middle). Finally, the dashed, curved arrow shows just one case in which sub-symbolic feedback from a cognitively abstract cortical region might ultimately influence the representation of individual elements in sensory cortex (there may be many such pathways, but one exemplar is shown). Thus, although the figure, for simplicity, suggests VS-BIND is largely a feedforward model, feedback influences feature and can, for example, allow certain areas to influence sensory cortical representations. (Online version in colour.)