| Literature DB >> 27445895 |
Abstract
This paper proposes how various disciplinary theories of cognition may be combined into a unifying, sub-symbolic, computational theory of cognition. The following theories are considered for integration: psychological theories, including the theory of event coding, event segmentation theory, the theory of anticipatory behavioral control, and concept development; artificial intelligence and machine learning theories, including reinforcement learning and generative artificial neural networks; and theories from theoretical and computational neuroscience, including predictive coding and free energy-based inference. In the light of such a potential unification, it is discussed how abstract cognitive, conceptualized knowledge and understanding may be learned from actively gathered sensorimotor experiences. The unification rests on the free energy-based inference principle, which essentially implies that the brain builds a predictive, generative model of its environment. Neural activity-oriented inference causes the continuous adaptation of the currently active predictive encodings. Neural structure-oriented inference causes the longer term adaptation of the developing generative model as a whole. Finally, active inference strives for maintaining internal homeostasis, causing goal-directed motor behavior. To learn abstract, hierarchical encodings, however, it is proposed that free energy-based inference needs to be enhanced with structural priors, which bias cognitive development toward the formation of particular, behaviorally suitable encoding structures. As a result, it is hypothesized how abstract concepts can develop from, and thus how they are structured by and grounded in, sensorimotor experiences. Moreover, it is sketched-out how symbol-like thought can be generated by a temporarily active set of predictive encodings, which constitute a distributed neural attractor in the form of an interactive free-energy minimum. The activated, interactive network attractor essentially characterizes the semantics of a concept or a concept composition, such as an actual or imagined situation in our environment. Temporal successions of attractors then encode unfolding semantics, which may be generated by a behavioral or mental interaction with an actual or imagined situation in our environment. Implications, further predictions, possible verification, and falsifications, as well as potential enhancements into a fully spelled-out unified theory of cognition are discussed at the end of the paper.Entities:
Keywords: anticipatory behavior; conceptualization; embodiment; free energy-based inference; homeostasis; learning; planning; predictive coding
Year: 2016 PMID: 27445895 PMCID: PMC4915327 DOI: 10.3389/fpsyg.2016.00925
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
Figure 1An overall predictive processing loop continuously generates temporal predictions, compares the resulting distributed prior probabilistic state estimate with the incoming sensory information, and fuses the independent information sources yielding a distributed, probabilistic local posterior state estimation. Finally, the internal active predictive encodings are adapted further toward establishing mutual consistencies, yielding an approximate global posterior distributed probabilistic state estimation. All the types of probabilistic state estimations are encoded sub-symbolically by means of neural activities, which essentially constitute the currently active predictive encodings.
Figure 2A distributed illustration (highly simplified) of a possible predictive encoding of the compositional concept “a ball lies in a bowl,” including some of the most important active predictive encodings. Note how the (A) ball and (B) bowl concepts are activated, sketching out the respective top-down predictions of their visual appearance as well as the respective temporal predictive encodings, which characterize potential motion interaction consequences of other items with the activated items. (C) The relative spatial predictive encodings together with the temporal predictive interaction encodings realize the “lies in” concept in the composition. While bidirectional arrows show ball- and bowl-respective active predictive encoding interactions, the darkness of arrows within individual illustrations indicates the current strength of activation. Note how, for example, the temporal interaction consequence encodings of ball and bowl approximately cancel each other out, thus generating a somewhat stable free energy minimum.
Glossary of terminology used in the paper.
| Abstraction | An encoding that generalizes away from particular features in space and/or in time and/or over feature-specific aspects; typically, an abstracted encoding corresponds to a higher level, top-down predictive encoding |
| Active predictive encoding | An encoding that is currently active and that thus predicts the activity of other predictive encodings—just like a set of firing neurons that activate other neurons via their axons, the reached synapses, and the connected dendrites. |
| Cause | A physical property of an item, which may cause sensory signals and determine physical interactions (in analogy to Friston, |
| Concept | A subset of predictive encodings that specify (possibly relative) item properties, orientations, positions, and/or forces that are essential for a particular event to take place |
| Concept composition | A non-contradictory combination of concepts |
| Event schemata | An encoding of an event together with event boundary encodings that specify when the event can occur and how it typically ends (in analogy to Hard et al., |
| Dynamic event | An active set of temporal predictive encodings, which predict changes of causes, positions, and/or orientations of items in the environment, typically together with the forces that cause the changes, over an extended period of time |
| Episode encodings | A set of events and their typical ordering in time |
| Event | An active set of predictive encodings, which apply over an extended period of time (in analogy to Zacks and Tversky, |
| Event boundary | A particular state in the environment upon which one or several predictive encodings become applicable and/or one or several other predictive encodings are no longer applicable (in analogy to Zacks and Tversky, |
| Force | A physical force in the environment—including but not limited to motor activity—which causes items to change |
| Item | A body, body-part, object, material, thing, sensor, muscle, etc., that is, anything that exists in the environment and that can interact with other items |
| Modality | Sensory or motor signals provided by the respective sensors or activators |
| Module | A set of predictive encodings that integrates particular sensory and/or motor encodings or abstractions of such encodings in a particular manner |
| Orientation | Angular information about an item in the environment relative to other items in the environment |
| Position | Localization of an item in the environment relative to other items in the environment |
| Predictive encoding | Any form of predictive, neural encoding, which—when active—predicts the activity of other encodings—akin to a neuron or a set of neurons including the connectivity to other neurons via axon, synapses, and connected dendrites |
| Spatial predictive encodings | Predictions that map other predictive encodings onto each other |
| Static event | An active set of spatial and top-down predictive encodings of causes, positions, and/or orientations of items in the environment over an extended period of time |
| Temporal predictive encodings | Predictions forward in time, that is, predictions about changes in causes, positions, and/or orientations of items in the environment due to forces |
| Top-down predictive encodings | Predictions about more sensory- or motor-grounded signals in more abstract, generalizing forms, typically involving sensory/feature abstractions |
Main propositions toward a unifying, sub-symbolic, computational theory of cognition.
| 1. | The brain is a modular, probabilistic, predictive encoding system that continuously strives to minimize free energy in a distributed manner; |
| 2. | Predictive encodings are separated into temporal, spatial, and top-down predictive encodings; |
| 3. | Modularity develops in the brain to be able to flexibly relate particular predictive encodings across space and time and to be able to form effective abstractions and generalizations; |
| 4. | Behavior, attention, and thought are anticipatory because they are generated by active inference mechanisms, which activate temporal predictive encodings inversely due to differences in current and desired internal homeostatic states, which in turn activate associated forces, motor behavior, attention, and thought itself; |
| 5. | Concepts are approximately consistent free energy minima in a distributed set of active, interconnected predictive encodings; |
| 6. | Concept compositions are combinations of such concepts that are temporarily consistently related to each other (approximating a larger, more distributed free energy minimum); |
| 7. | Particular scenarios, such as the current or an imagined world state, are perceived, imagined, or remembered in the form of compositional concepts; |
| 8. | Episodes are perceived, imagined, or remembered in the form of compositional concepts and a concatenation of event schema-encodings (typically on multiple levels of abstraction), which specify how the scenario changes (or may change or changed) over time; |
| 9. | The cognitive pursuance of a particular “idea” or a particular “thought” corresponds to the active exploration of concepts, concept compositions, scenarios, and/or episodes by means of event schema-based activity changes over time. |