| Literature DB >> 34248515 |
Daniel Ari Friedman1,2, Alec Tschantz3,4, Maxwell J D Ramstead5,6,7,8, Karl Friston7, Axel Constant9.
Abstract
In this paper, we introduce an active inference model of ant colony foraging behavior, and implement the model in a series of in silico experiments. Active inference is a multiscale approach to behavioral modeling that is being applied across settings in theoretical biology and ethology. The ant colony is a classic case system in the function of distributed systems in terms of stigmergic decision-making and information sharing. Here we specify and simulate a Markov decision process (MDP) model for ant colony foraging. We investigate a well-known paradigm from laboratory ant colony behavioral experiments, the alternating T-maze paradigm, to illustrate the ability of the model to recover basic colony phenomena such as trail formation after food location discovery. We conclude by outlining how the active inference ant colony foraging behavioral model can be extended and situated within a nested multiscale framework and systems approaches to biology more generally.Entities:
Keywords: T-maze; active inference; ants; behavioral modeling; collective behavior; eco-evo-devo; foraging; stigmergy
Year: 2021 PMID: 34248515 PMCID: PMC8264549 DOI: 10.3389/fnbeh.2021.647732
Source DB: PubMed Journal: Front Behav Neurosci ISSN: 1662-5153 Impact factor: 3.558
Figure 1Overview of the foraging paradigm and simulation presented in this paper. (A) In this paper we used a T-maze foraging paradigm. Foraging nestmates issue from the Nest location in blue at the bottom of the T-maze and the food switches location between the arms of the T-maze during the simulation. The gray shaded area is where ants can forage within. (B) During the simulation, described in the order of the labeling of the figure: (1) All agents (simulated ants) start at the same nest location. (2) Their task is to reach a food resource on the map, which is on one arm of a T-maze. (3) While unladen (e.g., heading out on a foraging trip), the ant does not deposit pheromone, and moves toward locally increased density of pheromone. After encountering the food, the ant leaves a pheromone trail on its way back toward the nest, again moving toward locally increased density of pheromone. (4) Crucially, ants only have sensory access to their immediate surroundings and do not know where the food resource is located on the map. All they can sense is what surrounds them immediately and its current location, as well as outcomes associated with each of these states. (C) Expanded table of Parameters and Equations. For each ant, at each time step, there are 10 possible observations, which correspond to 10 levels of pheromone density (which range from 1 to 10). Each ant can select policies that transition between surrounding locations (which are modeled as hidden states that the agent has to infer based on sensory cues). This means that the generative model of each ant is made of a likelihood and a prior that are limited to the representations of these nine locations. The policies available to the ant correspond to transitions from the current location to one of nine possible locations (i.e., same location, up, right, down, left, top-left, top-right, bottom-left, and bottom-right), and evolves over one time step. (5) The environment in which the ant navigates is a 40 × 40 matrix, with each location corresponding to an index from 1 to 1,600 that allows us to respecify the likelihood matrix of the ant after each step (see Appendix). Each location in the environment generates a given outcome, which can change based on whether the ant passed over that location and left a pheromone trail. There is no modulation of the likelihood precision in our simulation, which means that each ant always has an unbiased sensory access to its surroundings.
Figure 2Pheromone trails of the colony with 70 nestmates over 2,000 timesteps and three food location switches. The sequence of images starts on the top right and reads from right to left (1st row), from left to right (2nd row), from right to left (3rd row), and from left to right (4th row). We chose a representative simulation and did not tune the parameters to discover optimal decision making. See Supplementary Files 6, 7 for example simulation outputs and to see the switching behavior (pheromone density locking onto one arm, then gradually locking onto the other arm) that is observed in the case of strict priors but not flat priors.
Figure 3Round trips and distance coefficient. (A) The number of completed round trips (Y axis) through time (X axis) for different numbers of nestmates in the simulation (traces of different color as per legend) for a representative trace run of each colony size. (B) Swarm distance coefficient (Y axis, see section The Foraging Task for equation) plotted through time (X axis) for different numbers of nestmates in the simulation (traces of different color as per legend) for the same representative runs.