| Literature DB >> 31792198 |
Josh Merel1, Matthew Botvinick2, Greg Wayne2.
Abstract
Advances in artificial intelligence are stimulating interest in neuroscience. However, most attention is given to discrete tasks with simple action spaces, such as board games and classic video games. Less discussed in neuroscience are parallel advances in "synthetic motor control". While motor neuroscience has recently focused on optimization of single, simple movements, AI has progressed to the generation of rich, diverse motor behaviors across multiple tasks, at humanoid scale. It is becoming clear that specific, well-motivated hierarchical design elements repeatedly arise when engineering these flexible control systems. We review these core principles of hierarchical control, relate them to hierarchy in the nervous system, and highlight research themes that we anticipate will be critical in solving challenges at this disciplinary intersection.Entities:
Mesh:
Year: 2019 PMID: 31792198 PMCID: PMC6889345 DOI: 10.1038/s41467-019-13239-6
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Fig. 1a Interaction cycle between an embodied control system and a physical environment to generate behavior. b A flat controller with no architectural segregation of different inputs. c A basic, brain-inspired two-stage hierarchy: a lower-level motor controller directly generates motor commands to the effectors based on input from proprioceptive sensors and modulatory input from a higher-level controller, which is responsive to additional signals, including vision and task context signals.
Summary of key principles of hierarchical control.
| Core principle | Brief summary | Motivation/utility |
|---|---|---|
| Information factorization | Different information is routed to different subsystems. | Factored learning can require less experience per subsystem. Subsystems are invariant to hidden information and therefore are reusable across contexts. |
| Partial autonomy | Lower-level systems function somewhat autonomously, with modulation from higher-level systems. | System is more robust and lower-level does not require costly micromanagement. |
| Amortized control | Movements that have been successfully executed multiple times are compressed into a system that can rapidly reproduce them. | Re-execution of frequently repeated movements should be more computationally efficient than novel variations. |
| Modular objectives | Specific subsystems may be trained to optimize specific objectives, distinct from the global task objective. | Training of subsystems can leverage error signals that are denser or more well known than the global task objective. |
| Multi-joint coordination | Movement is produced in a manner that reflects common patterns across the body. | Exploration and action-selection can exploit commonly co-occurring multi-joint patterns. |
| Temporal abstraction | Common temporal motifs are abstracted. | Behavior specification or planning can occur at a coarser timescale. |