Literature DB >> 26009260

Prediction across sensory modalities: A neurocomputational model of the McGurk effect.

Itsaso Olasagasti1, Sophie Bouton2, Anne-Lise Giraud2.   

Abstract

The McGurk effect is a textbook illustration of the automaticity with which the human brain integrates audio-visual speech. It shows that even incongruent audiovisual (AV) speech stimuli can be combined into percepts that correspond neither to the auditory nor to the visual input, but to a mix of both. Typically, when presented with, e.g., visual /aga/ and acoustic /aba/ we perceive an illusory /ada/. In the inverse situation, however, when acoustic /aga/ is paired with visual /aba/, we perceive a combination of both stimuli, i.e., /abga/ or /agba/. Here we assessed the role of dynamic cross-modal predictions in the outcome of AV speech integration using a computational model that processes continuous audiovisual speech sensory inputs in a predictive coding framework. The model involves three processing levels: sensory units, units that encode the dynamics of stimuli, and multimodal recognition/identity units. The model exhibits a dynamic prediction behavior because evidence about speech tokens can be asynchronous across sensory modality, allowing for updating the activity of the recognition units from one modality while sending top-down predictions to the other modality. We explored the model's response to congruent and incongruent AV stimuli and found that, in the two-dimensional feature space spanned by the speech second formant and lip aperture, fusion stimuli are located in the neighborhood of congruent /ada/, which therefore provides a valid match. Conversely, stimuli that lead to combination percepts do not have a unique valid neighbor. In that case, acoustic and visual cues are both highly salient and generate conflicting predictions in the other modality that cannot be fused, forcing the elaboration of a combinatorial solution. We propose that dynamic predictive mechanisms play a decisive role in the dichotomous perception of incongruent audiovisual inputs.
Copyright © 2015. Published by Elsevier Ltd.

Entities:  

Keywords:  Audiovisual integration; Computational modeling; McGurk effect; Predictive coding

Mesh:

Year:  2015        PMID: 26009260     DOI: 10.1016/j.cortex.2015.04.008

Source DB:  PubMed          Journal:  Cortex        ISSN: 0010-9452            Impact factor:   4.027


  8 in total

1.  Rethinking the McGurk effect as a perceptual illusion.

Authors:  Laura M Getz; Joseph C Toscano
Journal:  Atten Percept Psychophys       Date:  2021-04-21       Impact factor: 2.199

2.  Contrast Is in the Eye of the Beholder: Infelicitous Beat Gesture Increases Cognitive Load During Online Spoken Discourse Comprehension.

Authors:  Laura M Morett; Jennifer M Roche; Scott H Fraundorf; James C McPartland
Journal:  Cogn Sci       Date:  2020-10

3.  Language/Culture Modulates Brain and Gaze Processes in Audiovisual Speech Perception.

Authors:  Satoko Hisanaga; Kaoru Sekiyama; Tomohiko Igasaki; Nobuki Murayama
Journal:  Sci Rep       Date:  2016-10-13       Impact factor: 4.379

4.  A Causal Inference Model Explains Perception of the McGurk Effect and Other Incongruent Audiovisual Speech.

Authors:  John F Magnotti; Michael S Beauchamp
Journal:  PLoS Comput Biol       Date:  2017-02-16       Impact factor: 4.475

5.  A causal inference explanation for enhancement of multisensory integration by co-articulation.

Authors:  John F Magnotti; Kristen B Smith; Marcelo Salinas; Jacqunae Mays; Lin L Zhu; Michael S Beauchamp
Journal:  Sci Rep       Date:  2018-12-21       Impact factor: 4.379

6.  Differential Auditory and Visual Phase-Locking Are Observed during Audio-Visual Benefit and Silent Lip-Reading for Speech Perception.

Authors:  Máté Aller; Heidi Solberg Økland; Lucy J MacGregor; Helen Blank; Matthew H Davis
Journal:  J Neurosci       Date:  2022-06-27       Impact factor: 6.709

7.  A dynamical framework to relate perceptual variability with multisensory information processing.

Authors:  Bhumika Thakur; Abhishek Mukherjee; Abhijit Sen; Arpan Banerjee
Journal:  Sci Rep       Date:  2016-08-09       Impact factor: 4.379

8.  Integrating prediction errors at two time scales permits rapid recalibration of speech sound categories.

Authors:  Itsaso Olasagasti; Anne-Lise Giraud
Journal:  Elife       Date:  2020-03-30       Impact factor: 8.140

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.