Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Prediction across sensory modalities: A neurocomputational model of the McGurk effect.

Literature DB >> 26009260

Prediction across sensory modalities: A neurocomputational model of the McGurk effect.

Itsaso Olasagasti¹, Sophie Bouton², Anne-Lise Giraud².

Abstract

The McGurk effect is a textbook illustration of the automaticity with which the human brain integrates audio-visual speech. It shows that even incongruent audiovisual (AV) speech stimuli can be combined into percepts that correspond neither to the auditory nor to the visual input, but to a mix of both. Typically, when presented with, e.g., visual /aga/ and acoustic /aba/ we perceive an illusory /ada/. In the inverse situation, however, when acoustic /aga/ is paired with visual /aba/, we perceive a combination of both stimuli, i.e., /abga/ or /agba/. Here we assessed the role of dynamic cross-modal predictions in the outcome of AV speech integration using a computational model that processes continuous audiovisual speech sensory inputs in a predictive coding framework. The model involves three processing levels: sensory units, units that encode the dynamics of stimuli, and multimodal recognition/identity units. The model exhibits a dynamic prediction behavior because evidence about speech tokens can be asynchronous across sensory modality, allowing for updating the activity of the recognition units from one modality while sending top-down predictions to the other modality. We explored the model's response to congruent and incongruent AV stimuli and found that, in the two-dimensional feature space spanned by the speech second formant and lip aperture, fusion stimuli are located in the neighborhood of congruent /ada/, which therefore provides a valid match. Conversely, stimuli that lead to combination percepts do not have a unique valid neighbor. In that case, acoustic and visual cues are both highly salient and generate conflicting predictions in the other modality that cannot be fused, forcing the elaboration of a combinatorial solution. We propose that dynamic predictive mechanisms play a decisive role in the dichotomous perception of incongruent audiovisual inputs.

Entities: Chemical Disease Species

Keywords: Audiovisual integration; Computational modeling; McGurk effect; Predictive coding

Mesh：

Year: 2015 PMID： 26009260 DOI： 10.1016/j.cortex.2015.04.008

Source DB: PubMed Journal: Cortex ISSN： 0010-9452 Impact factor: 4.027

Keyword Cloud
Cited

8 in total

1. Rethinking the McGurk effect as a perceptual illusion.

Authors: Laura M Getz; Joseph C Toscano
Journal: Atten Percept Psychophys Date: 2021-04-21 Impact factor: 2.199

2. Contrast Is in the Eye of the Beholder: Infelicitous Beat Gesture Increases Cognitive Load During Online Spoken Discourse Comprehension.

Authors: Laura M Morett; Jennifer M Roche; Scott H Fraundorf; James C McPartland
Journal: Cogn Sci Date: 2020-10

3. Language/Culture Modulates Brain and Gaze Processes in Audiovisual Speech Perception.

Authors: Satoko Hisanaga; Kaoru Sekiyama; Tomohiko Igasaki; Nobuki Murayama
Journal: Sci Rep Date: 2016-10-13 Impact factor: 4.379

4. A Causal Inference Model Explains Perception of the McGurk Effect and Other Incongruent Audiovisual Speech.

Authors: John F Magnotti; Michael S Beauchamp
Journal: PLoS Comput Biol Date: 2017-02-16 Impact factor: 4.475

5. A causal inference explanation for enhancement of multisensory integration by co-articulation.

Authors: John F Magnotti; Kristen B Smith; Marcelo Salinas; Jacqunae Mays; Lin L Zhu; Michael S Beauchamp
Journal: Sci Rep Date: 2018-12-21 Impact factor: 4.379

6. Differential Auditory and Visual Phase-Locking Are Observed during Audio-Visual Benefit and Silent Lip-Reading for Speech Perception.

Authors: Máté Aller; Heidi Solberg Økland; Lucy J MacGregor; Helen Blank; Matthew H Davis
Journal: J Neurosci Date: 2022-06-27 Impact factor: 6.709

7. A dynamical framework to relate perceptual variability with multisensory information processing.

Authors: Bhumika Thakur; Abhishek Mukherjee; Abhijit Sen; Arpan Banerjee
Journal: Sci Rep Date: 2016-08-09 Impact factor: 4.379

8. Integrating prediction errors at two time scales permits rapid recalibration of speech sound categories.

Authors: Itsaso Olasagasti; Anne-Lise Giraud
Journal: Elife Date: 2020-03-30 Impact factor: 8.140

8 in total