Literature DB >> 27656026

Eye Can Hear Clearly Now: Inverse Effectiveness in Natural Audiovisual Speech Processing Relies on Long-Term Crossmodal Temporal Integration.

Michael J Crosse1, Giovanni M Di Liberto1, Edmund C Lalor2.   

Abstract

UNLABELLED: Speech comprehension is improved by viewing a speaker's face, especially in adverse hearing conditions, a principle known as inverse effectiveness. However, the neural mechanisms that help to optimize how we integrate auditory and visual speech in such suboptimal conversational environments are not yet fully understood. Using human EEG recordings, we examined how visual speech enhances the cortical representation of auditory speech at a signal-to-noise ratio that maximized the perceptual benefit conferred by multisensory processing relative to unisensory processing. We found that the influence of visual input on the neural tracking of the audio speech signal was significantly greater in noisy than in quiet listening conditions, consistent with the principle of inverse effectiveness. Although envelope tracking during audio-only speech was greatly reduced by background noise at an early processing stage, it was markedly restored by the addition of visual speech input. In background noise, multisensory integration occurred at much lower frequencies and was shown to predict the multisensory gain in behavioral performance at a time lag of ∼250 ms. Critically, we demonstrated that inverse effectiveness, in the context of natural audiovisual (AV) speech processing, relies on crossmodal integration over long temporal windows. Our findings suggest that disparate integration mechanisms contribute to the efficient processing of AV speech in background noise. SIGNIFICANCE STATEMENT: The behavioral benefit of seeing a speaker's face during conversation is especially pronounced in challenging listening environments. However, the neural mechanisms underlying this phenomenon, known as inverse effectiveness, have not yet been established. Here, we examine this in the human brain using natural speech-in-noise stimuli that were designed specifically to maximize the behavioral benefit of audiovisual (AV) speech. We find that this benefit arises from our ability to integrate multimodal information over longer periods of time. Our data also suggest that the addition of visual speech restores early tracking of the acoustic speech signal during excessive background noise. These findings support and extend current mechanistic perspectives on AV speech perception.
Copyright © 2016 the authors 0270-6474/16/369888-08$15.00/0.

Entities:  

Keywords:  EEG; envelope tracking; multisensory integration; speech intelligibility; speech-in-noise; stimulus reconstruction

Mesh:

Year:  2016        PMID: 27656026      PMCID: PMC6705572          DOI: 10.1523/JNEUROSCI.1396-16.2016

Source DB:  PubMed          Journal:  J Neurosci        ISSN: 0270-6474            Impact factor:   6.167


  38 in total

1.  Spectral-temporal receptive fields of nonlinear auditory neurons obtained using natural sounds.

Authors:  F E Theunissen; K Sen; A J Doupe
Journal:  J Neurosci       Date:  2000-03-15       Impact factor: 6.167

2.  The effect of speechreading on masked detection thresholds for filtered speech.

Authors:  K W Grant
Journal:  J Acoust Soc Am       Date:  2001-05       Impact factor: 1.840

3.  The use of visible speech cues for improving auditory detection of spoken sentences.

Authors:  K W Grant; P F Seitz
Journal:  J Acoust Soc Am       Date:  2000-09       Impact factor: 1.840

4.  Visual prosody and speech intelligibility: head movement improves auditory speech perception.

Authors:  K G Munhall; Jeffery A Jones; Daniel E Callan; Takaaki Kuratate; Eric Vatikiotis-Bateson
Journal:  Psychol Sci       Date:  2004-02

5.  Seeing to hear better: evidence for early audio-visual interactions in speech identification.

Authors:  Jean-Luc Schwartz; Frédéric Berthommier; Christophe Savariaux
Journal:  Cognition       Date:  2004-09

6.  EEGLAB: an open source toolbox for analysis of single-trial EEG dynamics including independent component analysis.

Authors:  Arnaud Delorme; Scott Makeig
Journal:  J Neurosci Methods       Date:  2004-03-15       Impact factor: 2.390

7.  Bimodal speech: early suppressive visual effects in human auditory cortex.

Authors:  Julien Besle; Alexandra Fort; Claude Delpuech; Marie-Hélène Giard
Journal:  Eur J Neurosci       Date:  2004-10       Impact factor: 3.386

8.  Visual speech speeds up the neural processing of auditory speech.

Authors:  Virginie van Wassenhove; Ken W Grant; David Poeppel
Journal:  Proc Natl Acad Sci U S A       Date:  2005-01-12       Impact factor: 11.205

9.  Do you see what I am saying? Exploring visual enhancement of speech comprehension in noisy environments.

Authors:  Lars A Ross; Dave Saint-Amour; Victoria M Leavitt; Daniel C Javitt; John J Foxe
Journal:  Cereb Cortex       Date:  2006-06-19       Impact factor: 5.357

10.  Auditory-visual perception of speech.

Authors:  N P Erber
Journal:  J Speech Hear Disord       Date:  1975-11
View more
  24 in total

Review 1.  A multisensory perspective on object memory.

Authors:  Pawel J Matusz; Mark T Wallace; Micah M Murray
Journal:  Neuropsychologia       Date:  2017-04-08       Impact factor: 3.139

2.  Electrocorticography reveals continuous auditory and visual speech tracking in temporal and occipital cortex.

Authors:  Cristiano Micheli; Inga M Schepers; Müge Ozker; Daniel Yoshor; Michael S Beauchamp; Jochem W Rieger
Journal:  Eur J Neurosci       Date:  2018-08-12       Impact factor: 3.386

3.  General auditory and speech-specific contributions to cortical envelope tracking revealed using auditory chimeras.

Authors:  Kevin D Prinsloo; Edmund C Lalor
Journal:  J Neurosci       Date:  2022-08-30       Impact factor: 6.709

4.  Resolution of impaired multisensory processing in autism and the cost of switching sensory modality.

Authors:  Michael J Crosse; John J Foxe; Katy Tarrit; Edward G Freedman; Sophie Molholm
Journal:  Commun Biol       Date:  2022-06-30

5.  Left Motor δ Oscillations Reflect Asynchrony Detection in Multisensory Speech Perception.

Authors:  Emmanuel Biau; Benjamin G Schultz; Thomas C Gunter; Sonja A Kotz
Journal:  J Neurosci       Date:  2022-01-27       Impact factor: 6.709

6.  Semantic Context Enhances the Early Auditory Encoding of Natural Speech.

Authors:  Michael P Broderick; Andrew J Anderson; Edmund C Lalor
Journal:  J Neurosci       Date:  2019-08-01       Impact factor: 6.167

7.  Crossmodal Phase Reset and Evoked Responses Provide Complementary Mechanisms for the Influence of Visual Speech in Auditory Cortex.

Authors:  Pierre Mégevand; Manuel R Mercier; David M Groppe; Elana Zion Golumbic; Nima Mesgarani; Michael S Beauchamp; Charles E Schroeder; Ashesh D Mehta
Journal:  J Neurosci       Date:  2020-10-06       Impact factor: 6.167

8.  Microsaccadic Eye Movements but not Pupillary Dilation Response Characterizes the Crossmodal Freezing Effect.

Authors:  Lihan Chen; Hsin-I Liao
Journal:  Cereb Cortex Commun       Date:  2020-09-30

9.  The Multivariate Temporal Response Function (mTRF) Toolbox: A MATLAB Toolbox for Relating Neural Signals to Continuous Stimuli.

Authors:  Michael J Crosse; Giovanni M Di Liberto; Adam Bednar; Edmund C Lalor
Journal:  Front Hum Neurosci       Date:  2016-11-30       Impact factor: 3.169

10.  Integration of Visual Information in Auditory Cortex Promotes Auditory Scene Analysis through Multisensory Binding.

Authors:  Huriye Atilgan; Stephen M Town; Katherine C Wood; Gareth P Jones; Ross K Maddox; Adrian K C Lee; Jennifer K Bizley
Journal:  Neuron       Date:  2018-01-26       Impact factor: 17.173

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.