Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Generalizable EEG Encoding Models with Naturalistic Audiovisual Stimuli.

Literature DB >> 34503996

Generalizable EEG Encoding Models with Naturalistic Audiovisual Stimuli.

Maansi Desai¹, Jade Holder¹, Cassandra Villarreal¹, Nat Clark¹, Brittany Hoang¹, Liberty S Hamilton^2,3.

Abstract

In natural conversations, listeners must attend to what others are saying while ignoring extraneous background sounds. Recent studies have used encoding models to predict electroencephalography (EEG) responses to speech in noise-free listening situations, sometimes referred to as "speech tracking." Researchers have analyzed how speech tracking changes with different types of background noise. It is unclear, however, whether neural responses from acoustically rich, naturalistic environments with and without background noise can be generalized to more controlled stimuli. If encoding models for acoustically rich, naturalistic stimuli are generalizable to other tasks, this could aid in data collection from populations of individuals who may not tolerate listening to more controlled and less engaging stimuli for long periods of time. We recorded noninvasive scalp EEG while 17 human participants (8 male/9 female) listened to speech without noise and audiovisual speech stimuli containing overlapping speakers and background sounds. We fit multivariate temporal receptive field encoding models to predict EEG responses to pitch, the acoustic envelope, phonological features, and visual cues in both stimulus conditions. Our results suggested that neural responses to naturalistic stimuli were generalizable to more controlled datasets. EEG responses to speech in isolation were predicted accurately using phonological features alone, while responses to speech in a rich acoustic background were more accurate when including both phonological and acoustic features. Our findings suggest that naturalistic audiovisual stimuli can be used to measure receptive fields that are comparable and generalizable to more controlled audio-only stimuli.SIGNIFICANCE STATEMENT Understanding spoken language in natural environments requires listeners to parse acoustic and linguistic information in the presence of other distracting stimuli. However, most studies of auditory processing rely on highly controlled stimuli with no background noise, or with background noise inserted at specific times. Here, we compare models where EEG data are predicted based on a combination of acoustic, phonetic, and visual features in highly disparate stimuli-sentences from a speech corpus and speech embedded within movie trailers. We show that modeling neural responses to highly noisy, audiovisual movies can uncover tuning for acoustic and phonetic information that generalizes to simpler stimuli typically used in sensory neuroscience experiments.

Entities: Chemical

Keywords: electroencephalography; encoding models; natural stimuli; spectrotemporal receptive field; speech perception

Mesh：

Year: 2021 PMID： 34503996 PMCID： PMC8549533 DOI： 10.1523/JNEUROSCI.2891-20.2021

Source DB: PubMed Journal: J Neurosci ISSN： 0270-6474 Impact factor: 6.167

60 in total

1. The use of visible speech cues for improving auditory detection of spoken sentences.

Authors: K W Grant; P F Seitz
Journal: J Acoust Soc Am Date: 2000-09 Impact factor: 1.840

2. Hearing lips: gamma-band activity during audiovisual speech perception.

Authors: Jochen Kaiser; Ingo Hertrich; Hermann Ackermann; Klaus Mathiak; Werner Lutzenberger
Journal: Cereb Cortex Date: 2004-09-01 Impact factor: 5.357

3. Rapid Transformation from Auditory to Linguistic Representations of Continuous Speech.

Authors: Christian Brodbeck; L Elliot Hong; Jonathan Z Simon
Journal: Curr Biol Date: 2018-11-29 Impact factor: 10.834

4. An electrophysiological study of cross-modal repetition priming.

Authors: Phillip J Holcomb; Jane Anderson; Jonathan Grainger
Journal: Psychophysiology Date: 2005-09 Impact factor: 4.016

5. Prosodic pitch processing is represented in delta-band EEG and is dissociable from the cortical tracking of other acoustic and phonetic features.

Authors: Emily S Teoh; Madeline S Cappelloni; Edmund C Lalor
Journal: Eur J Neurosci Date: 2019-08-01 Impact factor: 3.386

6. Cortical encoding and neurophysiological tracking of intensity and pitch cues signaling English stress patterns in native and nonnative speakers.

Authors: Wei-Lun Chung; Gavin M Bidelman
Journal: Brain Lang Date: 2016-04-30 Impact factor: 2.381

Review 1. Linear Modeling of Neurophysiological Responses to Speech and Other Continuous Stimuli: Methodological Considerations for Applied Research.

Authors: Michael J Crosse; Nathaniel J Zuk; Giovanni M Di Liberto; Aaron R Nidiffer; Sophie Molholm; Edmund C Lalor
Journal: Front Neurosci Date: 2021-11-22 Impact factor: 4.677

1 in total

Generalizable EEG Encoding Models with Naturalistic Audiovisual Stimuli.

1. The use of visible speech cues for improving auditory detection of spoken sentences.

2. Hearing lips: gamma-band activity during audiovisual speech perception.

3. Rapid Transformation from Auditory to Linguistic Representations of Continuous Speech.

4. An electrophysiological study of cross-modal repetition priming.

5. Prosodic pitch processing is represented in delta-band EEG and is dissociable from the cortical tracking of other acoustic and phonetic features.

6. Cortical encoding and neurophysiological tracking of intensity and pitch cues signaling English stress patterns in native and nonnative speakers.

7. Speech Intelligibility Predicted from Neural Entrainment of the Speech Envelope.

8. Topographic mapping of a hierarchy of temporal receptive windows using a narrated story.

9. Semantic Context Enhances the Early Auditory Encoding of Natural Speech.

10. Dynamic Encoding of Acoustic Features in Neural Responses to Continuous Speech.

Review 1. Linear Modeling of Neurophysiological Responses to Speech and Other Continuous Stimuli: Methodological Considerations for Applied Research.