Literature DB >> 25941506

Multisensory and sensorimotor interactions in speech perception.

Kaisa Tiippana1, Riikka Möttönen2, Jean-Luc Schwartz3.   

Abstract

Entities:  

Keywords:  McGurk effect; audiovisual; cognitive disorders; learning; multisensory; sensorimotor; somatosensory; speech perception

Year:  2015        PMID: 25941506      PMCID: PMC4403297          DOI: 10.3389/fpsyg.2015.00458

Source DB:  PubMed          Journal:  Front Psychol        ISSN: 1664-1078


× No keyword cloud information.
This research topic presents speech as a natural, well-learned, multisensory communication signal, processed by multiple mechanisms. Reflecting the general status of the field, most articles focus on audiovisual speech perception and many utilize the McGurk effect, which arises when discrepant visual and auditory speech stimuli are presented (McGurk and MacDonald, 1976). Tiippana (2014) argues that the McGurk effect can be used as a proxy for multisensory integration provided it is not interpreted too narrowly. Several articles shed new light on audiovisual speech perception in special populations. It is known that individuals with autism spectrum disorder (ASD, e.g., Saalasti et al., 2012) or language impairment (e.g., Meronen et al., 2013) are generally less influenced by the talking face than peers with typical development. Here Stevenson et al. (2014) propose that a deficit in multisensory integration could be a marker of ASD, and a component of the associated deficit in communication. However, three studies suggest that integration is not deficient in some communication disorders. Irwin and Brancazio (2014) show that children with ASD looked less at the mouth region, resulting in poorer visual speech perception and consequently weaker visual influence. Leybaert et al. (2014) report that children with specific language impairment recognized visual and auditory speech less accurately than their controls, affecting audiovisual speech perception, while audiovisual integration per se seemed unimpaired. In a similar vein, adult patients with aphasia showed unisensory deficits but still integrated audiovisual speech information (Andersen and Starrfelt, 2015). Multisensory information can influence response accuracy and processing speed (e.g., Molholm et al., 2002; Klucharev et al., 2003). Scarbel et al. (2014) show that oral responses to speech in noise were faster but less accurate than manual responses, suggesting that oral responses are planned at an earlier stage than manual responses. Sekiyama et al. (2014) show that older adults were more influenced by visual speech than younger adults and correlated this fact to their slower reaction times to auditory stimuli. Altieri and Hudock (2014) report variation in reaction time and accuracy benefits for audiovisual speech in hearing-impaired observers, emphasizing the importance of individual differences in integration. Finally, Heald and Nusbaum (2014) show that when there were two possible talkers instead of just one, audiovisual information appeared to distract the observer from the task of word recognition and slowed down their performance. This finding demonstrates that multisensory stimulation does not always facilitate performance. While multisensory stimulation is thought to be beneficial for learning (Shams and Seitz, 2008), evidence for this is still scarce. In the current research topic, the overall utility of multisensory learning is brought under question. In a paradigm training to associate novel words and pictures, Bernstein et al. (2014) show no benefit of audiovisual presentation compared with auditory presentation for normal hearing individuals, and even a degradation for adults with hearing impairment. In a study of cued speech, i.e., specific hand-signs for different speech sounds, Bayard et al. (2014) demonstrate that individuals with hearing impairment used the visual cues differently from their controls, even though both groups were experts in cued speech. Kelly et al. (2014) show that when normal hearing adults learned words in a foreign language, viewing or producing hand gestures accompanying audiovisual speech did not affect the outcome. Lee and Noppeney (2014) show that musicians had a narrower audiovisual temporal integration window for music, and to a smaller extent also for speech, implying that the effect transfers from the practiced music stimuli also to other stimulus types. Together, these findings suggest that long-term training and active use may be requisites for multisensory information to be useful in learning speech. Neurophysiological correlates of audiovisual speech perception were addressed in the research topic. By using electroencephalography (EEG) it was shown that attention (Alsius et al., 2014) and stimulus context (Ganesh et al., 2014) affected early event-related potentials (ERPs) to audiovisual speech. This provides further evidence that audiovisual interactions are not completely automatic. By using functional magnetic resonance imaging, Erickson et al. (2014) demonstrate a subdivision of posterior superior temporal areas for integrating congruent vs. incongruent audiovisual speech, and Callan et al. (2014) show that different regions in the premotor cortex were involved in unisensory-to-articulatory mapping and audiovisual integration. Interactions between auditory and motor brain areas during auditory speech perception were also investigated. By using magnetoencephalography, Alho et al. (2014) demonstrate that connectivity between auditory and motor areas increased from passive listening to clear speech to listening to speech in noise, and that the strength of this connectivity was positively correlated with the accuracy of syllable identification. Moreover, analyses of EEG oscillations revealed that alpha and beta rhythms generated in the sensorimotor and auditory areas were modulated during syllable discrimination tasks (Bowers et al., 2014; Jenson et al., 2014). By using theta-burst transcranial magnetic stimulation, Rogers et al. (2014) show that disrupting the lip area of the motor cortex impaired discrimination of lip-articulated speech sounds from sounds not articulated on the lips. The involvement of the motor processes is often considered to make speech perception “special,” i.e., essentially different from perception of non-speech stimuli. However, this remains a highly controversial view. Carbonell and Lotto (2014) claim that speech should not be considered special amongst other stimuli with regards to multisensory integration. Somatosensory information can also influence speech perception. Ito et al. (2014) used EEG to study how stretching the skin on both sides of the mouth influences processing of speech sounds, and displayed auditory-somatosensory interaction that was sensitive to intersensory timing. In another EEG study, Treille et al. (2014) report that haptic exploration of the talker's face during speech perception modulated ERPs. These findings confirm that auditory-somatosensory interactions contribute to speech processing. The current research topic shows that speech can be perceived via multiple senses and that speech perception relies on sophisticated unisensory, multisensory and sensorimotor mechanisms. Multisensory information can facilitate perception and learning of speech. Still, there is great variation in multisensory perception and integration in both typical and special populations at different ages, which should be studied further in the future.

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
  30 in total

1.  Electrophysiological indicators of phonetic and non-phonetic multisensory interactions during audiovisual speech perception.

Authors:  Vasily Klucharev; Riikka Möttönen; Mikko Sams
Journal:  Brain Res Cogn Brain Res       Date:  2003-12

2.  Hearing lips and seeing voices.

Authors:  H McGurk; J MacDonald
Journal:  Nature       Date:  1976 Dec 23-30       Impact factor: 49.962

3.  Audiovisual speech perception and eye gaze behavior of adults with asperger syndrome.

Authors:  Satu Saalasti; Jari Kätsyri; Kaisa Tiippana; Mari Laine-Hernandez; Lennart von Wendt; Mikko Sams
Journal:  J Autism Dev Disord       Date:  2012-08

Review 4.  Benefits of multisensory learning.

Authors:  Ladan Shams; Aaron R Seitz
Journal:  Trends Cogn Sci       Date:  2008-11       Impact factor: 20.229

5.  Atypical audio-visual speech perception and McGurk effects in children with specific language impairment.

Authors:  Jacqueline Leybaert; Lucie Macchi; Aurélie Huyse; François Champoux; Clémence Bayard; Cécile Colin; Frédéric Berthommier
Journal:  Front Psychol       Date:  2014-05-20

6.  How is the McGurk effect modulated by Cued Speech in deaf and hearing adults?

Authors:  Clémence Bayard; Cécile Colin; Jacqueline Leybaert
Journal:  Front Psychol       Date:  2014-05-19

7.  Seeing to hear? Patterns of gaze to speaking faces in children with autism spectrum disorders.

Authors:  Julia R Irwin; Lawrence Brancazio
Journal:  Front Psychol       Date:  2014-05-08

8.  Hearing impairment and audiovisual speech integration ability: a case study report.

Authors:  Nicholas Altieri; Daniel Hudock
Journal:  Front Psychol       Date:  2014-07-01

9.  Effect of attentional load on audiovisual speech perception: evidence from ERPs.

Authors:  Agnès Alsius; Riikka Möttönen; Mikko E Sams; Salvador Soto-Faraco; Kaisa Tiippana
Journal:  Front Psychol       Date:  2014-07-15

10.  Discrimination of speech and non-speech sounds following theta-burst stimulation of the motor cortex.

Authors:  Jack C Rogers; Riikka Möttönen; Rowan Boyles; Kate E Watkins
Journal:  Front Psychol       Date:  2014-07-15
View more
  2 in total

1.  Parallel linear dynamic models can mimic the McGurk effect in clinical populations.

Authors:  Nicholas Altieri; Cheng-Ta Yang
Journal:  J Comput Neurosci       Date:  2016-06-07       Impact factor: 1.621

2.  Motor engagement relates to accurate perception of phonemes and audiovisual words, but not auditory words.

Authors:  Kelly Michaelis; Makoto Miyakoshi; Gina Norato; Andrei V Medvedev; Peter E Turkeltaub
Journal:  Commun Biol       Date:  2021-01-25
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.