Literature DB >> 28179553

Mouth and Voice: A Relationship between Visual and Auditory Preference in the Human Superior Temporal Sulcus.

Lin L Zhu1,2, Michael S Beauchamp3.   

Abstract

Cortex in and around the human posterior superior temporal sulcus (pSTS) is known to be critical for speech perception. The pSTS responds to both the visual modality (especially biological motion) and the auditory modality (especially human voices). Using fMRI in single subjects with no spatial smoothing, we show that visual and auditory selectivity are linked. Regions of the pSTS were identified that preferred visually presented moving mouths (presented in isolation or as part of a whole face) or moving eyes. Mouth-preferring regions responded strongly to voices and showed a significant preference for vocal compared with nonvocal sounds. In contrast, eye-preferring regions did not respond to either vocal or nonvocal sounds. The converse was also true: regions of the pSTS that showed a significant response to speech or preferred vocal to nonvocal sounds responded more strongly to visually presented mouths than eyes. These findings can be explained by environmental statistics. In natural environments, humans see visual mouth movements at the same time as they hear voices, while there is no auditory accompaniment to visual eye movements. The strength of a voxel's preference for visual mouth movements was strongly correlated with the magnitude of its auditory speech response and its preference for vocal sounds, suggesting that visual and auditory speech features are coded together in small populations of neurons within the pSTS.SIGNIFICANCE STATEMENT Humans interacting face to face make use of auditory cues from the talker's voice and visual cues from the talker's mouth to understand speech. The human posterior superior temporal sulcus (pSTS), a brain region known to be important for speech perception, is complex, with some regions responding to specific visual stimuli and others to specific auditory stimuli. Using BOLD fMRI, we show that the natural statistics of human speech, in which voices co-occur with mouth movements, are reflected in the neural architecture of the pSTS. Different pSTS regions prefer visually presented faces containing either a moving mouth or moving eyes, but only mouth-preferring regions respond strongly to voices.
Copyright © 2017 the authors 0270-6474/17/372697-12$15.00/0.

Entities:  

Keywords:  audiovisual; face; multisensory; speech perception

Mesh:

Year:  2017        PMID: 28179553      PMCID: PMC5354323          DOI: 10.1523/JNEUROSCI.2914-16.2017

Source DB:  PubMed          Journal:  J Neurosci        ISSN: 0270-6474            Impact factor:   6.167


  76 in total

1.  Brain areas involved in perception of biological motion.

Authors:  E Grossman; M Donnelly; R Price; D Pickens; V Morgan; G Neighbor; R Blake
Journal:  J Cogn Neurosci       Date:  2000-09       Impact factor: 3.225

2.  Polysensory interactions along lateral temporal regions evoked by audiovisual speech.

Authors:  Tarra M Wright; Kevin A Pelphrey; Truett Allison; Martin J McKeown; Gregory McCarthy
Journal:  Cereb Cortex       Date:  2003-10       Impact factor: 5.357

3.  The superior temporal sulcus differentiates communicative and noncommunicative auditory signals.

Authors:  Sarah Shultz; Athena Vouloumanos; Kevin Pelphrey
Journal:  J Cogn Neurosci       Date:  2012-02-23       Impact factor: 3.225

4.  Functional anatomy of biological motion perception in posterior temporal cortex: an FMRI study of eye, mouth and hand movements.

Authors:  Kevin A Pelphrey; James P Morris; Charles R Michelich; Truett Allison; Gregory McCarthy
Journal:  Cereb Cortex       Date:  2005-03-02       Impact factor: 5.357

5.  Cerebral representation of non-verbal emotional perception: fMRI reveals audiovisual integration area between voice- and face-sensitive regions in the superior temporal sulcus.

Authors:  Benjamin Kreifelts; Thomas Ethofer; Thomas Shiozawa; Wolfgang Grodd; Dirk Wildgruber
Journal:  Neuropsychologia       Date:  2009-07-21       Impact factor: 3.139

6.  Visual face-movement sensitive cortex is relevant for auditory-only speech recognition.

Authors:  Philipp Riedel; Patrick Ragert; Stefanie Schelinski; Stefan J Kiebel; Katharina von Kriegstein
Journal:  Cortex       Date:  2014-12-23       Impact factor: 4.027

7.  Parietal, temporal, and occipital projections to cortex of the superior temporal sulcus in the rhesus monkey: a retrograde tracer study.

Authors:  B Seltzer; D N Pandya
Journal:  J Comp Neurol       Date:  1994-05-15       Impact factor: 3.215

8.  Crossmodal adaptation in right posterior superior temporal sulcus during face-voice emotional integration.

Authors:  Rebecca Watson; Marianne Latinus; Takao Noguchi; Oliver Garrod; Frances Crabbe; Pascal Belin
Journal:  J Neurosci       Date:  2014-05-14       Impact factor: 6.167

9.  FMRI responses to video and point-light displays of moving humans and manipulable objects.

Authors:  Michael S Beauchamp; Kathryn E Lee; James V Haxby; Alex Martin
Journal:  J Cogn Neurosci       Date:  2003-10-01       Impact factor: 3.225

10.  Connectivity analysis reveals a cortical network for eye gaze perception.

Authors:  Lauri Nummenmaa; Luca Passamonti; James Rowe; Andrew D Engell; Andrew J Calder
Journal:  Cereb Cortex       Date:  2009-12-16       Impact factor: 5.357

View more
  12 in total

1.  Electrocorticography reveals continuous auditory and visual speech tracking in temporal and occipital cortex.

Authors:  Cristiano Micheli; Inga M Schepers; Müge Ozker; Daniel Yoshor; Michael S Beauchamp; Jochem W Rieger
Journal:  Eur J Neurosci       Date:  2018-08-12       Impact factor: 3.386

2.  Vision perceptually restores auditory spectral dynamics in speech.

Authors:  John Plass; David Brang; Satoru Suzuki; Marcia Grabowecky
Journal:  Proc Natl Acad Sci U S A       Date:  2020-07-06       Impact factor: 11.205

3.  Cross-modal phonetic encoding facilitates the McGurk illusion and phonemic restoration.

Authors:  Noelle T Abbott; Antoine J Shahin
Journal:  J Neurophysiol       Date:  2018-10-10       Impact factor: 2.714

4.  Free viewing of talking faces reveals mouth and eye preferring regions of the human superior temporal sulcus.

Authors:  Johannes Rennig; Michael S Beauchamp
Journal:  Neuroimage       Date:  2018-08-06       Impact factor: 6.556

5.  Face viewing behavior predicts multisensory gain during speech perception.

Authors:  Johannes Rennig; Kira Wegner-Clemens; Michael S Beauchamp
Journal:  Psychon Bull Rev       Date:  2020-02

6.  Neural Mechanisms Underlying Cross-Modal Phonetic Encoding.

Authors:  Antoine J Shahin; Kristina C Backer; Lawrence D Rosenblum; Jess R Kerlin
Journal:  J Neurosci       Date:  2017-12-20       Impact factor: 6.167

7.  Different neural processes underlie visual speech perception in school-age children and adults: An event-related potentials study.

Authors:  Natalya Kaganovich; Elizabeth Ancel
Journal:  J Exp Child Psychol       Date:  2019-04-20

8.  The visual speech head start improves perception and reduces superior temporal cortex responses to auditory speech.

Authors:  Patrick J Karas; John F Magnotti; Brian A Metzger; Lin L Zhu; Kristen B Smith; Daniel Yoshor; Michael S Beauchamp
Journal:  Elife       Date:  2019-08-08       Impact factor: 8.140

9.  Responses to Visual Speech in Human Posterior Superior Temporal Gyrus Examined with iEEG Deconvolution.

Authors:  Brian A Metzger; John F Magnotti; Zhengjia Wang; Elizabeth Nesbitt; Patrick J Karas; Daniel Yoshor; Michael S Beauchamp
Journal:  J Neurosci       Date:  2020-07-29       Impact factor: 6.167

10.  Converging Evidence From Electrocorticography and BOLD fMRI for a Sharp Functional Boundary in Superior Temporal Gyrus Related to Multisensory Speech Processing.

Authors:  Muge Ozker; Daniel Yoshor; Michael S Beauchamp
Journal:  Front Hum Neurosci       Date:  2018-04-24       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.