Literature DB >> 20071527

How the human brain recognizes speech in the context of changing speakers.

Katharina von Kriegstein1, David R R Smith, Roy D Patterson, Stefan J Kiebel, Timothy D Griffiths.   

Abstract

We understand speech from different speakers with ease, whereas artificial speech recognition systems struggle with this task. It is unclear how the human brain solves this problem. The conventional view is that speech message recognition and speaker identification are two separate functions and that message processing takes place predominantly in the left hemisphere, whereas processing of speaker-specific information is located in the right hemisphere. Here, we distinguish the contribution of specific cortical regions, to speech recognition and speaker information processing, by controlled manipulation of task and resynthesized speaker parameters. Two functional magnetic resonance imaging studies provide evidence for a dynamic speech-processing network that questions the conventional view. We found that speech recognition regions in left posterior superior temporal gyrus/superior temporal sulcus (STG/STS) also encode speaker-related vocal tract parameters, which are reflected in the amplitude peaks of the speech spectrum, along with the speech message. Right posterior STG/STS activated specifically more to a speaker-related vocal tract parameter change during a speech recognition task compared with a voice recognition task. Left and right posterior STG/STS were functionally connected. Additionally, we found that speaker-related glottal fold parameters (e.g., pitch), which are not reflected in the amplitude peaks of the speech spectrum, are processed in areas immediately adjacent to primary auditory cortex, i.e., in areas in the auditory hierarchy earlier than STG/STS. Our results point to a network account of speech recognition, in which information about the speech message and the speaker's vocal tract are combined to solve the difficult task of understanding speech from different speakers.

Entities:  

Mesh:

Substances:

Year:  2010        PMID: 20071527      PMCID: PMC2824128          DOI: 10.1523/JNEUROSCI.2742-09.2010

Source DB:  PubMed          Journal:  J Neurosci        ISSN: 0270-6474            Impact factor:   6.167


  55 in total

1.  Towards a functional neuroanatomy of speech perception.

Authors: 
Journal:  Trends Cogn Sci       Date:  2000-04       Impact factor: 20.229

2.  Modulation of neural responses to speech by directing attention to voices or verbal content.

Authors:  Katharina von Kriegstein; Evelyn Eger; Andreas Kleinschmidt; Anne Lise Giraud
Journal:  Brain Res Cogn Brain Res       Date:  2003-06

3.  The processing of temporal pitch and melody information in auditory cortex.

Authors:  Roy D Patterson; Stefan Uppenkamp; Ingrid S Johnsrude; Timothy D Griffiths
Journal:  Neuron       Date:  2002-11-14       Impact factor: 17.173

4.  Pre-lexical abstraction of speech in the auditory cortex.

Authors:  Jonas Obleser; Frank Eisner
Journal:  Trends Cogn Sci       Date:  2008-12-11       Impact factor: 20.229

5.  Right hemisphere speech perception revealed by amobarbital injection and electrical interference.

Authors:  D Boatman; J Hart; R P Lesser; N Honeycutt; N B Anderson; D Miglioretti; B Gordon
Journal:  Neurology       Date:  1998-08       Impact factor: 9.910

6.  Words and voices: episodic traces in spoken word identification and recognition memory.

Authors:  S D Goldinger
Journal:  J Exp Psychol Learn Mem Cogn       Date:  1996-09       Impact factor: 3.051

Review 7.  Auditory processing--speech, space and auditory objects.

Authors:  Sophie K Scott
Journal:  Curr Opin Neurobiol       Date:  2005-04       Impact factor: 6.627

8.  Distinct functional substrates along the right superior temporal sulcus for the processing of voices.

Authors:  Katharina V Kriegstein; Anne-Lise Giraud
Journal:  Neuroimage       Date:  2004-06       Impact factor: 6.556

9.  Human cortical organization for processing vocalizations indicates representation of harmonic structure as a signal attribute.

Authors:  James W Lewis; William J Talkington; Nathan A Walker; George A Spirou; Audrey Jajosky; Chris Frum; Julie A Brefczynski-Lewis
Journal:  J Neurosci       Date:  2009-02-18       Impact factor: 6.167

10.  Recognizing sequences of sequences.

Authors:  Stefan J Kiebel; Katharina von Kriegstein; Jean Daunizeau; Karl J Friston
Journal:  PLoS Comput Biol       Date:  2009-08-14       Impact factor: 4.475

View more
  25 in total

1.  Neural correlates of the perception of contrastive prosodic focus in French: a functional magnetic resonance imaging study.

Authors:  Marcela Perrone-Bertolotti; Marion Dohen; Hélène Lœvenbruck; Marc Sato; Cédric Pichat; Monica Baciu
Journal:  Hum Brain Mapp       Date:  2012-04-05       Impact factor: 5.038

2.  The pace of prosodic phrasing couples the listener's cortex to the reader's voice.

Authors:  Mathieu Bourguignon; Xavier De Tiège; Marc Op de Beeck; Noémie Ligot; Philippe Paquier; Patrick Van Bogaert; Serge Goldman; Riitta Hari; Veikko Jousmäki
Journal:  Hum Brain Mapp       Date:  2012-01-03       Impact factor: 5.038

3.  Multivariate sensitivity to voice during auditory categorization.

Authors:  Yune Sang Lee; Jonathan E Peelle; David Kraemer; Samuel Lloyd; Richard Granger
Journal:  J Neurophysiol       Date:  2015-08-05       Impact factor: 2.714

Review 4.  Central auditory disorders: toward a neuropsychology of auditory objects.

Authors:  Johanna C Goll; Sebastian J Crutch; Jason D Warren
Journal:  Curr Opin Neurol       Date:  2010-12       Impact factor: 5.710

5.  Brain systems mediating voice identity processing in blind humans.

Authors:  Cordula Hölig; Julia Föcker; Anna Best; Brigitte Röder; Christian Büchel
Journal:  Hum Brain Mapp       Date:  2014-03-17       Impact factor: 5.038

6.  Voice-sensitive brain networks encode talker-specific phonetic detail.

Authors:  Emily B Myers; Rachel M Theodore
Journal:  Brain Lang       Date:  2016-11-27       Impact factor: 2.381

7.  Non-native listeners' recognition of high-variability speech using PRESTO.

Authors:  Terrin N Tamati; David B Pisoni
Journal:  J Am Acad Audiol       Date:  2014-10       Impact factor: 1.664

8.  Training-induced brain activation and functional connectivity differentiate multi-talker and single-talker speech training.

Authors:  Zhizhou Deng; Bharath Chandrasekaran; Suiping Wang; Patrick C M Wong
Journal:  Neurobiol Learn Mem       Date:  2018-03-10       Impact factor: 2.877

Review 9.  Dynamic speech representations in the human temporal lobe.

Authors:  Matthew K Leonard; Edward F Chang
Journal:  Trends Cogn Sci       Date:  2014-06-03       Impact factor: 20.229

10.  Speaker Invariance for Phonetic Information: an fMRI Investigation.

Authors:  Caden Salvata; Sheila E Blumstein; Emily B Myers
Journal:  Lang Cogn Process       Date:  2011-08-19
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.