Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 How the human brain recognizes speech in the context of changing speakers.

Literature DB >> 20071527

How the human brain recognizes speech in the context of changing speakers.

Katharina von Kriegstein¹, David R R Smith, Roy D Patterson, Stefan J Kiebel, Timothy D Griffiths.

Abstract

We understand speech from different speakers with ease, whereas artificial speech recognition systems struggle with this task. It is unclear how the human brain solves this problem. The conventional view is that speech message recognition and speaker identification are two separate functions and that message processing takes place predominantly in the left hemisphere, whereas processing of speaker-specific information is located in the right hemisphere. Here, we distinguish the contribution of specific cortical regions, to speech recognition and speaker information processing, by controlled manipulation of task and resynthesized speaker parameters. Two functional magnetic resonance imaging studies provide evidence for a dynamic speech-processing network that questions the conventional view. We found that speech recognition regions in left posterior superior temporal gyrus/superior temporal sulcus (STG/STS) also encode speaker-related vocal tract parameters, which are reflected in the amplitude peaks of the speech spectrum, along with the speech message. Right posterior STG/STS activated specifically more to a speaker-related vocal tract parameter change during a speech recognition task compared with a voice recognition task. Left and right posterior STG/STS were functionally connected. Additionally, we found that speaker-related glottal fold parameters (e.g., pitch), which are not reflected in the amplitude peaks of the speech spectrum, are processed in areas immediately adjacent to primary auditory cortex, i.e., in areas in the auditory hierarchy earlier than STG/STS. Our results point to a network account of speech recognition, in which information about the speech message and the speaker's vocal tract are combined to solve the difficult task of understanding speech from different speakers.

Entities: Chemical

Mesh：

Substances：
Oxygen

Year: 2010 PMID： 20071527 PMCID： PMC2824128 DOI： 10.1523/JNEUROSCI.2742-09.2010

Source DB: PubMed Journal: J Neurosci ISSN： 0270-6474 Impact factor: 6.167

55 in total

1. Towards a functional neuroanatomy of speech perception.

Authors:
Journal: Trends Cogn Sci Date: 2000-04 Impact factor: 20.229

2. Modulation of neural responses to speech by directing attention to voices or verbal content.

Authors: Katharina von Kriegstein; Evelyn Eger; Andreas Kleinschmidt; Anne Lise Giraud
Journal: Brain Res Cogn Brain Res Date: 2003-06

3. The processing of temporal pitch and melody information in auditory cortex.

Authors: Roy D Patterson; Stefan Uppenkamp; Ingrid S Johnsrude; Timothy D Griffiths
Journal: Neuron Date: 2002-11-14 Impact factor: 17.173

4. Pre-lexical abstraction of speech in the auditory cortex.

Authors: Jonas Obleser; Frank Eisner
Journal: Trends Cogn Sci Date: 2008-12-11 Impact factor: 20.229

5. Right hemisphere speech perception revealed by amobarbital injection and electrical interference.

Authors: D Boatman; J Hart; R P Lesser; N Honeycutt; N B Anderson; D Miglioretti; B Gordon
Journal: Neurology Date: 1998-08 Impact factor: 9.910

6. Words and voices: episodic traces in spoken word identification and recognition memory.

Authors: S D Goldinger
Journal: J Exp Psychol Learn Mem Cogn Date: 1996-09 Impact factor: 3.051

Review 7. Auditory processing--speech, space and auditory objects.

Authors: Sophie K Scott
Journal: Curr Opin Neurobiol Date: 2005-04 Impact factor: 6.627

8. Distinct functional substrates along the right superior temporal sulcus for the processing of voices.

Authors: Katharina V Kriegstein; Anne-Lise Giraud
Journal: Neuroimage Date: 2004-06 Impact factor: 6.556

9. Human cortical organization for processing vocalizations indicates representation of harmonic structure as a signal attribute.

Authors: James W Lewis; William J Talkington; Nathan A Walker; George A Spirou; Audrey Jajosky; Chris Frum; Julie A Brefczynski-Lewis
Journal: J Neurosci Date: 2009-02-18 Impact factor: 6.167

10. Recognizing sequences of sequences.

Authors: Stefan J Kiebel; Katharina von Kriegstein; Jean Daunizeau; Karl J Friston
Journal: PLoS Comput Biol Date: 2009-08-14 Impact factor: 4.475

25 in total

1. Neural correlates of the perception of contrastive prosodic focus in French: a functional magnetic resonance imaging study.

Authors: Marcela Perrone-Bertolotti; Marion Dohen; Hélène Lœvenbruck; Marc Sato; Cédric Pichat; Monica Baciu
Journal: Hum Brain Mapp Date: 2012-04-05 Impact factor: 5.038

2. The pace of prosodic phrasing couples the listener's cortex to the reader's voice.

Authors: Mathieu Bourguignon; Xavier De Tiège; Marc Op de Beeck; Noémie Ligot; Philippe Paquier; Patrick Van Bogaert; Serge Goldman; Riitta Hari; Veikko Jousmäki
Journal: Hum Brain Mapp Date: 2012-01-03 Impact factor: 5.038

3. Multivariate sensitivity to voice during auditory categorization.

Authors: Yune Sang Lee; Jonathan E Peelle; David Kraemer; Samuel Lloyd; Richard Granger
Journal: J Neurophysiol Date: 2015-08-05 Impact factor: 2.714

Review 4. Central auditory disorders: toward a neuropsychology of auditory objects.

Authors: Johanna C Goll; Sebastian J Crutch; Jason D Warren
Journal: Curr Opin Neurol Date: 2010-12 Impact factor: 5.710

5. Brain systems mediating voice identity processing in blind humans.

Authors: Cordula Hölig; Julia Föcker; Anna Best; Brigitte Röder; Christian Büchel
Journal: Hum Brain Mapp Date: 2014-03-17 Impact factor: 5.038

6. Voice-sensitive brain networks encode talker-specific phonetic detail.

Authors: Emily B Myers; Rachel M Theodore
Journal: Brain Lang Date: 2016-11-27 Impact factor: 2.381

7. Non-native listeners' recognition of high-variability speech using PRESTO.

Authors: Terrin N Tamati; David B Pisoni
Journal: J Am Acad Audiol Date: 2014-10 Impact factor: 1.664

8. Training-induced brain activation and functional connectivity differentiate multi-talker and single-talker speech training.

Authors: Zhizhou Deng; Bharath Chandrasekaran; Suiping Wang; Patrick C M Wong
Journal: Neurobiol Learn Mem Date: 2018-03-10 Impact factor: 2.877

Review 9. Dynamic speech representations in the human temporal lobe.

Authors: Matthew K Leonard; Edward F Chang
Journal: Trends Cogn Sci Date: 2014-06-03 Impact factor: 20.229

10. Speaker Invariance for Phonetic Information: an fMRI Investigation.

Authors: Caden Salvata; Sheila E Blumstein; Emily B Myers
Journal: Lang Cogn Process Date: 2011-08-19