Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Neural speech recognition: continuous phoneme decoding using spatiotemporal representations of human cortical activity.

Literature DB >> 27484713

Neural speech recognition: continuous phoneme decoding using spatiotemporal representations of human cortical activity.

David A Moses¹, Nima Mesgarani, Matthew K Leonard, Edward F Chang.

Abstract

OBJECTIVE: The superior temporal gyrus (STG) and neighboring brain regions play a key role in human language processing. Previous studies have attempted to reconstruct speech information from brain activity in the STG, but few of them incorporate the probabilistic framework and engineering methodology used in modern speech recognition systems. In this work, we describe the initial efforts toward the design of a neural speech recognition (NSR) system that performs continuous phoneme recognition on English stimuli with arbitrary vocabulary sizes using the high gamma band power of local field potentials in the STG and neighboring cortical areas obtained via electrocorticography. APPROACH: The system implements a Viterbi decoder that incorporates phoneme likelihood estimates from a linear discriminant analysis model and transition probabilities from an n-gram phonemic language model. Grid searches were used in an attempt to determine optimal parameterizations of the feature vectors and Viterbi decoder. MAIN
RESULTS: The performance of the system was significantly improved by using spatiotemporal representations of the neural activity (as opposed to purely spatial representations) and by including language modeling and Viterbi decoding in the NSR system. SIGNIFICANCE: These results emphasize the importance of modeling the temporal dynamics of neural responses when analyzing their variations with respect to varying stimuli and demonstrate that speech recognition techniques can be successfully leveraged when decoding speech from neural signals. Guided by the results detailed in this work, further development of the NSR system could have applications in the fields of automatic speech recognition and neural prosthetics.

Entities: Chemical Disease Gene Species

Mesh：

Year: 2016 PMID： 27484713 PMCID： PMC5031534 DOI： 10.1088/1741-2560/13/5/056004

Source DB: PubMed Journal: J Neural Eng ISSN： 1741-2552 Impact factor: 5.379

36 in total

1. Phase patterns of neuronal responses reliably discriminate speech in human auditory cortex.

Authors: Huan Luo; David Poeppel
Journal: Neuron Date: 2007-06-21 Impact factor: 17.173

Review 2. State-dependent computations: spatiotemporal processing in cortical networks.

Authors: Dean V Buonomano; Wolfgang Maass
Journal: Nat Rev Neurosci Date: 2009-01-15 Impact factor: 34.870

3. Moving beyond Kucera and Francis: a critical evaluation of current word frequency norms and the introduction of a new and improved word frequency measure for American English.

Authors: Marc Brysbaert; Boris New
Journal: Behav Res Methods Date: 2009-11

Review 4. The locked-in syndrome : what is it like to be conscious but paralyzed and voiceless?

Authors: Steven Laureys; Frédéric Pellas; Philippe Van Eeckhout; Sofiane Ghorbel; Caroline Schnakers; Fabien Perrin; Jacques Berré; Marie-Elisabeth Faymonville; Karl-Heinz Pantke; Francois Damas; Maurice Lamy; Gustave Moonen; Serge Goldman
Journal: Prog Brain Res Date: 2005 Impact factor: 2.453

5. Intracranial study of speech-elicited activity on the human posterolateral superior temporal gyrus.

Authors: Mitchell Steinschneider; Kirill V Nourski; Hiroto Kawasaki; Hiroyuki Oya; John F Brugge; Matthew A Howard
Journal: Cereb Cortex Date: 2011-03-02 Impact factor: 5.357

6. Neuroperceptual differences in consonant and vowel discrimination: as revealed by direct cortical electrical interference.

Authors: D Boatman; C Hall; M H Goldstein; R Lesser; B Gordon
Journal: Cortex Date: 1997-03 Impact factor: 4.027

7. Large-scale heterogeneous representation of sound attributes in rat primary auditory cortex: from unit activity to population dynamics.

Authors: Takeshi Ogawa; Jorge Riera; Takakuni Goto; Akira Sumiyoshi; Hiroi Nonaka; Karim Jerbi; Olivier Bertrand; Ryuta Kawashima
Journal: J Neurosci Date: 2011-10-12 Impact factor: 6.167

8. Spatiotemporal dynamics of electrocorticographic high gamma activity during overt and covert word repetition.

Authors: Xiaomei Pei; Eric C Leuthardt; Charles M Gaona; Peter Brunner; Jonathan R Wolpaw; Gerwin Schalk
Journal: Neuroimage Date: 2010-10-26 Impact factor: 6.556

9. Reconstructing speech from human auditory cortex.

Authors: Brian N Pasley; Stephen V David; Nima Mesgarani; Adeen Flinker; Shihab A Shamma; Nathan E Crone; Robert T Knight; Edward F Chang
Journal: PLoS Biol Date: 2012-01-31 Impact factor: 8.029

10. Neural adaptation to silence in the human auditory cortex: a magnetoencephalographic study.

Authors: Hidehiko Okamoto; Ryusuke Kakigi
Journal: Brain Behav Date: 2014-09-30 Impact factor: 2.708

16 in total

1. The Control of Vocal Pitch in Human Laryngeal Motor Cortex.

Authors: Benjamin K Dichter; Jonathan D Breshears; Matthew K Leonard; Edward F Chang
Journal: Cell Date: 2018-06-28 Impact factor: 41.582

Review 2. The Potential for a Speech Brain-Computer Interface Using Chronic Electrocorticography.

Authors: Qinwan Rabbani; Griffin Milsap; Nathan E Crone
Journal: Neurotherapeutics Date: 2019-01 Impact factor: 7.620

3. Speech synthesis from ECoG using densely connected 3D convolutional neural networks.

Authors: Miguel Angrick; Christian Herff; Emily Mugler; Matthew C Tate; Marc W Slutzky; Dean J Krusienski; Tanja Schultz
Journal: J Neural Eng Date: 2019-03-04 Impact factor: 5.379

4. The Cortical Organization of Syntax.

Authors: William Matchin; Gregory Hickok
Journal: Cereb Cortex Date: 2020-03-14 Impact factor: 5.357

5. A low-cost, scalable, current-sensing digital headstage for high channel count μECoG.

Authors: Michael Trumpis; Michele Insanally; Jialin Zou; Ashraf Elsharif; Ali Ghomashchi; N Sertac Artan; Robert C Froemke; Jonathan Viventi
Journal: J Neural Eng Date: 2017-01-19 Impact factor: 5.379

6. Parallel and distributed encoding of speech across human auditory cortex.

Authors: Liberty S Hamilton; Yulia Oganian; Jeffery Hall; Edward F Chang
Journal: Cell Date: 2021-08-18 Impact factor: 66.850

7. The Neuroanatomy of Speech Processing: A Large-scale Lesion Study.

Authors: Corianne Rogalsky; Alexandra Basilakos; Chris Rorden; Sara Pillay; Arianna N LaCroix; Lynsey Keator; Soren Mickelsen; Steven W Anderson; Tracy Love; Julius Fridriksson; Jeffrey Binder; Gregory Hickok
Journal: J Cogn Neurosci Date: 2022-07-01 Impact factor: 3.420

Review 8. Brain-Computer Interface: Applications to Speech Decoding and Synthesis to Augment Communication.

Authors: Shiyu Luo; Qinwan Rabbani; Nathan E Crone
Journal: Neurotherapeutics Date: 2022-01-31 Impact factor: 6.088

9. Intracranial Electrophysiology of Auditory Selective Attention Associated with Speech Classification Tasks.

Authors: Kirill V Nourski; Mitchell Steinschneider; Ariane E Rhone; Matthew A Howard Iii
Journal: Front Hum Neurosci Date: 2017-01-10 Impact factor: 3.169

10. Multiple decisions about one object involve parallel sensory acquisition but time-multiplexed evidence incorporation.

Authors: Yul Hr Kang; Anne Löffler; Danique Jeurissen; Ariel Zylberberg; Daniel M Wolpert; Michael N Shadlen
Journal: Elife Date: 2021-03-10 Impact factor: 8.713