Literature DB >> 27484713

Neural speech recognition: continuous phoneme decoding using spatiotemporal representations of human cortical activity.

David A Moses1, Nima Mesgarani, Matthew K Leonard, Edward F Chang.   

Abstract

OBJECTIVE: The superior temporal gyrus (STG) and neighboring brain regions play a key role in human language processing. Previous studies have attempted to reconstruct speech information from brain activity in the STG, but few of them incorporate the probabilistic framework and engineering methodology used in modern speech recognition systems. In this work, we describe the initial efforts toward the design of a neural speech recognition (NSR) system that performs continuous phoneme recognition on English stimuli with arbitrary vocabulary sizes using the high gamma band power of local field potentials in the STG and neighboring cortical areas obtained via electrocorticography. APPROACH: The system implements a Viterbi decoder that incorporates phoneme likelihood estimates from a linear discriminant analysis model and transition probabilities from an n-gram phonemic language model. Grid searches were used in an attempt to determine optimal parameterizations of the feature vectors and Viterbi decoder. MAIN
RESULTS: The performance of the system was significantly improved by using spatiotemporal representations of the neural activity (as opposed to purely spatial representations) and by including language modeling and Viterbi decoding in the NSR system. SIGNIFICANCE: These results emphasize the importance of modeling the temporal dynamics of neural responses when analyzing their variations with respect to varying stimuli and demonstrate that speech recognition techniques can be successfully leveraged when decoding speech from neural signals. Guided by the results detailed in this work, further development of the NSR system could have applications in the fields of automatic speech recognition and neural prosthetics.

Entities:  

Mesh:

Year:  2016        PMID: 27484713      PMCID: PMC5031534          DOI: 10.1088/1741-2560/13/5/056004

Source DB:  PubMed          Journal:  J Neural Eng        ISSN: 1741-2552            Impact factor:   5.379


  36 in total

1.  Phase patterns of neuronal responses reliably discriminate speech in human auditory cortex.

Authors:  Huan Luo; David Poeppel
Journal:  Neuron       Date:  2007-06-21       Impact factor: 17.173

Review 2.  State-dependent computations: spatiotemporal processing in cortical networks.

Authors:  Dean V Buonomano; Wolfgang Maass
Journal:  Nat Rev Neurosci       Date:  2009-01-15       Impact factor: 34.870

3.  Moving beyond Kucera and Francis: a critical evaluation of current word frequency norms and the introduction of a new and improved word frequency measure for American English.

Authors:  Marc Brysbaert; Boris New
Journal:  Behav Res Methods       Date:  2009-11

Review 4.  The locked-in syndrome : what is it like to be conscious but paralyzed and voiceless?

Authors:  Steven Laureys; Frédéric Pellas; Philippe Van Eeckhout; Sofiane Ghorbel; Caroline Schnakers; Fabien Perrin; Jacques Berré; Marie-Elisabeth Faymonville; Karl-Heinz Pantke; Francois Damas; Maurice Lamy; Gustave Moonen; Serge Goldman
Journal:  Prog Brain Res       Date:  2005       Impact factor: 2.453

5.  Intracranial study of speech-elicited activity on the human posterolateral superior temporal gyrus.

Authors:  Mitchell Steinschneider; Kirill V Nourski; Hiroto Kawasaki; Hiroyuki Oya; John F Brugge; Matthew A Howard
Journal:  Cereb Cortex       Date:  2011-03-02       Impact factor: 5.357

6.  Neuroperceptual differences in consonant and vowel discrimination: as revealed by direct cortical electrical interference.

Authors:  D Boatman; C Hall; M H Goldstein; R Lesser; B Gordon
Journal:  Cortex       Date:  1997-03       Impact factor: 4.027

7.  Large-scale heterogeneous representation of sound attributes in rat primary auditory cortex: from unit activity to population dynamics.

Authors:  Takeshi Ogawa; Jorge Riera; Takakuni Goto; Akira Sumiyoshi; Hiroi Nonaka; Karim Jerbi; Olivier Bertrand; Ryuta Kawashima
Journal:  J Neurosci       Date:  2011-10-12       Impact factor: 6.167

8.  Spatiotemporal dynamics of electrocorticographic high gamma activity during overt and covert word repetition.

Authors:  Xiaomei Pei; Eric C Leuthardt; Charles M Gaona; Peter Brunner; Jonathan R Wolpaw; Gerwin Schalk
Journal:  Neuroimage       Date:  2010-10-26       Impact factor: 6.556

9.  Reconstructing speech from human auditory cortex.

Authors:  Brian N Pasley; Stephen V David; Nima Mesgarani; Adeen Flinker; Shihab A Shamma; Nathan E Crone; Robert T Knight; Edward F Chang
Journal:  PLoS Biol       Date:  2012-01-31       Impact factor: 8.029

10.  Neural adaptation to silence in the human auditory cortex: a magnetoencephalographic study.

Authors:  Hidehiko Okamoto; Ryusuke Kakigi
Journal:  Brain Behav       Date:  2014-09-30       Impact factor: 2.708

View more
  16 in total

1.  The Control of Vocal Pitch in Human Laryngeal Motor Cortex.

Authors:  Benjamin K Dichter; Jonathan D Breshears; Matthew K Leonard; Edward F Chang
Journal:  Cell       Date:  2018-06-28       Impact factor: 41.582

Review 2.  The Potential for a Speech Brain-Computer Interface Using Chronic Electrocorticography.

Authors:  Qinwan Rabbani; Griffin Milsap; Nathan E Crone
Journal:  Neurotherapeutics       Date:  2019-01       Impact factor: 7.620

3.  Speech synthesis from ECoG using densely connected 3D convolutional neural networks.

Authors:  Miguel Angrick; Christian Herff; Emily Mugler; Matthew C Tate; Marc W Slutzky; Dean J Krusienski; Tanja Schultz
Journal:  J Neural Eng       Date:  2019-03-04       Impact factor: 5.379

4.  The Cortical Organization of Syntax.

Authors:  William Matchin; Gregory Hickok
Journal:  Cereb Cortex       Date:  2020-03-14       Impact factor: 5.357

5.  A low-cost, scalable, current-sensing digital headstage for high channel count μECoG.

Authors:  Michael Trumpis; Michele Insanally; Jialin Zou; Ashraf Elsharif; Ali Ghomashchi; N Sertac Artan; Robert C Froemke; Jonathan Viventi
Journal:  J Neural Eng       Date:  2017-01-19       Impact factor: 5.379

6.  Parallel and distributed encoding of speech across human auditory cortex.

Authors:  Liberty S Hamilton; Yulia Oganian; Jeffery Hall; Edward F Chang
Journal:  Cell       Date:  2021-08-18       Impact factor: 66.850

7.  The Neuroanatomy of Speech Processing: A Large-scale Lesion Study.

Authors:  Corianne Rogalsky; Alexandra Basilakos; Chris Rorden; Sara Pillay; Arianna N LaCroix; Lynsey Keator; Soren Mickelsen; Steven W Anderson; Tracy Love; Julius Fridriksson; Jeffrey Binder; Gregory Hickok
Journal:  J Cogn Neurosci       Date:  2022-07-01       Impact factor: 3.420

Review 8.  Brain-Computer Interface: Applications to Speech Decoding and Synthesis to Augment Communication.

Authors:  Shiyu Luo; Qinwan Rabbani; Nathan E Crone
Journal:  Neurotherapeutics       Date:  2022-01-31       Impact factor: 6.088

9.  Intracranial Electrophysiology of Auditory Selective Attention Associated with Speech Classification Tasks.

Authors:  Kirill V Nourski; Mitchell Steinschneider; Ariane E Rhone; Matthew A Howard Iii
Journal:  Front Hum Neurosci       Date:  2017-01-10       Impact factor: 3.169

10.  Multiple decisions about one object involve parallel sensory acquisition but time-multiplexed evidence incorporation.

Authors:  Yul Hr Kang; Anne Löffler; Danique Jeurissen; Ariel Zylberberg; Daniel M Wolpert; Michael N Shadlen
Journal:  Elife       Date:  2021-03-10       Impact factor: 8.713

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.