Literature DB >> 33690177

Data-driven machine learning models for decoding speech categorization from evoked brain responses.

Md Sultan Mahmud1,2, Mohammed Yeasin1,2, Gavin M Bidelman2,3,4.   

Abstract

Objective.Categorical perception (CP) of audio is critical to understand how the human brain perceives speech sounds despite widespread variability in acoustic properties. Here, we investigated the spatiotemporal characteristics of auditory neural activity that reflects CP for speech (i.e. differentiates phonetic prototypes from ambiguous speech sounds).Approach.We recorded 64-channel electroencephalograms as listeners rapidly classified vowel sounds along an acoustic-phonetic continuum. We used support vector machine classifiers and stability selection to determine when and where in the brain CP was best decoded across space and time via source-level analysis of the event-related potentials.Main results. We found that early (120 ms) whole-brain data decoded speech categories (i.e. prototypical vs. ambiguous tokens) with 95.16% accuracy (area under the curve 95.14%;F1-score 95.00%). Separate analyses on left hemisphere (LH) and right hemisphere (RH) responses showed that LH decoding was more accurate and earlier than RH (89.03% vs. 86.45% accuracy; 140 ms vs. 200 ms). Stability (feature) selection identified 13 regions of interest (ROIs) out of 68 brain regions [including auditory cortex, supramarginal gyrus, and inferior frontal gyrus (IFG)] that showed categorical representation during stimulus encoding (0-260 ms). In contrast, 15 ROIs (including fronto-parietal regions, IFG, motor cortex) were necessary to describe later decision stages (later 300-800 ms) of categorization but these areas were highly associated with the strength of listeners' categorical hearing (i.e. slope of behavioral identification functions).Significance.Our data-driven multivariate models demonstrate that abstract categories emerge surprisingly early (∼120 ms) in the time course of speech processing and are dominated by engagement of a relatively compact fronto-temporal-parietal brain network.
© 2021 IOP Publishing Ltd.

Entities:  

Keywords:  auditory event-related potentials (ERPs); behavioral slope; categorical perception; decision process; machine learning; stability selection; support vector machine (SVM)

Mesh:

Year:  2021        PMID: 33690177      PMCID: PMC8738965          DOI: 10.1088/1741-2552/abecf0

Source DB:  PubMed          Journal:  J Neural Eng        ISSN: 1741-2552            Impact factor:   5.379


  88 in total

1.  The correction of ocular artifacts: a topographic perspective.

Authors:  T W Picton; P van Roon; M L Armilio; P Berg; N Ille; M Scherg
Journal:  Clin Neurophysiol       Date:  2000-01       Impact factor: 3.708

2.  An automated labeling system for subdividing the human cerebral cortex on MRI scans into gyral based regions of interest.

Authors:  Rahul S Desikan; Florent Ségonne; Bruce Fischl; Brian T Quinn; Bradford C Dickerson; Deborah Blacker; Randy L Buckner; Anders M Dale; R Paul Maguire; Bradley T Hyman; Marilyn S Albert; Ronald J Killiany
Journal:  Neuroimage       Date:  2006-03-10       Impact factor: 6.556

3.  Speech perception in infants.

Authors:  P D Eimas; E R Siqueland; P Jusczyk; J Vigorito
Journal:  Science       Date:  1971-01-22       Impact factor: 47.728

4.  A functional magnetic resonance imaging study of the role of left posterior superior temporal gyrus in speech production: implications for the explanation of conduction aphasia.

Authors:  G Hickok; P Erhard; J Kassubek; A K Helms-Tillery; S Naeve-Velguth; J P Strupp; P L Strick; K Ugurbil
Journal:  Neurosci Lett       Date:  2000-06-23       Impact factor: 3.046

Review 5.  Speech perception as categorization.

Authors:  Lori L Holt; Andrew J Lotto
Journal:  Atten Percept Psychophys       Date:  2010-07       Impact factor: 2.199

6.  Sequential processing of lexical, grammatical, and phonological information within Broca's area.

Authors:  Ned T Sahin; Steven Pinker; Sydney S Cash; Donald Schomer; Eric Halgren
Journal:  Science       Date:  2009-10-16       Impact factor: 47.728

7.  Task-General and Acoustic-Invariant Neural Representation of Speech Categories in the Human Brain.

Authors:  Gangyi Feng; Zhenzhong Gan; Suiping Wang; Patrick C M Wong; Bharath Chandrasekaran
Journal:  Cereb Cortex       Date:  2018-09-01       Impact factor: 5.357

8.  The time-course of cortical responses to speech revealed by fast optical imaging.

Authors:  Joseph C Toscano; Nathaniel D Anderson; Monica Fabiani; Gabriele Gratton; Susan M Garnsey
Journal:  Brain Lang       Date:  2018-06-27       Impact factor: 2.381

9.  Speech rhythms and multiplexed oscillatory sensory coding in the human brain.

Authors:  Joachim Gross; Nienke Hoogenboom; Gregor Thut; Philippe Schyns; Stefano Panzeri; Pascal Belin; Simon Garrod
Journal:  PLoS Biol       Date:  2013-12-31       Impact factor: 8.029

10.  Applications of machine learning in cancer prediction and prognosis.

Authors:  Joseph A Cruz; David S Wishart
Journal:  Cancer Inform       Date:  2007-02-11
View more
  2 in total

1.  Speech categorization is better described by induced rather than evoked neural activity.

Authors:  Md Sultan Mahmud; Mohammed Yeasin; Gavin M Bidelman
Journal:  J Acoust Soc Am       Date:  2021-03       Impact factor: 1.840

2.  Lexical Influences on Categorical Speech Perception Are Driven by a Temporoparietal Circuit.

Authors:  Gavin M Bidelman; Claire Pearson; Ashleigh Harrison
Journal:  J Cogn Neurosci       Date:  2021-01-19       Impact factor: 3.420

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.