Literature DB >> 32732017

Active listening.

Karl J Friston1, Noor Sajid2, David Ricardo Quiroga-Martinez3, Thomas Parr4, Cathy J Price5, Emma Holmes6.   

Abstract

This paper introduces active listening, as a unified framework for synthesising and recognising speech. The notion of active listening inherits from active inference, which considers perception and action under one universal imperative: to maximise the evidence for our (generative) models of the world. First, we describe a generative model of spoken words that simulates (i) how discrete lexical, prosodic, and speaker attributes give rise to continuous acoustic signals; and conversely (ii) how continuous acoustic signals are recognised as words. The 'active' aspect involves (covertly) segmenting spoken sentences and borrows ideas from active vision. It casts speech segmentation as the selection of internal actions, corresponding to the placement of word boundaries. Practically, word boundaries are selected that maximise the evidence for an internal model of how individual words are generated. We establish face validity by simulating speech recognition and showing how the inferred content of a sentence depends on prior beliefs and background noise. Finally, we consider predictive validity by associating neuronal or physiological responses, such as the mismatch negativity and P300, with belief updating under active listening, which is greatest in the absence of accurate prior beliefs about what will be heard next.
Copyright © 2020 The Authors. Published by Elsevier B.V. All rights reserved.

Entities:  

Keywords:  Audition; Segmentation; Variational Bayes; Voice; active inference; active listening; speech recognition

Mesh:

Year:  2020        PMID: 32732017      PMCID: PMC7812378          DOI: 10.1016/j.heares.2020.107998

Source DB:  PubMed          Journal:  Hear Res        ISSN: 0378-5955            Impact factor:   3.208


  164 in total

1.  Acoustic parameters in human speaker recognition.

Authors:  W A van Dommelen
Journal:  Lang Speech       Date:  1990 Jul-Sep       Impact factor: 1.500

2.  Event-related potential components reflect phonological and semantic processing of the terminal word of spoken sentences.

Authors:  J F Connolly; N A Phillips
Journal:  J Cogn Neurosci       Date:  1994       Impact factor: 3.225

3.  The possible-word constraint in the segmentation of continuous speech.

Authors:  D Norris; J M McQueen; A Cutler; S Butterfield
Journal:  Cogn Psychol       Date:  1997-12       Impact factor: 3.468

Review 4.  Neuroimaging studies of word reading.

Authors:  J A Fiez; S E Petersen
Journal:  Proc Natl Acad Sci U S A       Date:  1998-02-03       Impact factor: 11.205

Review 5.  The functional logic of cortical connections.

Authors:  S Zeki; S Shipp
Journal:  Nature       Date:  1988-09-22       Impact factor: 49.962

6.  Using envelope modulation to explain speech intelligibility in the presence of a single reflection.

Authors:  Ramesh Kumar Muralimanohar; James M Kates; Kathryn H Arehart
Journal:  J Acoust Soc Am       Date:  2017-05       Impact factor: 1.840

7.  Semantic dementia. Progressive fluent aphasia with temporal lobe atrophy.

Authors:  J R Hodges; K Patterson; S Oxbury; E Funnell
Journal:  Brain       Date:  1992-12       Impact factor: 13.501

8.  Semantic context improves speech intelligibility and reduces listening effort for listeners with hearing impairment.

Authors:  Emma Holmes; Paula Folkeard; Ingrid S Johnsrude; Susan Scollie
Journal:  Int J Audiol       Date:  2018-02-07       Impact factor: 2.117

Review 9.  The mismatch negativity: a review of underlying mechanisms.

Authors:  Marta I Garrido; James M Kilner; Klaas E Stephan; Karl J Friston
Journal:  Clin Neurophysiol       Date:  2009-01-31       Impact factor: 3.708

10.  The graphical brain: Belief propagation and active inference.

Authors:  Karl J Friston; Thomas Parr; Bert de Vries
Journal:  Netw Neurosci       Date:  2017-12-31
View more
  6 in total

Review 1.  Active inference, selective attention, and the cocktail party problem.

Authors:  Emma Holmes; Thomas Parr; Timothy D Griffiths; Karl J Friston
Journal:  Neurosci Biobehav Rev       Date:  2021-10-21       Impact factor: 8.989

2.  Neuromodulatory Control and Language Recovery in Bilingual Aphasia: An Active Inference Approach.

Authors:  Noor Sajid; Karl J Friston; Justyna O Ekert; Cathy J Price; David W Green
Journal:  Behav Sci (Basel)       Date:  2020-10-21

3.  Musicianship and melodic predictability enhance neural gain in auditory cortex during pitch deviance detection.

Authors:  David R Quiroga-Martinez; Niels Christian Hansen; Andreas Højlund; Marcus Pearce; Elvira Brattico; Emma Holmes; Karl Friston; Peter Vuust
Journal:  Hum Brain Mapp       Date:  2021-08-30       Impact factor: 5.399

4.  Simulating lesion-dependent functional recovery mechanisms.

Authors:  Noor Sajid; Emma Holmes; Thomas M Hope; Zafeirios Fountas; Cathy J Price; Karl J Friston
Journal:  Sci Rep       Date:  2021-04-02       Impact factor: 4.379

5.  Rapid adaptation of predictive models during language comprehension: Aperiodic EEG slope, individual alpha frequency and idea density modulate individual differences in real-time model updating.

Authors:  Ina Bornkessel-Schlesewsky; Isabella Sharrad; Caitlin A Howlett; Phillip M Alday; Andrew W Corcoran; Valeria Bellan; Erica Wilkinson; Reinhold Kliegl; Richard L Lewis; Steven L Small; Matthias Schlesewsky
Journal:  Front Psychol       Date:  2022-08-26

6.  The predictive function of Swedish word accents.

Authors:  Mikael Roll
Journal:  Front Psychol       Date:  2022-07-28
  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.