Literature DB >> 35232067

Making sense of periodicity glimpses in a prediction-update-loop-A computational model of attentive voice tracking.

Joanna Luberadzka1, Hendrik Kayser1, Volker Hohmann1.   

Abstract

Humans are able to follow a speaker even in challenging acoustic conditions. The perceptual mechanisms underlying this ability remain unclear. A computational model of attentive voice tracking, consisting of four computational blocks: (1) sparse periodicity-based auditory features (sPAF) extraction, (2) foreground-background segregation, (3) state estimation, and (4) top-down knowledge, is presented. The model connects the theories about auditory glimpses, foreground-background segregation, and Bayesian inference. It is implemented with the sPAF, sequential Monte Carlo sampling, and probabilistic voice models. The model is evaluated by comparing it with the human data obtained in the study by Woods and McDermott [Curr. Biol. 25(17), 2238-2246 (2015)], which measured the ability to track one of two competing voices with time-varying parameters [fundamental frequency (F0) and formants (F1,F2)]. Three model versions were tested, which differ in the type of information used for the segregation: version (a) uses the oracle F0, version (b) uses the estimated F0, and version (c) uses the spectral shape derived from the estimated F0 and oracle F1 and F2. Version (a) simulates the optimal human performance in conditions with the largest separation between the voices, version (b) simulates the conditions in which the separation in not sufficient to follow the voices, and version (c) is closest to the human performance for moderate voice separation.

Entities:  

Mesh:

Year:  2022        PMID: 35232067      PMCID: PMC9088677          DOI: 10.1121/10.0009337

Source DB:  PubMed          Journal:  J Acoust Soc Am        ISSN: 0001-4966            Impact factor:   2.482


  51 in total

1.  Coding of temporally fluctuating interaural timing disparities in a binaural processing model based on phase differences.

Authors:  Mathias Dietz; Stephan D Ewert; Volker Hohmann; Birger Kollmeier
Journal:  Brain Res       Date:  2007-09-21       Impact factor: 3.252

2.  The cocktail party problem.

Authors:  Josh H McDermott
Journal:  Curr Biol       Date:  2009-12-01       Impact factor: 10.834

3.  Mechanisms of noise robust representation of speech in primary auditory cortex.

Authors:  Nima Mesgarani; Stephen V David; Jonathan B Fritz; Shihab A Shamma
Journal:  Proc Natl Acad Sci U S A       Date:  2014-04-21       Impact factor: 11.205

4.  Spectro-temporal templates unify the pitch percepts of resolved and unresolved harmonics.

Authors:  Shihab Shamma; Kelsey Dutta
Journal:  J Acoust Soc Am       Date:  2019-02       Impact factor: 1.840

5.  On the Contribution of Target Audibility to Performance in Spatialized Speech Mixtures.

Authors:  Virginia Best; Christine R Mason; Jayaganesh Swaminathan; Gerald Kidd; Kasey M Jakien; Sean D Kampel; Frederick J Gallun; Jörg M Buchholz; Helen Glyde
Journal:  Adv Exp Med Biol       Date:  2016       Impact factor: 2.622

6.  Modeling speech localization, talker identification, and word recognition in a multi-talker setting.

Authors:  Angela Josupeit; Volker Hohmann
Journal:  J Acoust Soc Am       Date:  2017-07       Impact factor: 1.840

7.  Early selective-attention effect on evoked potential reinterpreted.

Authors:  R Näätänen; A W Gaillard; S Mäntysalo
Journal:  Acta Psychol (Amst)       Date:  1978-07

Review 8.  Modelling auditory attention.

Authors:  Emine Merve Kaya; Mounya Elhilali
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2017-01-02       Impact factor: 6.237

9.  Speech perception is similar for musicians and non-musicians across a wide range of conditions.

Authors:  Sara M K Madsen; Marton Marschall; Torsten Dau; Andrew J Oxenham
Journal:  Sci Rep       Date:  2019-07-18       Impact factor: 4.379

10.  Tracking Musical Voices in Bach's The Art of the Fugue: Timbral Heterogeneity Differentially Affects Younger Normal-Hearing Listeners and Older Hearing-Aid Users.

Authors:  Kai Siedenburg; Kirsten Goldmann; Steven van de Par
Journal:  Front Psychol       Date:  2021-04-14
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.