Schema learning for the cocktail party problem.

Kevin J P Woods, Josh H McDermott.

Abstract

The cocktail party problem requires listeners to infer individual sound sources from mixtures of sound. The problem can be solved only by leveraging regularities in natural sound sources, but little is known about how such regularities are internalized. We explored whether listeners learn source "schemas" (the abstract structure shared by different occurrences of the same type of sound source) and use them to infer sources from mixtures. We measured the ability of listeners to segregate mixtures of time-varying sources. In each experiment, a subset of trials contained schema-based sources generated from a common template by transformations (transposition and time dilation) that introduced acoustic variation but preserved abstract structure. Across several tasks and classes of sound sources, schema-based sources consistently aided source separation, in some cases producing rapid improvements in performance over the first few exposures to a schema. Learning persisted across blocks that did not contain the learned schema, and listeners were able to learn and use multiple schemas simultaneously. No learning was evident when schemas were presented in the task-irrelevant (i.e., distractor) source. However, learning from task-relevant stimuli showed signs of being implicit, in that listeners were no more likely to report that sources recurred in experiments containing schema-based sources than in control experiments containing no schema-based sources. The results implicate a mechanism for rapidly internalizing abstract sound structure, facilitating accurate perceptual organization of sound sources that recur in the environment.
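
As an illustration of how transposition and time dilation can vary a source acoustically while preserving its abstract structure, the following is a minimal sketch and not the authors' stimulus-generation code. The `Note` representation, the example `TEMPLATE`, and the transformation ranges are all hypothetical choices made for this example; the abstract does not specify them.

```python
import random
from dataclasses import dataclass


@dataclass(frozen=True)
class Note:
    onset: float     # seconds from the start of the source
    duration: float  # seconds
    pitch: float     # semitones relative to an arbitrary reference


def transform(template, semitones, dilation):
    """Transpose every note by `semitones` and stretch time by `dilation`."""
    return [
        Note(n.onset * dilation, n.duration * dilation, n.pitch + semitones)
        for n in template
    ]


def intervals(notes):
    """Pitch steps between successive notes; unchanged by transposition."""
    return [b.pitch - a.pitch for a, b in zip(notes, notes[1:])]


def relative_timing(notes):
    """Onsets as a fraction of total length; unchanged by time dilation."""
    total = notes[-1].onset + notes[-1].duration
    return [n.onset / total for n in notes]


def make_variants(template, count, seed=0):
    """Draw `count` variants of one template (ranges are assumptions)."""
    rng = random.Random(seed)
    return [
        transform(template, rng.randint(-6, 6), rng.uniform(0.5, 2.0))
        for _ in range(count)
    ]


# Hypothetical four-note template, used only for illustration.
TEMPLATE = [
    Note(0.0, 0.2, 0.0),
    Note(0.25, 0.2, 4.0),
    Note(0.5, 0.4, 7.0),
    Note(1.0, 0.2, 2.0),
]

if __name__ == "__main__":
    for variant in make_variants(TEMPLATE, 3):
        print(intervals(variant), [round(n.onset, 3) for n in variant])
```

Each variant differs from the template in absolute pitch and absolute timing, yet `intervals` and `relative_timing` return the same values for all of them (up to floating-point rounding). That shared structure is the sense in which such variants could be said to follow one schema.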

Keywords:  auditory scene analysis; implicit learning; perceptual learning; statistical learning

Year:  2018        PMID: 29563229      PMCID: PMC5889675          DOI: 10.1073/pnas.1801614115

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


