Literature DB >> 12002874

YIN, a fundamental frequency estimator for speech and music.

Alain de Cheveigné1, Hideki Kawahara.   

Abstract

An algorithm is presented for the estimation of the fundamental frequency (F0) of speech or musical sounds. It is based on the well-known autocorrelation method with a number of modifications that combine to prevent errors. The algorithm has several desirable features. Error rates are about three times lower than the best competing methods, as evaluated over a database of speech recorded together with a laryngograph signal. There is no upper limit on the frequency search range, so the algorithm is suited for high-pitched voices and music. The algorithm is relatively simple and may be implemented efficiently and with low latency, and it involves few parameters that must be tuned. It is based on a signal model (periodic signal) that may be extended in several ways to handle various forms of aperiodicity that occur in particular applications. Finally, interesting parallels may be drawn with models of auditory processing.

Mesh:

Year:  2002        PMID: 12002874     DOI: 10.1121/1.1458024

Source DB:  PubMed          Journal:  J Acoust Soc Am        ISSN: 0001-4966            Impact factor:   1.840


  59 in total

1.  Shifting fundamental frequency in simulated electric-acoustic listening.

Authors:  Christopher A Brown; Nicole M Scherrer; Sid P Bacon
Journal:  J Acoust Soc Am       Date:  2010-09       Impact factor: 1.840

2.  Predicting plasticity: acute context-dependent changes to vocal performance predict long-term age-dependent changes.

Authors:  Logan S James; Jon T Sakata
Journal:  J Neurophysiol       Date:  2015-08-26       Impact factor: 2.714

3.  Automatic reconstruction of physiological gestures used in a model of birdsong production.

Authors:  Santiago Boari; Yonatan Sanz Perl; Ana Amador; Daniel Margoliash; Gabriel B Mindlin
Journal:  J Neurophysiol       Date:  2015-09-16       Impact factor: 2.714

4.  Perceptual coherence in listeners having longstanding childhood hearing losses, listeners with adult-onset hearing losses, and listeners with normal hearing.

Authors:  Andrea Pittman
Journal:  J Acoust Soc Am       Date:  2008-01       Impact factor: 1.840

5.  Speech identification based on temporal fine structure cues.

Authors:  Stanley Sheft; Marine Ardoint; Christian Lorenzi
Journal:  J Acoust Soc Am       Date:  2008-07       Impact factor: 1.840

6.  Low-frequency speech cues and simulated electric-acoustic hearing.

Authors:  Christopher A Brown; Sid P Bacon
Journal:  J Acoust Soc Am       Date:  2009-03       Impact factor: 1.840

7.  Pitch discrimination with mixtures of three concurrent harmonic complexes.

Authors:  Jackson E Graves; Andrew J Oxenham
Journal:  J Acoust Soc Am       Date:  2019-04       Impact factor: 1.840

8.  Speaker-dependent multipitch tracking using deep neural networks.

Authors:  Yuzhou Liu; DeLiang Wang
Journal:  J Acoust Soc Am       Date:  2017-02       Impact factor: 1.840

9.  Intermittent theta burst stimulation over right somatosensory larynx cortex enhances vocal pitch-regulation in nonsingers.

Authors:  Sebastian Finkel; Ralf Veit; Martin Lotze; Anders Friberg; Peter Vuust; Surjo Soekadar; Niels Birbaumer; Boris Kleber
Journal:  Hum Brain Mapp       Date:  2019-01-21       Impact factor: 5.038

10.  Vocal accuracy and neural plasticity following micromelody-discrimination training.

Authors:  Jean Mary Zarate; Karine Delhommeau; Sean Wood; Robert J Zatorre
Journal:  PLoS One       Date:  2010-06-17       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.