Literature DB >> 24815269

Robust fundamental frequency estimation in sustained vowels: detailed algorithmic comparisons and information fusion with adaptive Kalman filtering.

Athanasios Tsanas1, Matías Zañartu2, Max A Little3, Cynthia Fox4, Lorraine O Ramig5, Gari D Clifford1.   

Abstract

There has been consistent interest among speech signal processing researchers in the accurate estimation of the fundamental frequency (F(0)) of speech signals. This study examines ten F(0) estimation algorithms (some well-established and some proposed more recently) to determine which of these algorithms is, on average, better able to estimate F(0) in the sustained vowel /a/. Moreover, a robust method for adaptively weighting the estimates of individual F(0) estimation algorithms based on quality and performance measures is proposed, using an adaptive Kalman filter (KF) framework. The accuracy of the algorithms is validated using (a) a database of 117 synthetic realistic phonations obtained using a sophisticated physiological model of speech production and (b) a database of 65 recordings of human phonations where the glottal cycles are calculated from electroglottograph signals. On average, the sawtooth waveform inspired pitch estimator and the nearly defect-free algorithms provided the best individual F(0) estimates, and the proposed KF approach resulted in a ∼16% improvement in accuracy over the best single F(0) estimation algorithm. These findings may be useful in speech signal processing applications where sustained vowels are used to assess vocal quality, when very accurate F(0) estimation is required.

Entities:  

Mesh:

Year:  2014        PMID: 24815269      PMCID: PMC4032429          DOI: 10.1121/1.4870484

Source DB:  PubMed          Journal:  J Acoust Soc Am        ISSN: 0001-4966            Impact factor:   1.840


  22 in total

1.  A comparison of high precision F0 extraction algorithms for sustained vowels.

Authors:  V Parsa; D G Jamieson
Journal:  J Speech Lang Hear Res       Date:  1999-02       Impact factor: 2.297

2.  On the use of the derivative of electroglottographic signals for characterization of nonpathological phonation.

Authors:  Nathalie Henrich; Christophe d'Alessandro; Boris Doval; Michèle Castellengo
Journal:  J Acoust Soc Am       Date:  2004-03       Impact factor: 1.840

3.  Testing the assumptions of linear prediction analysis in normal vowels.

Authors:  M A Little; P E McSharry; I M Moroz; S J Roberts
Journal:  J Acoust Soc Am       Date:  2006-01       Impact factor: 1.840

4.  Evaluation of performance of several established pitch detection algorithms in pathological voices.

Authors:  Seung-Jin Jang; Seong-Hee Choi; Hyo-Min Kim; Hong-Shik Choi; Young-Ro Yoon
Journal:  Annu Int Conf IEEE Eng Med Biol Soc       Date:  2007

5.  A sawtooth waveform inspired pitch estimator for speech and music.

Authors:  Arturo Camacho; John G Harris
Journal:  J Acoust Soc Am       Date:  2008-09       Impact factor: 1.840

6.  Should jitter be measured by peak picking or by waveform matching?

Authors:  Paul Boersma
Journal:  Folia Phoniatr Logop       Date:  2009-10-10       Impact factor: 0.849

7.  Bifurcations in an asymmetric vocal-fold model.

Authors:  I Steinecke; H Herzel
Journal:  J Acoust Soc Am       Date:  1995-03       Impact factor: 1.840

8.  Objective assessment of vocal hyperfunction: an experimental framework and initial results.

Authors:  R E Hillman; E B Holmberg; J S Perkell; M Walsh; C Vaughan
Journal:  J Speech Hear Res       Date:  1989-06

9.  Voice simulation with a body-cover model of the vocal folds.

Authors:  B H Story; I R Titze
Journal:  J Acoust Soc Am       Date:  1995-02       Impact factor: 1.840

10.  Comparison of Fo extraction methods for high-precision voice perturbation measurements.

Authors:  I R Titze; H Liang
Journal:  J Speech Hear Res       Date:  1993-12
View more
  7 in total

1.  Real-time estimation of aerodynamic features for ambulatory voice biofeedback.

Authors:  Andrés F Llico; Matías Zañartu; Agustín J González; George R Wodicka; Daryush D Mehta; Jarrad H Van Stan; Robert E Hillman
Journal:  J Acoust Soc Am       Date:  2015-07       Impact factor: 1.840

Review 2.  Consensus Paper: Neurophysiological Assessments of Ataxias in Daily Practice.

Authors:  W Ilg; M Branscheidt; A Butala; P Celnik; L de Paola; F B Horak; L Schöls; H A G Teive; A P Vogel; D S Zee; D Timmann
Journal:  Cerebellum       Date:  2018-10       Impact factor: 3.847

3.  Developing a large scale population screening tool for the assessment of Parkinson's disease using telephone-quality voice.

Authors:  Siddharth Arora; Ladan Baghai-Ravary; Athanasios Tsanas
Journal:  J Acoust Soc Am       Date:  2019-05       Impact factor: 1.840

4.  Stage-independent, single lead EEG sleep spindle detection using the continuous wavelet transform and local weighted smoothing.

Authors:  Athanasios Tsanas; Gari D Clifford
Journal:  Front Hum Neurosci       Date:  2015-04-08       Impact factor: 3.169

5.  Tracking cortical entrainment in neural activity: auditory processes in human temporal cortex.

Authors:  Andrew Thwaites; Ian Nimmo-Smith; Elisabeth Fonteneau; Roy D Patterson; Paula Buttery; William D Marslen-Wilson
Journal:  Front Comput Neurosci       Date:  2015-02-10       Impact factor: 2.380

6.  Euclidean Distances as measures of speaker similarity including identical twin pairs: A forensic investigation using source and filter voice characteristics.

Authors:  Eugenia San Segundo; Athanasios Tsanas; Pedro Gómez-Vilda
Journal:  Forensic Sci Int       Date:  2016-11-17       Impact factor: 2.395

7.  Smartphone Application for the Analysis of Prosodic Features in Running Speech with a Focus on Bipolar Disorders: System Performance Evaluation and Case Study.

Authors:  Andrea Guidi; Sergio Salvi; Manuel Ottaviano; Claudio Gentili; Gilles Bertschy; Danilo de Rossi; Enzo Pasquale Scilingo; Nicola Vanello
Journal:  Sensors (Basel)       Date:  2015-11-06       Impact factor: 3.576

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.