Literature DB >> 22978900

Kalman-based autoregressive moving average modeling and inference for formant and antiformant tracking.

Daryush D Mehta1, Daniel Rudoy, Patrick J Wolfe.   

Abstract

Vocal tract resonance characteristics in acoustic speech signals are classically tracked using frame-by-frame point estimates of formant frequencies followed by candidate selection and smoothing using dynamic programming methods that minimize ad hoc cost functions. The goal of the current work is to provide both point estimates and associated uncertainties of center frequencies and bandwidths in a statistically principled state-space framework. Extended Kalman (K) algorithms take advantage of a linearized mapping to infer formant and antiformant parameters from frame-based estimates of autoregressive moving average (ARMA) cepstral coefficients. Error analysis of KARMA, wavesurfer, and praat is accomplished in the all-pole case using a manually marked formant database and synthesized speech waveforms. KARMA formant tracks exhibit lower overall root-mean-square error relative to the two benchmark algorithms with the ability to modify parameters in a controlled manner to trade off bias and variance. Antiformant tracking performance of KARMA is illustrated using synthesized and spoken nasal phonemes. The simultaneous tracking of uncertainty levels enables practitioners to recognize time-varying confidence in parameters of interest and adjust algorithmic settings accordingly.

Entities:  

Mesh:

Year:  2012        PMID: 22978900     DOI: 10.1121/1.4739462

Source DB:  PubMed          Journal:  J Acoust Soc Am        ISSN: 0001-4966            Impact factor:   1.840


  6 in total

1.  Statistical properties of linear prediction analysis underlying the challenge of formant bandwidth estimation.

Authors:  Daryush D Mehta; Patrick J Wolfe
Journal:  J Acoust Soc Am       Date:  2015-02       Impact factor: 1.840

2.  Assessment of speech and fine motor coordination in children with autism spectrum disorder.

Authors:  Tanya Talkar; James R Williamson; Daniel Hannon; Hrishikesh M Rao; Sophia Yuditskaya; Kajal Claypool; Douglas Sturim; Lisa Nowinski; Hannah Saro; Carol Stamm; Maria Mody; Christopher J McDougle; Thomas F Quatieri
Journal:  IEEE Access       Date:  2020-07-10       Impact factor: 3.367

3.  Comparing measurement errors for formants in synthetic and natural vowels.

Authors:  Christine H Shadle; Hosung Nam; D H Whalen
Journal:  J Acoust Soc Am       Date:  2016-02       Impact factor: 1.840

4.  Predicting Cognitive Load and Operational Performance in a Simulated Marksmanship Task.

Authors:  Hrishikesh M Rao; Christopher J Smalt; Aaron Rodriguez; Hannah M Wright; Daryush D Mehta; Laura J Brattain; Harvey M Edwards; Adam Lammert; Kristin J Heaton; Thomas F Quatieri
Journal:  Front Hum Neurosci       Date:  2020-07-03       Impact factor: 3.169

5.  A Framework for Biomarkers of COVID-19 Based on Coordination of Speech-Production Subsystems.

Authors:  Thomas F Quatieri; Tanya Talkar; Jeffrey S Palmer
Journal:  IEEE Open J Eng Med Biol       Date:  2020-05-29

6.  Using Dynamics of Eye Movements, Speech Articulation and Brain Activity to Predict and Track mTBI Screening Outcomes.

Authors:  James R Williamson; Doug Sturim; Trina Vian; Joseph Lacirignola; Trey E Shenk; Sophia Yuditskaya; Hrishikesh M Rao; Thomas M Talavage; Kristin J Heaton; Thomas F Quatieri
Journal:  Front Neurol       Date:  2021-07-06       Impact factor: 4.003

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.