Literature DB >> 26167516

Convex weighting criteria for speaking rate estimation.

Yishan Jiao1, Visar Berisha1, Ming Tu1, Julie Liss1.   

Abstract

Speaking rate estimation directly from the speech waveform is a long-standing problem in speech signal processing. In this paper, we pose the speaking rate estimation problem as that of estimating a temporal density function whose integral over a given interval yields the speaking rate within that interval. In contrast to many existing methods, we avoid the more difficult task of detecting individual phonemes within the speech signal and we avoid heuristics such as thresholding the temporal envelope to estimate the number of vowels. Rather, the proposed method aims to learn an optimal weighting function that can be directly applied to time-frequency features in a speech signal to yield a temporal density function. We propose two convex cost functions for learning the weighting functions and an adaptation strategy to customize the approach to a particular speaker using minimal training. The algorithms are evaluated on the TIMIT corpus, on a dysarthric speech corpus, and on the ICSI Switchboard spontaneous speech corpus. Results show that the proposed methods outperform three competing methods on both healthy and dysarthric speech. In addition, for spontaneous speech rate estimation, the result show a high correlation between the estimated speaking rate and ground truth values.

Entities:  

Year:  2015        PMID: 26167516      PMCID: PMC4497798          DOI: 10.1109/TASLP.2015.2434213

Source DB:  PubMed          Journal:  IEEE/ACM Trans Audio Speech Lang Process


  11 in total

1.  Robust Speech Rate Estimation for Spontaneous Speech.

Authors:  Dagen Wang; Shrikanth S Narayanan
Journal:  IEEE Trans Audio Speech Lang Process       Date:  2007-11-01

2.  Discriminating dysarthria type from envelope modulation spectra.

Authors:  Julie M Liss; Sue LeGendre; Andrew J Lotto
Journal:  J Speech Lang Hear Res       Date:  2010-07-19       Impact factor: 2.297

3.  Rhythm as a coordinating device: entrainment with disordered speech.

Authors:  Stephanie A Borrie; Julie M Liss
Journal:  J Speech Lang Hear Res       Date:  2014-06-01       Impact factor: 2.297

4.  Automatic segmentation of speech into syllabic units.

Authors:  P Mermelstein
Journal:  J Acoust Soc Am       Date:  1975-10       Impact factor: 1.840

5.  Effects of speech rate on the absolute and relative timing of apraxic and conduction aphasic sentence production.

Authors:  M R McNeil; J M Liss; C H Tseng; R D Kent
Journal:  Brain Lang       Date:  1990-01       Impact factor: 2.381

6.  Effect of speaking rate on the perceptual structure of a phonetic category.

Authors:  J L Miller; L E Volaitis
Journal:  Percept Psychophys       Date:  1989-12

7.  Speaking rate and speech movement velocity profiles.

Authors:  S G Adams; G Weismer; R D Kent
Journal:  J Speech Hear Res       Date:  1993-02

8.  The influence of speaking rate on vowel space and speech intelligibility for individuals with amyotrophic lateral sclerosis.

Authors:  G S Turner; K Tjaden; G Weismer
Journal:  J Speech Hear Res       Date:  1995-10

9.  Quantifying speech rhythm abnormalities in the dysarthrias.

Authors:  Julie M Liss; Laurence White; Sven L Mattys; Kaitlin Lansford; Andrew J Lotto; Stephanie M Spitzer; John N Caviness
Journal:  J Speech Lang Hear Res       Date:  2009-08-28       Impact factor: 2.297

10.  Praat script to detect syllable nuclei and measure speech rate automatically.

Authors:  Nivja H de Jong; Ton Wempe
Journal:  Behav Res Methods       Date:  2009-05
View more
  3 in total

1.  Altered speech with migraine attacks: A prospective, longitudinal study of episodic migraine without aura.

Authors:  Todd J Schwedt; Jacob Peplinski; Pamela Garcia-Filion; Visar Berisha
Journal:  Cephalalgia       Date:  2018-11-17       Impact factor: 6.292

2.  The relationship between perceptual disturbances in dysarthric speech and automatic speech recognition performance.

Authors:  Ming Tu; Alan Wisler; Visar Berisha; Julie M Liss
Journal:  J Acoust Soc Am       Date:  2016-11       Impact factor: 1.840

3.  Acoustic and perceptual speech characteristics of native Mandarin speakers with Parkinson's disease.

Authors:  Sih-Chiao Hsu; Yishan Jiao; Megan J McAuliffe; Visar Berisha; Ruey-Meei Wu; Erika S Levy
Journal:  J Acoust Soc Am       Date:  2017-03       Impact factor: 1.840

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.