Literature DB >> 8609302

Vowel-onset detection by vowel-strength measurement, cochlear-nucleus simulation, and multilayer perceptrons.

R W Kortekaas1, D J Hermes, G F Meyer.   

Abstract

An algorithm for detection of vowel onsets in fluent speech was presented by Hermes [j. Acoust. Soc. Am. 87, 866-873 (1990)]. Performance tests showed that detection was good for fluent speech, although the parameter settings had to be modified for application to well-articulated speech. One of the purposes of the algorithm was application to speech by deaf persons, for which it failed completely. In order to improve the algorithm and to make it more generally applicable, two alternative detection strategies have been explored in the present study. These strategies were (a) simulation of transient-chopper responses in the cochlear nucleus and (B) training of multilayer perceptrons. Two large databases of read speech have been used for performance comparison of the original algorithm and the new strategies. The strategy based on simulating cochlear-nucleus responses is found both to result in a higher false-alarm rate than the original algorithm and to be rather level dependent. On the other hand, the performance of a multilayer-perceptron network, trained on mel-scaled spectra, is comparable to the performance of the Hermes algorithm. In more general terms, the results suggest that temporal information on intensity and (rough) spectral envelope are important for human vowel-onset detection behavior. In formation on harmonicity can be used as a secondary source of information to avoid detection of mainly unvoiced, nonvowel onsets.

Entities:  

Mesh:

Year:  1996        PMID: 8609302     DOI: 10.1121/1.414671

Source DB:  PubMed          Journal:  J Acoust Soc Am        ISSN: 0001-4966            Impact factor:   1.840


  2 in total

1.  Development of a two-stage procedure for the automatic recognition of dysfluencies in the speech of children who stutter: II. ANN recognition of repetitions and prolongations with supplied word segment markers.

Authors:  P Howell; S Sackin; K Glenn
Journal:  J Speech Lang Hear Res       Date:  1997-10       Impact factor: 2.297

2.  An application of univariate and multivariate approaches in FMRI to quantifying the hemispheric lateralization of acoustic and linguistic processes.

Authors:  Carolyn McGettigan; Samuel Evans; Stuart Rosen; Zarinah K Agnew; Poonam Shah; Sophie K Scott
Journal:  J Cogn Neurosci       Date:  2011-11-08       Impact factor: 3.225

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.