Literature DB >> 10380672

Missing-data model of vowel identification.

A de Cheveigné1, H Kawahara.   

Abstract

Vowel identity correlates well with the shape of the transfer function of the vocal tract, in particular the position of the first two or three formant peaks. However, in voiced speech the transfer function is sampled at multiples of the fundamental frequency (F0), and the short-term spectrum contains peaks at those frequencies, rather than at formants. It is not clear how the auditory system estimates the original spectral envelope from the vowel waveform. Cochlear excitation patterns, for example, resolve harmonics in the low-frequency region and their shape varies strongly with F0. The problem cannot be cured by smoothing: lag-domain components of the spectral envelope are aliased and cause F0-dependent distortion. The problem is severe at high F0's where the spectral envelope is severely undersampled. This paper treats vowel identification as a process of pattern recognition with missing data. Matching is restricted to available data, and missing data are ignored using an F0-dependent weighting function that emphasizes regions near harmonics. The model is presented in two versions: a frequency-domain version based on short-term spectra, or tonotopic excitation patterns, and a time-domain version based on autocorrelation functions. It accounts for the relative F0-independency observed in vowel identification.

Mesh:

Year:  1999        PMID: 10380672     DOI: 10.1121/1.424675

Source DB:  PubMed          Journal:  J Acoust Soc Am        ISSN: 0001-4966            Impact factor:   1.840


  8 in total

1.  Temporal integration in vowel perception.

Authors:  Andrew B Wallace; Sheila E Blumstein
Journal:  J Acoust Soc Am       Date:  2009-03       Impact factor: 1.840

2.  The Effect of Dynamic Pitch on Speech Recognition in Temporally Modulated Noise.

Authors:  Jing Shen; Pamela E Souza
Journal:  J Speech Lang Hear Res       Date:  2017-09-18       Impact factor: 2.297

3.  Comparing measurement errors for formants in synthetic and natural vowels.

Authors:  Christine H Shadle; Hosung Nam; D H Whalen
Journal:  J Acoust Soc Am       Date:  2016-02       Impact factor: 1.840

4.  Auditory training of speech recognition with interrupted and continuous noise maskers by children with hearing impairment.

Authors:  Jessica R Sullivan; Linda M Thibodeau; Peter F Assmann
Journal:  J Acoust Soc Am       Date:  2013-01       Impact factor: 1.840

5.  A statistical, formant-pattern model for segregating vowel type and vocal-tract length in developmental formant data.

Authors:  Richard E Turner; Thomas C Walters; Jessica J M Monaghan; Roy D Patterson
Journal:  J Acoust Soc Am       Date:  2009-04       Impact factor: 1.840

6.  The role of spectral cues in timbre discrimination by ferrets and humans.

Authors:  Stephen M Town; Huriye Atilgan; Katherine C Wood; Jennifer K Bizley
Journal:  J Acoust Soc Am       Date:  2015-05       Impact factor: 1.840

7.  Spectral timbre perception in ferrets: discrimination of artificial vowels under different listening conditions.

Authors:  Jennifer K Bizley; Kerry M M Walker; Andrew J King; Jan W H Schnupp
Journal:  J Acoust Soc Am       Date:  2013-01       Impact factor: 1.840

8.  Neural Representation of Concurrent Vowels in Macaque Primary Auditory Cortex.

Authors:  Yonatan I Fishman; Christophe Micheyl; Mitchell Steinschneider
Journal:  eNeuro       Date:  2016-06-10
  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.