Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Missing-data model of vowel identification.

Literature DB >> 10380672

Missing-data model of vowel identification.

Abstract

Vowel identity correlates well with the shape of the transfer function of the vocal tract, in particular the position of the first two or three formant peaks. However, in voiced speech the transfer function is sampled at multiples of the fundamental frequency (F0), and the short-term spectrum contains peaks at those frequencies, rather than at formants. It is not clear how the auditory system estimates the original spectral envelope from the vowel waveform. Cochlear excitation patterns, for example, resolve harmonics in the low-frequency region and their shape varies strongly with F0. The problem cannot be cured by smoothing: lag-domain components of the spectral envelope are aliased and cause F0-dependent distortion. The problem is severe at high F0's where the spectral envelope is severely undersampled. This paper treats vowel identification as a process of pattern recognition with missing data. Matching is restricted to available data, and missing data are ignored using an F0-dependent weighting function that emphasizes regions near harmonics. The model is presented in two versions: a frequency-domain version based on short-term spectra, or tonotopic excitation patterns, and a time-domain version based on autocorrelation functions. It accounts for the relative F0-independency observed in vowel identification.

Mesh：

Year: 1999 PMID： 10380672 DOI： 10.1121/1.424675

Source DB: PubMed Journal: J Acoust Soc Am ISSN： 0001-4966 Impact factor: 1.840

Keyword Cloud
Cited

8 in total

1. Temporal integration in vowel perception.

Authors: Andrew B Wallace; Sheila E Blumstein
Journal: J Acoust Soc Am Date: 2009-03 Impact factor: 1.840

2. The Effect of Dynamic Pitch on Speech Recognition in Temporally Modulated Noise.

Authors: Jing Shen; Pamela E Souza
Journal: J Speech Lang Hear Res Date: 2017-09-18 Impact factor: 2.297

3. Comparing measurement errors for formants in synthetic and natural vowels.

Authors: Christine H Shadle; Hosung Nam; D H Whalen
Journal: J Acoust Soc Am Date: 2016-02 Impact factor: 1.840

4. Auditory training of speech recognition with interrupted and continuous noise maskers by children with hearing impairment.

Authors: Jessica R Sullivan; Linda M Thibodeau; Peter F Assmann
Journal: J Acoust Soc Am Date: 2013-01 Impact factor: 1.840

Missing-data model of vowel identification.

1. Temporal integration in vowel perception.

2. The Effect of Dynamic Pitch on Speech Recognition in Temporally Modulated Noise.

3. Comparing measurement errors for formants in synthetic and natural vowels.

4. Auditory training of speech recognition with interrupted and continuous noise maskers by children with hearing impairment.

5. A statistical, formant-pattern model for segregating vowel type and vocal-tract length in developmental formant data.

6. The role of spectral cues in timbre discrimination by ferrets and humans.

7. Spectral timbre perception in ferrets: discrimination of artificial vowels under different listening conditions.

8. Neural Representation of Concurrent Vowels in Macaque Primary Auditory Cortex.