Literature DB >> 9670541

Importance of tonal envelope cues in Chinese speech recognition.

Q J Fu1, F G Zeng, R V Shannon, S D Soli.   

Abstract

Recent studies have shown that temporal waveform envelope cues can provide significant information for English speech recognition. This study investigated the use of temporal envelope cues in a tonal language: Mandarin Chinese. In this study, the speech was divided into several frequency analysis bands; the amplitude envelope was extracted from each band by half-wave rectification and low-pass filtering and was used to modulate a noise of the same bandwidth as the analysis band. These manipulations preserved temporal and amplitude cues in each frequency band, but removed the spectral detail within each band. Chinese vowels, consonants, tones and sentences were identified by 12 native Chinese-speaking listeners with 1, 2, 3, and 4 noise bands. The results showed that the recognition score of vowels, consonants, and sentences increased monotonically with the number of bands, a pattern similar to that observed in English speech recognition. In contrast, tones were consistently recognized at about 80% correct level, independent of the number of bands. This high level of tone recognition produced a significant difference in the open-set sentence recognition between Chinese (11.0%) and English (2.9%) for the one-band condition where no spectral information was available. The data also revealed that, with primarily temporal cues, the falling-rising tone (tone 3) and the falling tone (tone 4) were more easily recognized than the flat tone (tone 1) and the rising tone (tone 2). This differential pattern in tone recognition resulted in a similar pattern in word recognition: words having either tone 3 or 4 were more likely to be recognized while words having tone 1 and 2 were not. The quantitative role of tones in Chinese speech recognition was further explored using a power-function model and found to play a significant role in relating phoneme recognition to sentence recognition.

Mesh:

Year:  1998        PMID: 9670541     DOI: 10.1121/1.423251

Source DB:  PubMed          Journal:  J Acoust Soc Am        ISSN: 0001-4966            Impact factor:   1.840


  77 in total

1.  Features of stimulation affecting tonal-speech perception: implications for cochlear prostheses.

Authors:  Li Xu; Yuhjung Tsai; Bryan E Pfingst
Journal:  J Acoust Soc Am       Date:  2002-07       Impact factor: 1.840

2.  Individual variability in cue-weighting and lexical tone learning.

Authors:  Bharath Chandrasekaran; Padma D Sampath; Patrick C M Wong
Journal:  J Acoust Soc Am       Date:  2010-07       Impact factor: 1.840

3.  The role of spectral and temporal cues in voice gender discrimination by normal-hearing listeners and cochlear implant users.

Authors:  Qian-Jie Fu; Sherol Chinchilla; John J Galvin
Journal:  J Assoc Res Otolaryngol       Date:  2004-05-20

4.  Effects of stimulation rate, mode and level on modulation detection by cochlear implant users.

Authors:  John J Galvin; Qian-Jie Fu
Journal:  J Assoc Res Otolaryngol       Date:  2005-09

5.  Mismatch negativity to pitch contours is influenced by language experience.

Authors:  Bharath Chandrasekaran; Ananthanarayan Krishnan; Jackson T Gandour
Journal:  Brain Res       Date:  2006-11-27       Impact factor: 3.252

6.  [Experiments on prosody perception with cochlear implants].

Authors:  H Meister; D Tepeli; P Wagner; W Hess; M Walger; H von Wedel; R Lang-Roth
Journal:  HNO       Date:  2007-04       Impact factor: 1.284

7.  Electromotile hearing: acoustic tones mask psychophysical response to high-frequency electrical stimulation of intact guinea pig cochleae.

Authors:  Colleen G Le Prell; Kohei Kawamoto; Yehoash Raphael; David F Dolan
Journal:  J Acoust Soc Am       Date:  2006-12       Impact factor: 1.840

8.  Speech recognition and temporal amplitude modulation processing by Mandarin-speaking cochlear implant users.

Authors:  Xin Luo; Qian-Jie Fu; Chao-Gang Wei; Ke-Li Cao
Journal:  Ear Hear       Date:  2008-12       Impact factor: 3.570

9.  Lexical tone recognition in noise in normal-hearing children and prelingually deafened children with cochlear implants.

Authors:  Yitao Mao; Li Xu
Journal:  Int J Audiol       Date:  2016-08-26       Impact factor: 2.117

10.  Melodic pitch perception and lexical tone perception in Mandarin-speaking cochlear implant users.

Authors:  Duoduo Tao; Rui Deng; Ye Jiang; John J Galvin; Qian-Jie Fu; Bing Chen
Journal:  Ear Hear       Date:  2015-01       Impact factor: 3.570

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.