Literature DB >> 18453884

Lexical tone recognition with an artificial neural network.

Ning Zhou1, Wenle Zhang, Chao-Yang Lee, Li Xu.   

Abstract

OBJECTIVES: Tone production is particularly important for communicating in tone languages such as Mandarin Chinese. In the present study, an artificial neural network was used to recognize tones produced by adult native speakers. The purposes of the study were (1) to test the sensitivity of the neural network to speaker variation typically in adult speaker groups, (2) to evaluate two normalization procedures to overcome the effects of speaker variation, and (3) to compare tone recognition performance of the neural network with that of the human listeners.
DESIGN: A feedforward multilayer neural network was used. Twenty-nine adult native Mandarin Chinese speakers were recruited to record tone samples. The F0 contours of the vowel part of the 1044 monosyllabic words recorded were extracted using an autocorrelation method. Samples from the F0 contours were used as inputs to the neural network. The efficacy of the neural network was first tested by varying the number of inputs and the number of neurons in the hidden layer from 1 to 16. The sensitivity of the neural network to speaker variation was tested by (1) using the raw F0 data from speech tokens of a number of randomly drawn speakers that varied from 1 to 29, (2) using the raw F0 data from speech tokens of either male-only or female-only speakers, and (3) using two sets of normalized F0 data (i.e., tone 1-based normalization and first-order derivative) from speech tokens from a number of randomly drawn speakers that varied from 1 to 29. The recognition performance of the neural network under several experimental conditions was compared with the corresponding recognition performance of 10 normal-hearing, native Mandarin Chinese speaking adult listeners.
RESULTS: Three inputs and four hidden neurons were found to be sufficient for the neural network to perform at about 85% correct using speech samples without normalization. The performance of the neural network was affected by variation across speakers particularly between genders. Using the tone 1-based normalization procedure, the performance of the neural network improved significantly. The recognition accuracy of the neural network as a whole or for each tone was comparable with that of the human listeners.
CONCLUSIONS: The neural network can be used to evaluate the tone production of Mandarin Chinese speaking adults with human listener-like recognition accuracy. The tone 1-based normalization procedure improves the performance of the neural network to human listener-like accuracy. The success of our neural network in recognizing tones from multiple speakers supports its utility for evaluating tone production. Further testing of the neural network with hearing-impaired speakers might reveal its potential use for clinical evaluation of tone production.

Entities:  

Mesh:

Year:  2008        PMID: 18453884      PMCID: PMC2562432          DOI: 10.1097/AUD.0b013e3181662c42

Source DB:  PubMed          Journal:  Ear Hear        ISSN: 0196-0202            Impact factor:   3.570


  29 in total

1.  Coding of the fundamental frequency in continuous interleaved sampling processors for cochlear implants.

Authors:  L Geurts; J Wouters
Journal:  J Acoust Soc Am       Date:  2001-02       Impact factor: 1.840

2.  Cantonese tone perception ability of cochlear implant children in comparison with normal-hearing children.

Authors:  Kathy Y S Lee; C A van Hasselt; S N Chiu; Dilys M C Cheung
Journal:  Int J Pediatr Otorhinolaryngol       Date:  2002-04-25       Impact factor: 1.675

3.  Features of stimulation affecting tonal-speech perception: implications for cochlear prostheses.

Authors:  Li Xu; Yuhjung Tsai; Bryan E Pfingst
Journal:  J Acoust Soc Am       Date:  2002-07       Impact factor: 1.840

4.  Tone production of Mandarin Chinese speaking children with cochlear implants.

Authors:  Demin Han; Ning Zhou; Yongxin Li; Xiuwu Chen; Xiaoyan Zhao; Li Xu
Journal:  Int J Pediatr Otorhinolaryngol       Date:  2007-03-21       Impact factor: 1.675

5.  Development and evaluation of methods for assessing tone production skills in Mandarin-speaking children with cochlear implants.

Authors:  Ning Zhou; Li Xu
Journal:  J Acoust Soc Am       Date:  2008-03       Impact factor: 1.840

6.  Nucleus 22-channel cochlear mini-system implantations in Mandarin-speaking patients.

Authors:  T S Huang; N M Wang; S Y Liu
Journal:  Am J Otol       Date:  1996-01

7.  Chinese tonal language rehabilitation following cochlear implantation in children.

Authors:  W I Wei; R Wong; Y Hui; D K Au; B Y Wong; W K Ho; A Tsang; P Kung; E Chung
Journal:  Acta Otolaryngol       Date:  2000-03       Impact factor: 1.494

8.  Recognition of lexical tone production of children with an artificial neural network.

Authors:  Li Xu; Xiuwu Chen; Ning Zhou; Yongxin Li; Xiaoyan Zhao; Demin Han
Journal:  Acta Otolaryngol       Date:  2007-04       Impact factor: 1.494

9.  The perception of Cantonese lexical tones by early-deafened cochlear implantees.

Authors:  Valter Ciocca; Alexander L Francis; Rani Aisha; Lena Wong
Journal:  J Acoust Soc Am       Date:  2002-05       Impact factor: 1.840

10.  Spectral and temporal cues to pitch in noise-excited vocoder simulations of continuous-interleaved-sampling cochlear implants.

Authors:  Tim Green; Andrew Faulkner; Stuart Rosen
Journal:  J Acoust Soc Am       Date:  2002-11       Impact factor: 1.840

View more
  6 in total

1.  Relative contributions of temporal envelope and fine structure cues to lexical tone recognition in hearing-impaired listeners.

Authors:  Shuo Wang; Li Xu; Robert Mannell
Journal:  J Assoc Res Otolaryngol       Date:  2011-08-11

2.  Development and evaluation of methods for assessing tone production skills in Mandarin-speaking children with cochlear implants.

Authors:  Ning Zhou; Li Xu
Journal:  J Acoust Soc Am       Date:  2008-03       Impact factor: 1.840

3.  Tone perception and production in pediatric cochlear implants users.

Authors:  Li Xu; Xiuwu Chen; Hongyun Lu; Ning Zhou; Shuo Wang; Qiaoyun Liu; Yongxin Li; Xiaoyan Zhao; Demin Han
Journal:  Acta Otolaryngol       Date:  2010-12-20       Impact factor: 1.494

4.  Encoding lexical tones in jTRACE: a simulation of monosyllabic spoken word recognition in Mandarin Chinese.

Authors:  Lan Shuai; Jeffrey G Malins
Journal:  Behav Res Methods       Date:  2017-02

5.  Acoustic Assessment of Tone Production of Prelingually-Deafened Mandarin-Speaking Children With Cochlear Implants.

Authors:  Yitao Mao; Hongsheng Chen; Shumin Xie; Li Xu
Journal:  Front Neurosci       Date:  2020-11-04       Impact factor: 4.677

6.  Relationship between tone perception and production in prelingually deafened children with cochlear implants.

Authors:  Ning Zhou; Juan Huang; Xiuwu Chen; Li Xu
Journal:  Otol Neurotol       Date:  2013-04       Impact factor: 2.311

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.