Tone recognition of continuous Mandarin speech assisted with prosodic information.

Y R Wang, S H Chen.

Abstract

In this paper, a simple recurrent neural network (SRNN) is employed to model the prosody of continuous Mandarin speech to assist tone recognition. For each syllable in continuous speech, several acoustic features carrying prosodic information are extracted and taken as inputs to the SRNN. If proper linguistic features extracted from the context of the syllable are set as output targets, the SRNN can learn to represent the prosodic state of the utterance at the syllable using its hidden nodes. Outputs of the hidden nodes then serve as additional recognition features to assist recognition of the tone of the syllable. The performance of the proposed tone recognition approach was examined by simulation on a multilayer perceptron (MLP)-based speaker-dependent tone recognition task. The recognition rate was improved from 91.38% to 93.10%. The SRNN prosodic model is further analyzed to explore the linguistic meaning of prosodic states. By vector quantizing the outputs of the hidden nodes of the SRNN, a finite-state automaton that roughly represents the mechanism of human prosody pronunciation can be obtained.
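
The pipeline described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes an Elman-style recurrence with a tanh activation, random untrained weights, random placeholder features in place of real prosodic measurements, and a codebook sampled from the hidden states in place of a trained vector quantizer. All names and sizes (`srnn_hidden_states`, `quantize`, `transition_counts`, `T`, `D`, `H`, `K`) are hypothetical.

```python
import numpy as np

def srnn_hidden_states(features, w_in, w_rec, bias):
    """Run an Elman-style simple recurrent network over a syllable sequence.

    features: (T, D) array, one row of prosodic features per syllable.
    Returns a (T, H) array of hidden-node outputs, one row per syllable.
    """
    hidden = np.zeros(w_rec.shape[0])
    outputs = []
    for x in features:
        # The new hidden state depends on the current input and the previous state.
        hidden = np.tanh(w_in @ x + w_rec @ hidden + bias)
        outputs.append(hidden)
    return np.array(outputs)

def quantize(hidden_states, codebook):
    """Map each hidden vector to the index of its nearest codeword."""
    diffs = hidden_states[:, None, :] - codebook[None, :, :]
    return np.linalg.norm(diffs, axis=2).argmin(axis=1)

def transition_counts(states, n_states):
    """Count state-to-state transitions, a rough finite-state view of the sequence."""
    counts = np.zeros((n_states, n_states), dtype=int)
    for src, dst in zip(states[:-1], states[1:]):
        counts[src, dst] += 1
    return counts

rng = np.random.default_rng(0)
T, D, H, K = 12, 4, 3, 4                  # assumed sizes, not from the paper
features = rng.normal(size=(T, D))        # placeholder data, not real speech
w_in = rng.normal(scale=0.5, size=(H, D))   # untrained weights (assumption)
w_rec = rng.normal(scale=0.5, size=(H, H))
bias = np.zeros(H)

hidden = srnn_hidden_states(features, w_in, w_rec, bias)

# Hidden-node outputs appended as extra inputs for a downstream tone classifier.
tone_inputs = np.hstack([features, hidden])

# Stand-in codebook: K hidden vectors sampled as codewords (assumption).
codebook = hidden[rng.choice(T, size=K, replace=False)]
states = quantize(hidden, codebook)
fsa = transition_counts(states, K)
```

In a real system the weights would be trained against the linguistic output targets the abstract mentions, and the codebook would come from a clustering procedure; here `fsa[i, j]` simply counts how often quantized state `i` is followed by state `j`.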

Year:  1994        PMID: 7983269     DOI: 10.1121/1.411274

Source DB:  PubMed          Journal:  J Acoust Soc Am        ISSN: 0001-4966            Impact factor:   1.840


  3 in total

1.  Development and evaluation of methods for assessing tone production skills in Mandarin-speaking children with cochlear implants.

Authors:  Ning Zhou; Li Xu
Journal:  J Acoust Soc Am       Date:  2008-03       Impact factor: 1.840

2.  Lexical tone recognition with an artificial neural network.

Authors:  Ning Zhou; Wenle Zhang; Chao-Yang Lee; Li Xu
Journal:  Ear Hear       Date:  2008-06       Impact factor: 3.570

3.  Recognition of lexical tone production of children with an artificial neural network.

Authors:  Li Xu; Xiuwu Chen; Ning Zhou; Yongxin Li; Xiaoyan Zhao; Demin Han
Journal:  Acta Otolaryngol       Date:  2007-04       Impact factor: 1.494
