Literature DB >> 15532666

A neural network model of the articulatory-acoustic forward mapping trained on recordings of articulatory parameters.

Christopher T Kello1, David C Plaut.   

Abstract

Three neural network models were trained on the forward mapping from articulatory positions to acoustic outputs for a single speaker of the Edinburgh multi-channel articulatory speech database. The model parameters (i.e., connection weights) were learned via the backpropagation of error signals generated by the difference between acoustic outputs of the models, and their acoustic targets. Efficacy of the trained models was assessed by subjecting the models' acoustic outputs to speech intelligibility tests. The results of these tests showed that enough phonetic information was captured by the models to support rates of word identification as high as 84%, approaching an identification rate of 92% for the actual target stimuli. These forward models could serve as one component of a data-driven articulatory synthesizer. The models also provide the first step toward building a model of spoken word acquisition and phonological development trained on real speech.

Mesh:

Year:  2004        PMID: 15532666     DOI: 10.1121/1.1715112

Source DB:  PubMed          Journal:  J Acoust Soc Am        ISSN: 0001-4966            Impact factor:   1.840


  5 in total

1.  Statistical Methods for Estimation of Direct and Differential Kinematics of the Vocal Tract.

Authors:  Adam Lammert; Louis Goldstein; Shrikanth Narayanan; Khalil Iskarous
Journal:  Speech Commun       Date:  2013-01       Impact factor: 2.017

2.  Vowel Recognition from Articulatory Position Time-Series Data.

Authors:  Jun Wang; Ashok Samal; Jordan R Green; Tom D Carrell
Journal:  Int Conf Signal Process Commun       Date:  2009-09-28

Review 3.  A case for the role of memory consolidation in speech-motor learning.

Authors:  Anne L van Zelst; F Sayako Earle
Journal:  Psychon Bull Rev       Date:  2021-02

4.  Lexical is as lexical does: computational approaches to lexical representation.

Authors:  Anna M Woollams
Journal:  Lang Cogn Neurosci       Date:  2015-04-21       Impact factor: 2.331

5.  Real-Time Control of an Articulatory-Based Speech Synthesizer for Brain Computer Interfaces.

Authors:  Florent Bocquelet; Thomas Hueber; Laurent Girin; Christophe Savariaux; Blaise Yvert
Journal:  PLoS Comput Biol       Date:  2016-11-23       Impact factor: 4.475

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.