Speech production knowledge in automatic speech recognition.

Simon King, Joe Frankel, Karen Livescu, Erik McDermott, Korin Richmond, Mirjam Wester.

Abstract

Although much is known about how speech is produced, and research into speech production has resulted in measured articulatory data, feature systems of different kinds, and numerous models, speech production knowledge is almost totally ignored in current mainstream approaches to automatic speech recognition. Representations of speech production allow simple explanations for many phenomena observed in speech which cannot be easily analyzed from either acoustic signal or phonetic transcription alone. In this article, a survey of a growing body of work in which such representations are used to improve automatic speech recognition is provided.

Year: 2007    PMID: 17348495    DOI: 10.1121/1.2404622

Source DB: PubMed    Journal: J Acoust Soc Am    ISSN: 0001-4966    Impact factor: 1.840


  13 in total

1.  An Optimal Set of Flesh Points on Tongue and Lips for Speech-Movement Classification.

Authors:  Jun Wang; Ashok Samal; Panying Rong; Jordan R Green
Journal:  J Speech Lang Hear Res       Date:  2016-02       Impact factor: 2.297

2.  Co-registration of speech production datasets from electromagnetic articulography and real-time magnetic resonance imaging.

Authors:  Jangwon Kim; Adam C Lammert; Prasanta Kumar Ghosh; Shrikanth S Narayanan
Journal:  J Acoust Soc Am       Date:  2014-02       Impact factor: 1.840

3.  Recognizing Whispered Speech Produced by an Individual with Surgically Reconstructed Larynx Using Articulatory Movement Data.

Authors:  Beiming Cao; Myungjong Kim; Ted Mau; Jun Wang
Journal:  Workshop Speech Lang Process Assist Technol       Date:  2016-09

4.  Estimation of vocal fold physiology from voice acoustics using machine learning.

Authors:  Zhaoyan Zhang
Journal:  J Acoust Soc Am       Date:  2020-03       Impact factor: 1.840

5.  Vowel Recognition from Articulatory Position Time-Series Data.

Authors:  Jun Wang; Ashok Samal; Jordan R Green; Tom D Carrell
Journal:  Int Conf Signal Process Commun       Date:  2009-09-28

6.  Articulatory distinctiveness of vowels and consonants: a data-driven approach.

Authors:  Jun Wang; Jordan R Green; Ashok Samal; Yana Yunusova
Journal:  J Speech Lang Hear Res       Date:  2013-07-09       Impact factor: 2.297

7.  Speaker verification based on the fusion of speech acoustics and inverted articulatory signals.

Authors:  Ming Li; Jangwon Kim; Adam Lammert; Prasanta Kumar Ghosh; Vikram Ramanarayanan; Shrikanth Narayanan
Journal:  Comput Speech Lang       Date:  2015-05-22       Impact factor: 1.899

8.  Directly data-derived articulatory gesture-like representations retain discriminatory information about phone categories.

Authors:  Vikram Ramanarayanan; Maarten Van Segbroeck; Shrikanth S Narayanan
Journal:  Comput Speech Lang       Date:  2015-03-21       Impact factor: 1.899

9.  Perception and hierarchical dynamics.

Authors:  Stefan J Kiebel; Jean Daunizeau; Karl J Friston
Journal:  Front Neuroinform       Date:  2009-07-20       Impact factor: 4.081

10.  Modeling speech imitation and ecological learning of auditory-motor maps.

Authors:  Claudia Canevari; Leonardo Badino; Alessandro D'Ausilio; Luciano Fadiga; Giorgio Metta
Journal:  Front Psychol       Date:  2013-06-27
