Literature DB >> 19212454

LANDMARK-BASED SPEECH RECOGNITION: REPORT OF THE 2004 JOHNS HOPKINS SUMMER WORKSHOP.

Mark Hasegawa-Johnson1, James Baker, Sarah Borys, Ken Chen, Emily Coogan, Steven Greenberg, Amit Juneja, Katrin Kirchhoff, Karen Livescu, Srividya Mohan, Jennifer Muller, Kemal Sonmez, Tianyu Wang.   

Abstract

Three research prototype speech recognition systems are described, all of which use recently developed methods from artificial intelligence (specifically support vector machines, dynamic Bayesian networks, and maximum entropy classification) in order to implement, in the form of an automatic speech recognizer, current theories of human speech perception and phonology (specifically landmark-based speech perception, nonlinear phonology, and articulatory phonology). All three systems begin with a high-dimensional multiframe acoustic-to-distinctive feature transformation, implemented using support vector machines trained to detect and classify acoustic phonetic landmarks. Distinctive feature probabilities estimated by the support vector machines are then integrated using one of three pronunciation models: a dynamic programming algorithm that assumes canonical pronunciation of each word, a dynamic Bayesian network implementation of articulatory phonology, or a discriminative pronunciation model trained using the methods of maximum entropy classification. Log probability scores computed by these models are then combined, using log-linear combination, with other word scores available in the lattice output of a first-pass recognizer, and the resulting combination score is used to compute a second-pass speech recognition output.

Entities:  

Year:  2005        PMID: 19212454      PMCID: PMC2638080          DOI: 10.1109/ICASSP.2005.1415088

Source DB:  PubMed          Journal:  Proc IEEE Int Conf Acoust Speech Signal Process        ISSN: 1520-6149


  2 in total

Review 1.  Articulatory phonology: an overview.

Authors:  C P Browman; L Goldstein
Journal:  Phonetica       Date:  1992       Impact factor: 1.759

2.  On the role of spectral transition for speech perception.

Authors:  S Furui
Journal:  J Acoust Soc Am       Date:  1986-10       Impact factor: 1.840

  2 in total
  3 in total

1.  Stop-like modification of the dental fricative /ð/: an acoustic analysis.

Authors:  Sherry Y Zhao
Journal:  J Acoust Soc Am       Date:  2010-10       Impact factor: 1.840

2.  A magnetic resonance imaging-based articulatory and acoustic study of "retroflex" and "bunched" American English /r/.

Authors:  Xinhui Zhou; Carol Y Espy-Wilson; Suzanne Boyce; Mark Tiede; Christy Holland; Ann Choe
Journal:  J Acoust Soc Am       Date:  2008-06       Impact factor: 2.482

3.  Computational Modelling of Tone Perception Based on Direct Processing of f0 Contours.

Authors:  Yue Chen; Yingming Gao; Yi Xu
Journal:  Brain Sci       Date:  2022-03-02
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.