Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 LANDMARK-BASED SPEECH RECOGNITION: REPORT OF THE 2004 JOHNS HOPKINS SUMMER WORKSHOP.

Literature DB >> 19212454

LANDMARK-BASED SPEECH RECOGNITION: REPORT OF THE 2004 JOHNS HOPKINS SUMMER WORKSHOP.

Mark Hasegawa-Johnson¹, James Baker, Sarah Borys, Ken Chen, Emily Coogan, Steven Greenberg, Amit Juneja, Katrin Kirchhoff, Karen Livescu, Srividya Mohan, Jennifer Muller, Kemal Sonmez, Tianyu Wang.
1. University of Illinois.

Abstract

Three research prototype speech recognition systems are described, all of which use recently developed methods from artificial intelligence (specifically support vector machines, dynamic Bayesian networks, and maximum entropy classification) in order to implement, in the form of an automatic speech recognizer, current theories of human speech perception and phonology (specifically landmark-based speech perception, nonlinear phonology, and articulatory phonology). All three systems begin with a high-dimensional multiframe acoustic-to-distinctive feature transformation, implemented using support vector machines trained to detect and classify acoustic phonetic landmarks. Distinctive feature probabilities estimated by the support vector machines are then integrated using one of three pronunciation models: a dynamic programming algorithm that assumes canonical pronunciation of each word, a dynamic Bayesian network implementation of articulatory phonology, or a discriminative pronunciation model trained using the methods of maximum entropy classification. Log probability scores computed by these models are then combined, using log-linear combination, with other word scores available in the lattice output of a first-pass recognizer, and the resulting combination score is used to compute a second-pass speech recognition output.

Entities: Species

Year: 2005 PMID： 19212454 PMCID： PMC2638080 DOI： 10.1109/ICASSP.2005.1415088

Source DB: PubMed Journal: Proc IEEE Int Conf Acoust Speech Signal Process ISSN： 1520-6149

2 in total

Review 1. Articulatory phonology: an overview.

Authors: C P Browman; L Goldstein
Journal: Phonetica Date: 1992 Impact factor: 1.759

2. On the role of spectral transition for speech perception.

Authors: S Furui
Journal: J Acoust Soc Am Date: 1986-10 Impact factor: 1.840

2 in total

3 in total

LANDMARK-BASED SPEECH RECOGNITION: REPORT OF THE 2004 JOHNS HOPKINS SUMMER WORKSHOP.

Review 1. Articulatory phonology: an overview.

2. On the role of spectral transition for speech perception.

1. Stop-like modification of the dental fricative /ð/: an acoustic analysis.

2. A magnetic resonance imaging-based articulatory and acoustic study of "retroflex" and "bunched" American English /r/.

3. Computational Modelling of Tone Perception Based on Direct Processing of f₀ Contours.

LANDMARK-BASED SPEECH RECOGNITION: REPORT OF THE 2004 JOHNS HOPKINS SUMMER WORKSHOP.

Review 1. Articulatory phonology: an overview.

2. On the role of spectral transition for speech perception.

1. Stop-like modification of the dental fricative /ð/: an acoustic analysis.

2. A magnetic resonance imaging-based articulatory and acoustic study of "retroflex" and "bunched" American English /r/.

3. Computational Modelling of Tone Perception Based on Direct Processing of f0 Contours.

3. Computational Modelling of Tone Perception Based on Direct Processing of f₀ Contours.