Multistream articulatory feature-based models for visual speech recognition.

Kate Saenko, Karen Livescu, James Glass, Trevor Darrell.

Abstract

We study the problem of automatic visual speech recognition (VSR) using dynamic Bayesian network (DBN)-based models consisting of multiple sequences of hidden states, each corresponding to an articulatory feature (AF) such as lip opening (LO) or lip rounding (LR). A bank of discriminative articulatory feature classifiers provides input to the DBN, in the form of either virtual evidence (VE) (scaled likelihoods) or raw classifier margin outputs. We present experiments on two tasks, a medium-vocabulary word-ranking task and a small-vocabulary phrase recognition task. We show that articulatory feature-based models outperform baseline models, and we study several aspects of the models, such as the effects of allowing articulatory asynchrony, of using dictionary-based versus whole-word models, and of incorporating classifier outputs via virtual evidence versus alternative observation models.

Year: 2009
PMID: 19574628
DOI: 10.1109/TPAMI.2008.303

Source DB: PubMed
Journal: IEEE Trans Pattern Anal Mach Intell
ISSN: 0098-5589
Impact factor: 6.226

Cited

1 in total

1. Articulatory distinctiveness of vowels and consonants: a data-driven approach.

Authors: Jun Wang; Jordan R Green; Ashok Samal; Yana Yunusova
Journal: J Speech Lang Hear Res
Date: 2013-07-09
Impact factor: 2.297
