Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Weakly Supervised Learning with Multi-Stream CNN-LSTM-HMMs to Discover Sequential Parallelism in Sign Language Videos.

Literature DB >> 30990421

Weakly Supervised Learning with Multi-Stream CNN-LSTM-HMMs to Discover Sequential Parallelism in Sign Language Videos.

Oscar Koller, Necati Cihan Camgoz, Hermann Ney, Richard Bowden.

Abstract

In this work we present a new approach to the field of weakly supervised learning in the video domain. Our method is relevant to sequence learning problems which can be split up into sub-problems that occur in parallel. Here, we experiment with sign language data. The approach exploits sequence constraints within each independent stream and combines them by explicitly imposing synchronisation points to make use of parallelism that all sub-problems share. We do this with multi-stream HMMs while adding intermediate synchronisation constraints among the streams. We embed powerful CNN-LSTM models in each HMM stream following the hybrid approach. This allows the discovery of attributes which on their own lack sufficient discriminative power to be identified. We apply the approach to the domain of sign language recognition exploiting the sequential parallelism to learn sign language, mouth shape and hand shape classifiers. We evaluate the classifiers on three publicly available benchmark data sets featuring challenging real-life sign language with over 1,000 classes, full sentence based lip-reading and articulated hand shape recognition on a fine-grained hand shape taxonomy featuring over 60 different hand shapes. We clearly outperform the state-of-the-art on all data sets and observe significantly faster convergence using the parallel alignment approach.

Year: 2019 PMID： 30990421 DOI： 10.1109/TPAMI.2019.2911077

Source DB: PubMed Journal: IEEE Trans Pattern Anal Mach Intell ISSN： 0098-5589 Impact factor: 6.226

Keyword Cloud
Cited

13 in total

1. End-to-End Sentence-Level Multi-View Lipreading Architecture with Spatial Attention Module Integrated Multiple CNNs and Cascaded Local Self-Attention-CTC.

Authors: Sanghun Jeon; Mun Sang Kim
Journal: Sensors (Basel) Date: 2022-05-09 Impact factor: 3.847

2. Cross Modal Few-Shot Contextual Transfer for Heterogenous Image Classification.

Authors: Zhikui Chen; Xu Zhang; Wei Huang; Jing Gao; Suhua Zhang
Journal: Front Neurorobot Date: 2021-05-24 Impact factor: 2.650

3. An Improved Sign Language Translation Model with Explainable Adaptations for Processing Long Sign Sentences.

Authors: Jiangbin Zheng; Zheng Zhao; Min Chen; Jing Chen; Chong Wu; Yidong Chen; Xiaodong Shi; Yiqi Tong
Journal: Comput Intell Neurosci Date: 2020-10-23

4. 3D Point-of-Intention Determination Using a Multimodal Fusion of Hand Pointing and Eye Gaze for a 3D Display.

Authors: Suparat Yeamkuan; Kosin Chamnongthai
Journal: Sensors (Basel) Date: 2021-02-06 Impact factor: 3.576

5. An Attention-Enhanced Multi-Scale and Dual Sign Language Recognition Network Based on a Graph Convolution Network.

Authors: Lu Meng; Ronghui Li
Journal: Sensors (Basel) Date: 2021-02-05 Impact factor: 3.576

6. Continuous Sign Language Recognition through a Context-Aware Generative Adversarial Network.

Authors: Ilias Papastratis; Kosmas Dimitropoulos; Petros Daras
Journal: Sensors (Basel) Date: 2021-04-01 Impact factor: 3.576

7. CNN-LSTM Hybrid Real-Time IoT-Based Cognitive Approaches for ISLR with WebRTC: Auditory Impaired Assistive Technology.

Authors: Meenu Gupta; Narina Thakur; Dhruvi Bansal; Gopal Chaudhary; Battulga Davaasambuu; Qiaozhi Hua
Journal: J Healthc Eng Date: 2022-02-21 Impact factor: 2.682