Literature DB >> 30990421

Weakly Supervised Learning with Multi-Stream CNN-LSTM-HMMs to Discover Sequential Parallelism in Sign Language Videos.

Oscar Koller, Necati Cihan Camgoz, Hermann Ney, Richard Bowden.   

Abstract

In this work we present a new approach to the field of weakly supervised learning in the video domain. Our method is relevant to sequence learning problems which can be split up into sub-problems that occur in parallel. Here, we experiment with sign language data. The approach exploits sequence constraints within each independent stream and combines them by explicitly imposing synchronisation points to make use of parallelism that all sub-problems share. We do this with multi-stream HMMs while adding intermediate synchronisation constraints among the streams. We embed powerful CNN-LSTM models in each HMM stream following the hybrid approach. This allows the discovery of attributes which on their own lack sufficient discriminative power to be identified. We apply the approach to the domain of sign language recognition exploiting the sequential parallelism to learn sign language, mouth shape and hand shape classifiers. We evaluate the classifiers on three publicly available benchmark data sets featuring challenging real-life sign language with over 1,000 classes, full sentence based lip-reading and articulated hand shape recognition on a fine-grained hand shape taxonomy featuring over 60 different hand shapes. We clearly outperform the state-of-the-art on all data sets and observe significantly faster convergence using the parallel alignment approach.

Year:  2019        PMID: 30990421     DOI: 10.1109/TPAMI.2019.2911077

Source DB:  PubMed          Journal:  IEEE Trans Pattern Anal Mach Intell        ISSN: 0098-5589            Impact factor:   6.226


  13 in total

1.  End-to-End Sentence-Level Multi-View Lipreading Architecture with Spatial Attention Module Integrated Multiple CNNs and Cascaded Local Self-Attention-CTC.

Authors:  Sanghun Jeon; Mun Sang Kim
Journal:  Sensors (Basel)       Date:  2022-05-09       Impact factor: 3.847

2.  Cross Modal Few-Shot Contextual Transfer for Heterogenous Image Classification.

Authors:  Zhikui Chen; Xu Zhang; Wei Huang; Jing Gao; Suhua Zhang
Journal:  Front Neurorobot       Date:  2021-05-24       Impact factor: 2.650

3.  An Improved Sign Language Translation Model with Explainable Adaptations for Processing Long Sign Sentences.

Authors:  Jiangbin Zheng; Zheng Zhao; Min Chen; Jing Chen; Chong Wu; Yidong Chen; Xiaodong Shi; Yiqi Tong
Journal:  Comput Intell Neurosci       Date:  2020-10-23

4.  3D Point-of-Intention Determination Using a Multimodal Fusion of Hand Pointing and Eye Gaze for a 3D Display.

Authors:  Suparat Yeamkuan; Kosin Chamnongthai
Journal:  Sensors (Basel)       Date:  2021-02-06       Impact factor: 3.576

5.  An Attention-Enhanced Multi-Scale and Dual Sign Language Recognition Network Based on a Graph Convolution Network.

Authors:  Lu Meng; Ronghui Li
Journal:  Sensors (Basel)       Date:  2021-02-05       Impact factor: 3.576

6.  Continuous Sign Language Recognition through a Context-Aware Generative Adversarial Network.

Authors:  Ilias Papastratis; Kosmas Dimitropoulos; Petros Daras
Journal:  Sensors (Basel)       Date:  2021-04-01       Impact factor: 3.576

7.  CNN-LSTM Hybrid Real-Time IoT-Based Cognitive Approaches for ISLR with WebRTC: Auditory Impaired Assistive Technology.

Authors:  Meenu Gupta; Narina Thakur; Dhruvi Bansal; Gopal Chaudhary; Battulga Davaasambuu; Qiaozhi Hua
Journal:  J Healthc Eng       Date:  2022-02-21       Impact factor: 2.682

8.  Context-Aware Automatic Sign Language Video Transcription in Psychiatric Interviews.

Authors:  Erion-Vasilis Pikoulis; Aristeidis Bifis; Maria Trigka; Constantinos Constantinopoulos; Dimitrios Kosmopoulos
Journal:  Sensors (Basel)       Date:  2022-03-30       Impact factor: 3.576

9.  Recognition of Signed Expressions in an Experimental System Supporting Deaf Clients in the City Office.

Authors:  Tomasz Kapuscinski; Marian Wysocki
Journal:  Sensors (Basel)       Date:  2020-04-13       Impact factor: 3.576

10.  Recognition of Non-Manual Content in Continuous Japanese Sign Language.

Authors:  Heike Brock; Iva Farag; Kazuhiro Nakadai
Journal:  Sensors (Basel)       Date:  2020-10-01       Impact factor: 3.576

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.