Literature DB >> 26731641

Explore Efficient Local Features from RGB-D Data for One-Shot Learning Gesture Recognition.

Jun Wan, Guodong Guo, Stan Z Li.   

Abstract

Availability of handy RGB-D sensors has brought about a surge of gesture recognition research and applications. Among various approaches, one shot learning approach is advantageous because it requires minimum amount of data. Here, we provide a thorough review about one-shot learning gesture recognition from RGB-D data and propose a novel spatiotemporal feature extracted from RGB-D data, namely mixed features around sparse keypoints (MFSK). In the review, we analyze the challenges that we are facing, and point out some future research directions which may enlighten researchers in this field. The proposed MFSK feature is robust and invariant to scale, rotation and partial occlusions. To alleviate the insufficiency of one shot training samples, we augment the training samples by artificially synthesizing versions of various temporal scales, which is beneficial for coping with gestures performed at varying speed. We evaluate the proposed method on the Chalearn gesture dataset (CGD). The results show that our approach outperforms all currently published approaches on the challenging data of CGD, such as translated, scaled and occluded subsets. When applied to the RGB-D datasets that are not one-shot (e.g., the Cornell Activity Dataset-60 and MSR Daily Activity 3D dataset), the proposed feature also produces very promising results under leave-one-out cross validation or one-shot learning.

Entities:  

Mesh:

Year:  2015        PMID: 26731641     DOI: 10.1109/TPAMI.2015.2513479

Source DB:  PubMed          Journal:  IEEE Trans Pattern Anal Mach Intell        ISSN: 0098-5589            Impact factor:   6.226


  3 in total

1.  Adaptive Local Spatiotemporal Features from RGB-D Data for One-Shot Learning Gesture Recognition.

Authors:  Jia Lin; Xiaogang Ruan; Naigong Yu; Yee-Hong Yang
Journal:  Sensors (Basel)       Date:  2016-12-17       Impact factor: 3.576

2.  CNN-LSTM Hybrid Real-Time IoT-Based Cognitive Approaches for ISLR with WebRTC: Auditory Impaired Assistive Technology.

Authors:  Meenu Gupta; Narina Thakur; Dhruvi Bansal; Gopal Chaudhary; Battulga Davaasambuu; Qiaozhi Hua
Journal:  J Healthc Eng       Date:  2022-02-21       Impact factor: 2.682

3.  A PointNet-Based Solution for 3D Hand Gesture Recognition.

Authors:  Radu Mirsu; Georgiana Simion; Catalin Daniel Caleanu; Ioana Monica Pop-Calimanu
Journal:  Sensors (Basel)       Date:  2020-06-05       Impact factor: 3.576

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.