Literature DB >> 31944975

View-invariant Deep Architecture for Human Action Recognition using Two-stream Motion and Shape Temporal Dynamics.

Chhavi Dhiman, Dinesh Kumar Vishwakarma.   

Abstract

Human action Recognition for unknown views, is a challenging task. We propose a deep view-invariant human action recognition framework, which is a novel integration of two important action cues: motion and shape temporal dynamics (STD). The motion stream encapsulates the motion content of action as RGB Dynamic Images (RGB-DIs), which are generated by Approximate Rank Pooling (ARP) and processed by using finetuned InceptionV3 model. The STD stream learns long-term view-invariant shape dynamics of action using a sequence of LSTM and Bi-LSTM learning models. Human Pose Model (HPM) generates view-invariant features of structural similarity index matrix (SSIM) based key depth human pose frames. The final prediction of the action is made on the basis of three types of late fusion techniques i.e. maximum (max), average (avg) and multiply (mul), applied on individual stream scores. To validate the performance of the proposed novel framework, the experiments are performed using both cross-subject and cross-view validation schemes on three publically available benchmarks- NUCLA multi-view dataset, UWA3D-II Activity dataset and NTU RGB-D Activity dataset. Our algorithm outperforms existing state-of-the-arts significantly, which is measured in terms of recognition accuracy, receiver operating characteristic (ROC) curve and area under the curve (AUC).

Entities:  

Year:  2020        PMID: 31944975     DOI: 10.1109/TIP.2020.2965299

Source DB:  PubMed          Journal:  IEEE Trans Image Process        ISSN: 1057-7149            Impact factor:   10.856


  2 in total

1.  Classification and detection of COVID-19 X-Ray images based on DenseNet and VGG16 feature fusion.

Authors:  Lingzhi Kong; Jinyong Cheng
Journal:  Biomed Signal Process Control       Date:  2022-05-08       Impact factor: 5.076

2.  Intelligent Correction Method of Shooting Action Based on Computer Vision.

Authors:  Bo Li; Lei Wang; Hao Feng
Journal:  Comput Intell Neurosci       Date:  2022-07-11
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.