Literature DB >> 29990080

Action Recognition with Dynamic Image Networks.

Hakan Bilen, Basura Fernando, Efstratios Gavves, Andrea Vedaldi.   

Abstract

We introduce the concept of dynamic image, a novel compact representation of videos useful for video analysis, particularly in combination with convolutional neural networks (CNNs). A dynamic image encodes temporal data such as RGB or optical flow videos by using the concept of 'rank pooling'. The idea is to learn a ranking machine that captures the temporal evolution of the data and to use the parameters of the latter as a representation. We call the resulting representation dynamic image because it summarizes the video dynamics in addition to appearance. This powerful idea allows to convert any video to an image so that existing CNN models pre-trained with still images can be immediately extended to videos. We also present an efficient approximate rank pooling operator that runs two orders of magnitude faster than the standard ones with any loss in ranking performance and can be formulated as a CNN layer. To demonstrate the power of the representation, we introduce a novel four stream CNN architecture which can learn from RGB and optical flow frames as well as from their dynamic image representations. We show that the proposed network achieves state-of-the-art performance, 95.5 and 72.5 percent accuracy, in the UCF101 and HMDB51, respectively.

Year:  2017        PMID: 29990080     DOI: 10.1109/TPAMI.2017.2769085

Source DB:  PubMed          Journal:  IEEE Trans Pattern Anal Mach Intell        ISSN: 0098-5589            Impact factor:   6.226


  4 in total

1.  Action Recognition Using Action Sequences Optimization and Two-Stream 3D Dilated Neural Network.

Authors:  Xin Xiong; Weidong Min; Qing Han; Qi Wang; Cheng Zha
Journal:  Comput Intell Neurosci       Date:  2022-06-13

2.  Weakly Supervised Violence Detection in Surveillance Video.

Authors:  David Choqueluque-Roman; Guillermo Camara-Chavez
Journal:  Sensors (Basel)       Date:  2022-06-14       Impact factor: 3.847

3.  Using a Selective Ensemble Support Vector Machine to Fuse Multimodal Features for Human Action Recognition.

Authors:  Chao Tang; Anyang Tong; Aihua Zheng; Hua Peng; Wei Li
Journal:  Comput Intell Neurosci       Date:  2022-01-10

4.  Machine Learning-Based Classification of Human Behaviors and Falls in Restroom via Dual Doppler Radar Measurements.

Authors:  Kenshi Saho; Sora Hayashi; Mutsuki Tsuyama; Lin Meng; Masao Masugi
Journal:  Sensors (Basel)       Date:  2022-02-22       Impact factor: 3.576

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.