Literature DB >> 32275607

Host-Parasite: Graph LSTM-in-LSTM for Group Activity Recognition.

Xiangbo Shu, Liyan Zhang, Yunlian Sun, Jinhui Tang.   

Abstract

This article aims to tackle the problem of group activity recognition in the multiple-person scene. To model the group activity with multiple persons, most long short-term memory (LSTM)-based methods first learn the person-level action representations by several LSTMs and then integrate all the person-level action representations into the following LSTM to learn the group-level activity representation. This type of solution is a two-stage strategy, which neglects the "host-parasite" relationship between the group-level activity ("host") and person-level actions ("parasite") in spatiotemporal space. To this end, we propose a novel graph LSTM-in-LSTM (GLIL) for group activity recognition by modeling the person-level actions and the group-level activity simultaneously. GLIL is a "host-parasite" architecture, which can be seen as several person LSTMs (P-LSTMs) in the local view or a graph LSTM (G-LSTM) in the global view. Specifically, P-LSTMs model the person-level actions based on the interactions among persons. Meanwhile, G-LSTM models the group-level activity, where the person-level motion information in multiple P-LSTMs is selectively integrated and stored into G-LSTM based on their contributions to the inference of the group activity class. Furthermore, to use the person-level temporal features instead of the person-level static features as the input of GLIL, we introduce a residual LSTM with the residual connection to learn the person-level residual features, consisting of temporal features and static features. Experimental results on two public data sets illustrate the effectiveness of the proposed GLIL compared with state-of-the-art methods.

Entities:  

Mesh:

Year:  2021        PMID: 32275607     DOI: 10.1109/TNNLS.2020.2978942

Source DB:  PubMed          Journal:  IEEE Trans Neural Netw Learn Syst        ISSN: 2162-237X            Impact factor:   10.451


  4 in total

1.  Gas Concentration Prediction Based on IWOA-LSTM-CEEMDAN Residual Correction Model.

Authors:  Ningke Xu; Xiangqian Wang; Xiangrui Meng; Haoqian Chang
Journal:  Sensors (Basel)       Date:  2022-06-10       Impact factor: 3.847

2.  Multi-Perspective Representation to Part-Based Graph for Group Activity Recognition.

Authors:  Lifang Wu; Xianglong Lang; Ye Xiang; Qi Wang; Meng Tian
Journal:  Sensors (Basel)       Date:  2022-07-24       Impact factor: 3.847

Review 3.  Human Activity Recognition: Review, Taxonomy and Open Challenges.

Authors:  Muhammad Haseeb Arshad; Muhammad Bilal; Abdullah Gani
Journal:  Sensors (Basel)       Date:  2022-08-27       Impact factor: 3.847

4.  Regularity Normalization: Neuroscience-Inspired Unsupervised Attention across Neural Network Layers.

Authors:  Baihan Lin
Journal:  Entropy (Basel)       Date:  2021-12-28       Impact factor: 2.524

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.