Literature DB >> 32286981

Temporal Reasoning Graph for Activity Recognition.

Jingran Zhang, Fumin Shen, Xing Xu, Heng Tao Shen.   

Abstract

Despite great success has been achieved in activity analysis, it still has many challenges. Most existing works in activity recognition pay more attention to designing efficient architecture or video sampling strategy. However, due to the property of fine-grained action and long term structure in video, activity recognition is expected to reason temporal relation between video sequences. In this paper, we propose an efficient temporal reasoning graph (TRG) to simultaneously capture the appearance features and temporal relation between video sequences at multiple time scales. Specifically, we construct learnable temporal relation graphs to explore temporal relation on the multi-scale range. Additionally, to facilitate multi-scale temporal relation extraction, we design a multi-head temporal adjacent matrix to represent multi-kinds of temporal relations. Eventually, a multi-head temporal relation aggregator is proposed to extract the semantic meaning of those features convolving through the graphs. Extensive experiments are performed on widely-used large-scale datasets, such as Something-Something, Charades and Jester, and the results show that our model can achieve stateof- the-art performance. Further analysis shows that temporal relation reasoning with our TRG can extract discriminative features for activity recognition.

Year:  2020        PMID: 32286981     DOI: 10.1109/TIP.2020.2985219

Source DB:  PubMed          Journal:  IEEE Trans Image Process        ISSN: 1057-7149            Impact factor:   10.856


  1 in total

1.  Analysis of Volleyball Video Intelligent Description Technology Based on Computer Memory Network and Attention Mechanism.

Authors:  Zhongzi Zhang
Journal:  Comput Intell Neurosci       Date:  2021-12-28
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.