Literature DB >> 30183621

Temporal Segment Networks for Action Recognition in Videos.

Limin Wang, Yuanjun Xiong, Zhe Wang, Yu Qiao, Dahua Lin, Xiaoou Tang, Luc Van Gool.   

Abstract

We present a general and flexible video-level framework for learning action models in videos. This method, called temporal segment network (TSN), aims to model long-range temporal structure with a new segment-based sampling and aggregation scheme. This unique design enables the TSN framework to efficiently learn action models by using the whole video. The learned models could be easily deployed for action recognition in both trimmed and untrimmed videos with simple average pooling and multi-scale temporal window integration, respectively. We also study a series of good practices for the implementation of the TSN framework given limited training samples. Our approach obtains the state-the-of-art performance on five challenging action recognition benchmarks: HMDB51 (71.0 percent), UCF101 (94.9 percent), THUMOS14 (80.1 percent), ActivityNet v1.2 (89.6 percent), and Kinetics400 (75.7 percent). In addition, using the proposed RGB difference as a simple motion representation, our method can still achieve competitive accuracy on UCF101 (91.0 percent) while running at 340 FPS. Furthermore, based on the proposed TSN framework, we won the video classification track at the ActivityNet challenge 2016 among 24 teams.

Entities:  

Year:  2018        PMID: 30183621     DOI: 10.1109/TPAMI.2018.2868668

Source DB:  PubMed          Journal:  IEEE Trans Pattern Anal Mach Intell        ISSN: 0098-5589            Impact factor:   6.226


  6 in total

1.  Spatio-temporal prediction and reconstruction network for video anomaly detection.

Authors:  Ting Liu; Chengqing Zhang; Xiaodong Niu; Liming Wang
Journal:  PLoS One       Date:  2022-05-26       Impact factor: 3.752

2.  STA-TSN: Spatial-Temporal Attention Temporal Segment Network for action recognition in video.

Authors:  Guoan Yang; Yong Yang; Zhengzhi Lu; Junjie Yang; Deyang Liu; Chuanbo Zhou; Zien Fan
Journal:  PLoS One       Date:  2022-03-17       Impact factor: 3.240

3.  State-of-the-art violence detection techniques in video surveillance security systems: a systematic review.

Authors:  Batyrkhan Omarov; Sergazi Narynov; Zhandos Zhumanov; Aidana Gumar; Mariyam Khassanova
Journal:  PeerJ Comput Sci       Date:  2022-04-06

Review 4.  A Comprehensive Review of Recent Deep Learning Techniques for Human Activity Recognition.

Authors:  Viet-Tuan Le; Kiet Tran-Trung; Vinh Truong Hoang
Journal:  Comput Intell Neurosci       Date:  2022-04-20

Review 5.  A Comprehensive Review on Temporal-Action Proposal Generation.

Authors:  Sorn Sooksatra; Sitapa Watcharapinchai
Journal:  J Imaging       Date:  2022-07-23

6.  Can a humanoid social robot stimulate the interactivity of cognitively impaired elderly? A thorough study based on computer vision methods.

Authors:  Gauri Tulsulkar; Nidhi Mishra; Nadia Magnenat Thalmann; Hwee Er Lim; Mei Ping Lee; Siok Khoong Cheng
Journal:  Vis Comput       Date:  2021-07-30       Impact factor: 2.601

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.