Literature DB >> 24051735

Temporal localization of actions with actoms.

Adrien Gaidon1, Zaid Harchaoui, Cordelia Schmid.   

Abstract

We address the problem of localizing actions, such as opening a door, in hours of challenging video data. We propose a model based on a sequence of atomic action units, termed "actoms," that are semantically meaningful and characteristic for the action. Our actom sequence model (ASM) represents an action as a sequence of histograms of actom-anchored visual features, which can be seen as a temporally structured extension of the bag-of-features. Training requires the annotation of actoms for action examples. At test time, actoms are localized automatically based on a nonparametric model of the distribution of actoms, which also acts as a prior on an action's temporal structure. We present experimental results on two recent benchmarks for action localization "Coffee and Cigarettes" and the "DLSBP" dataset. We also adapt our approach to a classification-by-localization set-up and demonstrate its applicability on the challenging "Hollywood 2" dataset. We show that our ASM method outperforms the current state of the art in temporal action localization, as well as baselines that localize actions with a sliding window method.

Mesh:

Year:  2013        PMID: 24051735     DOI: 10.1109/TPAMI.2013.65

Source DB:  PubMed          Journal:  IEEE Trans Pattern Anal Mach Intell        ISSN: 0098-5589            Impact factor:   6.226


  6 in total

1.  Segment-Tube: Spatio-Temporal Action Localization in Untrimmed Videos with Per-Frame Segmentation.

Authors:  Le Wang; Xuhuan Duan; Qilin Zhang; Zhenxing Niu; Gang Hua; Nanning Zheng
Journal:  Sensors (Basel)       Date:  2018-05-22       Impact factor: 3.576

2.  An Unsupervised Framework for Online Spatiotemporal Detection of Activities of Daily Living by Hierarchical Activity Models.

Authors:  Farhood Negin; François Brémond
Journal:  Sensors (Basel)       Date:  2019-09-29       Impact factor: 3.576

3.  Self-Supervised Learning to Detect Key Frames in Videos.

Authors:  Xiang Yan; Syed Zulqarnain Gilani; Mingtao Feng; Liang Zhang; Hanlin Qin; Ajmal Mian
Journal:  Sensors (Basel)       Date:  2020-12-04       Impact factor: 3.576

4.  Convolutional-de-convolutional neural networks for recognition of surgical workflow.

Authors:  Yu-Wen Chen; Ju Zhang; Peng Wang; Zheng-Yu Hu; Kun-Hua Zhong
Journal:  Front Comput Neurosci       Date:  2022-09-07       Impact factor: 3.387

5.  Horizontal Review on Video Surveillance for Smart Cities: Edge Devices, Applications, Datasets, and Future Trends.

Authors:  Mostafa Ahmed Ezzat; Mohamed A Abd El Ghany; Sultan Almotairi; Mohammed A-M Salem
Journal:  Sensors (Basel)       Date:  2021-05-06       Impact factor: 3.576

6.  Action Recognition by an Attention-Aware Temporal Weighted Convolutional Neural Network.

Authors:  Le Wang; Jinliang Zang; Qilin Zhang; Zhenxing Niu; Gang Hua; Nanning Zheng
Journal:  Sensors (Basel)       Date:  2018-06-21       Impact factor: 3.576

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.