Literature DB >> 33540809

Improved Action Recognition with Separable Spatio-Temporal Attention Using Alternative Skeletal and Video Pre-Processing.

Pau Climent-Pérez1, Francisco Florez-Revuelta1.   

Abstract

The potential benefits of recognising activities of daily living from video for active and assisted living have yet to be fully untapped. These technologies can be used for behaviour understanding, and lifelogging for caregivers and end users alike. The recent publication of realistic datasets for this purpose, such as the Toyota Smarthomes dataset, calls for pushing forward the efforts to improve action recognition. Using the separable spatio-temporal attention network proposed in the literature, this paper introduces a view-invariant normalisation of skeletal pose data and full activity crops for RGB data, which improve the baseline results by 9.5% (on the cross-subject experiments), outperforming state-of-the-art techniques in this field when using the original unmodified skeletal data in dataset. Our code and data are available online.

Entities:  

Keywords:  action recognition; active and assisted living; computer vision; deep learning; inflated convolutional neural networks; spatio-temporal attention

Mesh:

Year:  2021        PMID: 33540809      PMCID: PMC7867344          DOI: 10.3390/s21031005

Source DB:  PubMed          Journal:  Sensors (Basel)        ISSN: 1424-8220            Impact factor:   3.576


  5 in total

1.  Efficient human pose estimation from single depth images.

Authors:  Jamie Shotton; Ross Girshick; Andrew Fitzgibbon; Toby Sharp; Mat Cook; Mark Finocchio; Richard Moore; Pushmeet Kohli; Antonio Criminisi; Alex Kipman; Andrew Blake
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2013-12       Impact factor: 6.226

2.  NTU RGB+D 120: A Large-Scale Benchmark for 3D Human Activity Understanding.

Authors:  Jun Liu; Amir Shahroudy; Mauricio Perez; Gang Wang; Ling-Yu Duan; Alex C Kot
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2019-05-14       Impact factor: 6.226

3.  OpenPose: Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields.

Authors:  Zhe Cao; Gines Hidalgo Martinez; Tomas Simon; Shih-En Wei; Yaser A Sheikh
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2019-07-17       Impact factor: 6.226

4.  LCR-Net++: Multi-Person 2D and 3D Pose Detection in Natural Images.

Authors:  Gregory Rogez; Philippe Weinzaepfel; Cordelia Schmid
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2019-01-14       Impact factor: 6.226

5.  Actions as space-time shapes.

Authors:  Lena Gorelick; Moshe Blank; Eli Shechtman; Michal Irani; Ronen Basri
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2007-12       Impact factor: 6.226

  5 in total
  2 in total

1.  Evaluating Automatic Body Orientation Detection for Indoor Location from Skeleton Tracking Data to Detect Socially Occupied Spaces Using the Kinect v2, Azure Kinect and Zed 2i.

Authors:  Violeta Ana Luz Sosa-León; Angela Schwering
Journal:  Sensors (Basel)       Date:  2022-05-17       Impact factor: 3.847

2.  Recognition Method of Wushu Human Complex Movement Based on Bone Point Feature.

Authors:  Anping Li; Ruijie Zhang; Lingrong Tao
Journal:  Comput Math Methods Med       Date:  2022-04-21       Impact factor: 2.809

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.