Visual event recognition in videos by learning from Web data.

Lixin Duan, Dong Xu, Ivor Wai-Hung Tsang, Jiebo Luo.

Abstract

We propose a visual event recognition framework for consumer videos by leveraging a large amount of loosely labeled web videos (e.g., from YouTube). Observing that consumer videos generally contain large intraclass variations within the same type of events, we first propose a new method, called Aligned Space-Time Pyramid Matching (ASTPM), to measure the distance between any two video clips. Second, we propose a new transfer learning method, referred to as Adaptive Multiple Kernel Learning (A-MKL), in order to 1) fuse the information from multiple pyramid levels and features (i.e., space-time features and static SIFT features) and 2) cope with the considerable variation in feature distributions between videos from two domains (i.e., web video domain and consumer video domain). For each pyramid level and each type of local features, we first train a set of SVM classifiers based on the combined training set from two domains by using multiple base kernels from different kernel types and parameters, which are then fused with equal weights to obtain a prelearned average classifier. In A-MKL, for each event class we learn an adapted target classifier based on multiple base kernels and the prelearned average classifiers from this event class or all the event classes by minimizing both the structural risk functional and the mismatch between data distributions of two domains. Extensive experiments demonstrate the effectiveness of our proposed framework that requires only a small number of labeled consumer videos by leveraging web data. We also conduct an in-depth investigation on various aspects of the proposed method A-MKL, such as the analysis on the combination coefficients on the prelearned classifiers, the convergence of the learning algorithm, and the performance variation by using different proportions of labeled consumer videos. 
Moreover, we show that A-MKL using the prelearned classifiers from all the event classes leads to better performance than A-MKL using the prelearned classifiers only from each individual event class.
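
The abstract does not say how the "mismatch between data distributions of two domains" is measured. As a minimal sketch, assuming the mismatch is estimated with the Maximum Mean Discrepancy (MMD) under a weighted combination of Gaussian base kernels, the quantity could be computed as below. The function names, the toy feature vectors, the kernel bandwidths, and the equal kernel weights are all hypothetical and chosen only for illustration; they are not taken from the paper.

```python
import math


def gaussian_kernel(x, y, gamma):
    """Gaussian (RBF) base kernel: exp(-gamma * ||x - y||^2)."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-gamma * sq_dist)


def combined_kernel(x, y, gammas, weights):
    """Weighted sum of base kernels: sum_m d_m * k_m(x, y)."""
    return sum(d * gaussian_kernel(x, y, g) for d, g in zip(weights, gammas))


def mmd_squared(source, target, gammas, weights):
    """Biased empirical estimate of the squared MMD between two samples."""

    def mean_kernel(a_set, b_set):
        total = sum(
            combined_kernel(a, b, gammas, weights) for a in a_set for b in b_set
        )
        return total / (len(a_set) * len(b_set))

    return (
        mean_kernel(source, source)
        - 2.0 * mean_kernel(source, target)
        + mean_kernel(target, target)
    )


# Hypothetical 2-D feature vectors standing in for video-level features.
web_videos = [[0.0, 0.1], [0.2, 0.0], [0.1, 0.3]]
consumer_near = [[0.1, 0.1], [0.0, 0.2]]
consumer_far = [[2.0, 2.1], [2.2, 1.9]]

# Assumed bandwidths and equal combination weights (illustrative only).
gammas = [0.5, 1.0, 2.0]
weights = [1.0 / 3, 1.0 / 3, 1.0 / 3]

near = mmd_squared(web_videos, consumer_near, gammas, weights)
far = mmd_squared(web_videos, consumer_far, gammas, weights)
print(f"MMD^2 (similar domains):    {near:.4f}")
print(f"MMD^2 (dissimilar domains): {far:.4f}")
```

In this sketch a larger value indicates a larger gap between the web-video and consumer-video samples. A learner of the kind described in the abstract would adjust the kernel weights to keep this term small while also minimizing the classification risk; that joint optimization is not shown here.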

Year:  2012        PMID: 22201057     DOI: 10.1109/TPAMI.2011.265

Source DB:  PubMed          Journal:  IEEE Trans Pattern Anal Mach Intell        ISSN: 0162-8828            Impact factor:   6.226


