Literature DB >> 24808345

Data-driven hierarchical structure kernel for multiscale part-based object recognition.

Yuan F Zheng.   

Abstract

Detecting generic object categories in images and videos are a fundamental issue in computer vision. However, it faces the challenges from inter and intraclass diversity, as well as distortions caused by viewpoints, poses, deformations, and so on. To solve object variations, this paper constructs a structure kernel and proposes a multiscale part-based model incorporating the discriminative power of kernels. The structure kernel would measure the resemblance of part-based objects in three aspects: 1) the global similarity term to measure the resemblance of the global visual appearance of relevant objects; 2) the part similarity term to measure the resemblance of the visual appearance of distinctive parts; and 3) the spatial similarity term to measure the resemblance of the spatial layout of parts. In essence, the deformation of parts in the structure kernel is penalized in a multiscale space with respect to horizontal displacement, vertical displacement, and scale difference. Part similarities are combined with different weights, which are optimized efficiently to maximize the intraclass similarities and minimize the interclass similarities by the normalized stochastic gradient ascent algorithm. In addition, the parameters of the structure kernel are learned during the training process with regard to the distribution of the data in a more discriminative way. With flexible part sizes on scale and displacement, it can be more robust to the intraclass variations, poses, and viewpoints. Theoretical analysis and experimental evaluations demonstrate that the proposed multiscale part-based representation model with structure kernel exhibits accurate and robust performance, and outperforms state-of-the-art object classification approaches.

Entities:  

Year:  2014        PMID: 24808345      PMCID: PMC5330370          DOI: 10.1109/TIP.2014.2307480

Source DB:  PubMed          Journal:  IEEE Trans Image Process        ISSN: 1057-7149            Impact factor:   10.856


  16 in total

1.  Vehicle detection in aerial surveillance using dynamic Bayesian networks.

Authors:  Hsu-Yung Cheng; Chih-Chia Weng; Yi-Ying Chen
Journal:  IEEE Trans Image Process       Date:  2011-10-19       Impact factor: 10.856

2.  Object detection with DoG scale-space: a multiple kernel learning approach.

Authors:  Sharmin Nilufar; Nilanjan Ray; Hong Zhang
Journal:  IEEE Trans Image Process       Date:  2012-04-03       Impact factor: 10.856

3.  Contextual object localization with multiple kernel nearest neighbor.

Authors:  Brian McFee; Carolina Galleguillos; Gert Lanckriet
Journal:  IEEE Trans Image Process       Date:  2010-08-23       Impact factor: 10.856

4.  Object detection with discriminatively trained part-based models.

Authors:  Pedro F Felzenszwalb; Ross B Girshick; David McAllester; Deva Ramanan
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2010-09       Impact factor: 6.226

5.  Context-dependent kernels for object classification.

Authors:  Hichem Sahbi; Jean-Yves Audibert; Renaud Keriven
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2011-04       Impact factor: 6.226

6.  Multiple kernel learning for dimensionality reduction.

Authors:  Yen-Yu Lin; Tyng-Luh Liu; Chiou-Shann Fuh
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2011-06       Impact factor: 6.226

7.  Generating descriptive visual words and visual phrases for large-scale image applications.

Authors:  Shiliang Zhang; Qi Tian; Gang Hua; Qingming Huang; Wen Gao
Journal:  IEEE Trans Image Process       Date:  2011-03-17       Impact factor: 10.856

8.  Reducing the dimensionality of data with neural networks.

Authors:  G E Hinton; R R Salakhutdinov
Journal:  Science       Date:  2006-07-28       Impact factor: 47.728

9.  Semantics-preserving bag-of-words models and applications.

Authors:  Lei Wu; Steven C H Hoi; Nenghai Yu
Journal:  IEEE Trans Image Process       Date:  2010-03-11       Impact factor: 10.856

10.  Multiple-kernel, multiple-instance similarity features for efficient visual object detection.

Authors:  Chensheng Sun; Kin-Man Lam
Journal:  IEEE Trans Image Process       Date:  2013-03-28       Impact factor: 10.856

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.