Literature DB >> 30235112

Learning Rotation-Invariant and Fisher Discriminative Convolutional Neural Networks for Object Detection.

Gong Cheng, Junwei Han, Peicheng Zhou, Dong Xu.   

Abstract

The performance of object detection has recently been significantly improved due to the powerful features learnt through convolutional neural networks (CNNs). Despite the remarkable success, there are still several major challenges in object detection, including object rotation, within-class diversity, and between-class similarity, which generally degenerate object detection performance. To address these issues, we build up the existing state-of-the-art object detection systems and propose a simple but effective method to train rotation-invariant and Fisher discriminative CNN models to further boost object detection performance. This is achieved by optimizing a new objective function that explicitly imposes a rotation-invariant regularizer and a Fisher discrimination regularizer on the CNN features. Specifically, the first regularizer enforces the CNN feature representations of the training samples before and after rotation to be mapped closely to each other in order to achieve rotation-invariance. The second regularizer constrains the CNN features to have small within-class scatter but large between-class separation. We implement our proposed method under four popular object detection frameworks, including region-CNN (R-CNN), Fast R- CNN, Faster R- CNN, and R- FCN. In the experiments, we comprehensively evaluate the proposed method on the PASCAL VOC 2007 and 2012 data sets and a publicly available aerial image data set. Our proposed methods outperform the existing baseline methods and achieve the state-of-the-art results.

Year:  2019        PMID: 30235112     DOI: 10.1109/TIP.2018.2867198

Source DB:  PubMed          Journal:  IEEE Trans Image Process        ISSN: 1057-7149            Impact factor:   10.856


  9 in total

1.  A feature fusion deep-projection convolution neural network for vehicle detection in aerial images.

Authors:  Bin Wang; Bin Xu
Journal:  PLoS One       Date:  2021-05-07       Impact factor: 3.240

2.  Two-Way Affective Modeling for Hidden Movie Highlights' Extraction.

Authors:  Zheng Wang; Xinyu Yan; Wei Jiang; Meijun Sun
Journal:  Sensors (Basel)       Date:  2018-12-03       Impact factor: 3.576

3.  Applying single-image super-resolution to enhancment of deep-water bathymetry.

Authors:  Kristen Nock; David Bonanno; Paul Elmore; Leslie Smith; Vicki Ferrini; Fred Petry
Journal:  Heliyon       Date:  2019-10-21

4.  Machine Learning-Based Fast Banknote Serial Number Recognition Using Knowledge Distillation and Bayesian Optimization.

Authors:  Eunjeong Choi; Somi Chae; Jeongtae Kim
Journal:  Sensors (Basel)       Date:  2019-09-28       Impact factor: 3.576

5.  ULN: An efficient face recognition method for person wearing a mask.

Authors:  Hongtao Lu; Zijun Zhuang
Journal:  Multimed Tools Appl       Date:  2022-08-12       Impact factor: 2.577

6.  COVID-19 Infection Segmentation and Severity Assessment Using a Self-Supervised Learning Approach.

Authors:  Yao Song; Jun Liu; Xinghua Liu; Jinshan Tang
Journal:  Diagnostics (Basel)       Date:  2022-07-26

7.  A comparative study of the effectiveness of using popular DNN object detection algorithms for pith detection in cross-sectional images of parawood.

Authors:  Wattanapong Kurdthongmee
Journal:  Heliyon       Date:  2020-02-28

8.  A multi-branch separable convolution neural network for pedestrian attribute recognition.

Authors:  Imran N Junejo; Naveed Ahmed
Journal:  Heliyon       Date:  2020-03-17

9.  A Real-Time Automatic Plate Recognition System Based on Optical Character Recognition and Wireless Sensor Networks for ITS.

Authors:  Nicole do Vale Dalarmelina; Marcio Andrey Teixeira; Rodolfo I Meneguette
Journal:  Sensors (Basel)       Date:  2019-12-20       Impact factor: 3.576

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.