Literature DB >> 33466293

SynPo-Net-Accurate and Fast CNN-Based 6DoF Object Pose Estimation Using Synthetic Training.

Yongzhi Su1, Jason Rambach2, Alain Pagani2, Didier Stricker1,2.   

Abstract

Estimation and tracking of 6DoF poses of objects in images is a challenging problem of great importance for robotic interaction and augmented reality. Recent approaches applying deep neural networks for pose estimation have shown encouraging results. However, most of them rely on training with real images of objects with severe limitations concerning ground truth pose acquisition, full coverage of possible poses, and training dataset scaling and generalization capability. This paper presents a novel approach using a Convolutional Neural Network (CNN) trained exclusively on single-channel Synthetic images of objects to regress 6DoF object Poses directly (SynPo-Net). The proposed SynPo-Net is a network architecture specifically designed for pose regression and a proposed domain adaptation scheme transforming real and synthetic images into an intermediate domain that is better fit for establishing correspondences. The extensive evaluation shows that our approach significantly outperforms the state-of-the-art using synthetic training in terms of both accuracy and speed. Our system can be used to estimate the 6DoF pose from a single frame, or be integrated into a tracking system to provide the initial pose.

Entities:  

Keywords:  6DoF object pose; 6DoF object tracking; convolutional neural networks; deep learning; domain adaptation; object pose estimation; training with synthetic images

Year:  2021        PMID: 33466293      PMCID: PMC7796199          DOI: 10.3390/s21010300

Source DB:  PubMed          Journal:  Sensors (Basel)        ISSN: 1424-8220            Impact factor:   3.576


  2 in total

1.  Gradient response maps for real-time detection of textureless objects.

Authors:  Stefan Hinterstoisser; Cedric Cagniart; Slobodan Ilic; Peter Sturm; Nassir Navab; Pascal Fua; Vincent Lepetit
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2012-05       Impact factor: 6.226

2.  SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation.

Authors:  Vijay Badrinarayanan; Alex Kendall; Roberto Cipolla
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2017-01-02       Impact factor: 6.226

  2 in total
  2 in total

1.  Generating Images with Physics-Based Rendering for an Industrial Object Detection Task: Realism versus Domain Randomization.

Authors:  Leon Eversberg; Jens Lambrecht
Journal:  Sensors (Basel)       Date:  2021-11-26       Impact factor: 3.576

Review 2.  A Survey of 6D Object Detection Based on 3D Models for Industrial Applications.

Authors:  Felix Gorschlüter; Pavel Rojtberg; Thomas Pöllabauer
Journal:  J Imaging       Date:  2022-02-24
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.