Literature DB >> 33816875

Synthetic dataset generation for object-to-model deep learning in industrial applications.

Matthew Z Wong1, Kiyohito Kunii1, Max Baylis1, Wai Hong Ong1, Pavel Kroupa1, Swen Koller1.   

Abstract

The availability of large image data sets has been a crucial factor in the success of deep learning-based classification and detection methods. Yet, while data sets for everyday objects are widely available, data for specific industrial use-cases (e.g., identifying packaged products in a warehouse) remains scarce. In such cases, the data sets have to be created from scratch, placing a crucial bottleneck on the deployment of deep learning techniques in industrial applications. We present work carried out in collaboration with a leading UK online supermarket, with the aim of creating a computer vision system capable of detecting and identifying unique supermarket products in a warehouse setting. To this end, we demonstrate a framework for using data synthesis to create an end-to-end deep learning pipeline, beginning with real-world objects and culminating in a trained model. Our method is based on the generation of a synthetic dataset from 3D models obtained by applying photogrammetry techniques to real-world objects. Using 100K synthetic images for 10 classes, an InceptionV3 convolutional neural network was trained, which achieved accuracy of 96% on a separately acquired test set of real supermarket product images. The image generation process supports automatic pixel annotation. This eliminates the prohibitively expensive manual annotation typically required for detection tasks. Based on this readily available data, a one-stage RetinaNet detector was trained on the synthetic, annotated images to produce a detector that can accurately localize and classify the specimen products in real-time.
© 2019 Wong et al.

Entities:  

Keywords:  3D Modelling; Computer science applications; Convolutional neural network; Deep learning with limited data; Industrial computer vision; Photogrammetry; Synthetic data

Year:  2019        PMID: 33816875      PMCID: PMC7924434          DOI: 10.7717/peerj-cs.222

Source DB:  PubMed          Journal:  PeerJ Comput Sci        ISSN: 2376-5992


  2 in total

1.  Deep Convolutional Neural Networks for Image Classification: A Comprehensive Review.

Authors:  Waseem Rawat; Zenghui Wang
Journal:  Neural Comput       Date:  2017-06-09       Impact factor: 2.026

2.  Focal Loss for Dense Object Detection.

Authors:  Tsung-Yi Lin; Priya Goyal; Ross Girshick; Kaiming He; Piotr Dollar
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2018-07-23       Impact factor: 6.226

  2 in total
  2 in total

1.  Generating Images with Physics-Based Rendering for an Industrial Object Detection Task: Realism versus Domain Randomization.

Authors:  Leon Eversberg; Jens Lambrecht
Journal:  Sensors (Basel)       Date:  2021-11-26       Impact factor: 3.576

2.  Distribution-preserving data augmentation.

Authors:  Nurdan Ayse Saran; Murat Saran; Fatih Nar
Journal:  PeerJ Comput Sci       Date:  2021-05-27
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.