Literature DB >> 29563857

Cascaded Segmentation-Detection Networks for Word-Level Text Spotting.

Siyang Qin1, Roberto Manduchi1.   

Abstract

We introduce an algorithm for word-level text spotting that is able to accurately and reliably determine the bounding regions of individual words of text "in the wild". Our system is formed by the cascade of two convolutional neural networks. The first network is fully convolutional and is in charge of detecting areas containing text. This results in a very reliable but possibly inaccurate segmentation of the input image. The second network (inspired by the popular YOLO architecture) analyzes each segment produced in the first stage, and predicts oriented rectangular regions containing individual words. No post-processing (e.g. text line grouping) is necessary. With execution time of 450 ms for a 1000 × 560 image on a Titan X GPU, our system achieves good performance on the ICDAR 2013, 2015 benchmarks [2], [1].

Entities:  

Keywords:  convolutional neural network; scene text detection

Year:  2018        PMID: 29563857      PMCID: PMC5858575          DOI: 10.1109/ICDAR.2017.210

Source DB:  PubMed          Journal:  Proc Int Conf Doc Anal Recognit        ISSN: 1520-5363


  4 in total

1.  Robust Text Detection in Natural Scene Images.

Authors:  Xu-Cheng Yin; Xuwang Yin; Kaizhu Huang; Hong-Wei Hao
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2014-05       Impact factor: 6.226

2.  Text Detection and Recognition in Imagery: A Survey.

Authors:  Qixiang Ye; David Doermann
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2015-07       Impact factor: 6.226

3.  DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs.

Authors:  Liang-Chieh Chen; George Papandreou; Iasonas Kokkinos; Kevin Murphy; Alan L Yuille
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2017-04-27       Impact factor: 6.226

4.  Text-Attentional Convolutional Neural Network for Scene Text Detection.

Authors:  Tong He; Weilin Huang; Yu Qiao; Jian Yao
Journal:  IEEE Trans Image Process       Date:  2016-06       Impact factor: 10.856

  4 in total
  1 in total

1.  Scene Text Access: A Comparison of Mobile OCR Modalities for Blind Users.

Authors:  Leo Neat; Ren Peng; Siyang Qin; Roberto Manduchi
Journal:  IUI       Date:  2019-03
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.