Literature DB >> 21411405

Text string detection from natural scenes by structure-based partition and grouping.

Chucai Yi1, YingLi Tian.   

Abstract

Text information in natural scene images serves as important clues for many image-based applications such as scene understanding, content-based image retrieval, assistive navigation, and automatic geocoding. However, locating text from a complex background with multiple colors is a challenging task. In this paper, we explore a new framework to detect text strings with arbitrary orientations in complex natural scene images. Our proposed framework of text string detection consists of two steps: 1) image partition to find text character candidates based on local gradient features and color uniformity of character components and 2) character candidate grouping to detect text strings based on joint structural features of text characters in each text string such as character size differences, distances between neighboring characters, and character alignment. By assuming that a text string has at least three characters, we propose two algorithms of text string detection: 1) adjacent character grouping method and 2) text line grouping method. The adjacent character grouping method calculates the sibling groups of each character candidate as string segments and then merges the intersecting sibling groups into text string. The text line grouping method performs Hough transform to fit text line among the centroids of text candidates. Each fitted text line describes the orientation of a potential text string. The detected text string is presented by a rectangle region covering all characters whose centroids are cascaded in its text line. To improve efficiency and accuracy, our algorithms are carried out in multi-scales. The proposed methods outperform the state-of-the-art results on the public Robust Reading Dataset, which contains text only in horizontal orientation. Furthermore, the effectiveness of our methods to detect text strings with arbitrary orientations is evaluated on the Oriented Scene Text Dataset collected by ourselves containing text strings in nonhorizontal orientations.

Entities:  

Year:  2011        PMID: 21411405      PMCID: PMC3337634          DOI: 10.1109/TIP.2011.2126586

Source DB:  PubMed          Journal:  IEEE Trans Image Process        ISSN: 1057-7149            Impact factor:   10.856


  5 in total

1.  Automatic detection and recognition of signs from natural scenes.

Authors:  Xilin Chen; Jie Yang; Jing Zhang; Alex Waibel
Journal:  IEEE Trans Image Process       Date:  2004-01       Impact factor: 10.856

2.  Geometric rectification of camera-captured document images.

Authors:  Jian Liang; Daniel DeMenthon; David Doermann
Journal:  IEEE Trans Pattern Anal Mach Intell       Date:  2008-04       Impact factor: 6.226

3.  Morphological text extraction from images.

Authors:  Y Y Hasan; L J Karam
Journal:  IEEE Trans Image Process       Date:  2000       Impact factor: 10.856

4.  Text extraction and document image segmentation using matched wavelets and MRF model.

Authors:  Sunil Kumar; Rajat Gupta; Nitin Khanna; Santanu Chaudhury; Shiv Dutt Joshi
Journal:  IEEE Trans Image Process       Date:  2007-08       Impact factor: 10.856

5.  A new approach for overlay text detection and extraction from complex video scene.

Authors:  Wonjun Kim; Changick Kim
Journal:  IEEE Trans Image Process       Date:  2008-12-16       Impact factor: 10.856

  5 in total
  5 in total

1.  Text Extraction from Scene Images by Character Appearance and Structure Modeling.

Authors:  Chucai Yi; Yingli Tian
Journal:  Comput Vis Image Underst       Date:  2013-02-01       Impact factor: 3.876

2.  Text Detection in Natural Scene Images by Stroke Gabor Words.

Authors:  Chucai Yi; Yingli Tian
Journal:  Proc Int Conf Doc Anal Recognit       Date:  2011

3.  DeTEXT: A Database for Evaluating Text Extraction from Biomedical Literature Figures.

Authors:  Xu-Cheng Yin; Chun Yang; Wei-Yi Pei; Haixia Man; Jun Zhang; Erik Learned-Miller; Hong Yu
Journal:  PLoS One       Date:  2015-05-07       Impact factor: 3.240

4.  Scene text detection via extremal region based double threshold convolutional network classification.

Authors:  Wei Zhu; Jing Lou; Longtao Chen; Qingyuan Xia; Mingwu Ren
Journal:  PLoS One       Date:  2017-08-18       Impact factor: 3.240

5.  Rotation-invariant features for multi-oriented text detection in natural images.

Authors:  Cong Yao; Xin Zhang; Xiang Bai; Wenyu Liu; Yi Ma; Zhuowen Tu
Journal:  PLoS One       Date:  2013-08-05       Impact factor: 3.240

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.