Literature DB >> 23475363

Scene text detection via connected component clustering and nontext filtering.

Hyung Il Koo1, Duck Hoon Kim.   

Abstract

In this paper, we present a new scene text detection algorithm based on two machine learning classifiers: one allows us to generate candidate word regions and the other filters out nontext ones. To be precise, we extract connected components (CCs) in images by using the maximally stable extremal region algorithm. These extracted CCs are partitioned into clusters so that we can generate candidate regions. Unlike conventional methods relying on heuristic rules in clustering, we train an AdaBoost classifier that determines the adjacency relationship and cluster CCs by using their pairwise relations. Then we normalize candidate word regions and determine whether each region contains text or not. Since the scale, skew, and color of each candidate can be estimated from CCs, we develop a text/nontext classifier for normalized images. This classifier is based on multilayer perceptrons and we can control recall and precision rates with a single free parameter. Finally, we extend our approach to exploit multichannel information. Experimental results on ICDAR 2005 and 2011 robust reading competition datasets show that our method yields the state-of-the-art performance both in speed and accuracy.

Mesh:

Year:  2013        PMID: 23475363     DOI: 10.1109/TIP.2013.2249082

Source DB:  PubMed          Journal:  IEEE Trans Image Process        ISSN: 1057-7149            Impact factor:   10.856


  3 in total

1.  Vessel identification based on automatic hull inscriptions recognition.

Authors:  Natalia Wawrzyniak; Tomasz Hyla; Izabela Bodus-Olkowska
Journal:  PLoS One       Date:  2022-07-19       Impact factor: 3.752

2.  DeTEXT: A Database for Evaluating Text Extraction from Biomedical Literature Figures.

Authors:  Xu-Cheng Yin; Chun Yang; Wei-Yi Pei; Haixia Man; Jun Zhang; Erik Learned-Miller; Hong Yu
Journal:  PLoS One       Date:  2015-05-07       Impact factor: 3.240

3.  Scene text detection via extremal region based double threshold convolutional network classification.

Authors:  Wei Zhu; Jing Lou; Longtao Chen; Qingyuan Xia; Mingwu Ren
Journal:  PLoS One       Date:  2017-08-18       Impact factor: 3.240

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.