Literature DB >> 19696446

Scene text recognition using similarity and a lexicon with sparse belief propagation.

Jerod J Weinman1, Erik Learned-Miller, Allen R Hanson.   

Abstract

Scene text recognition (STR) is the recognition of text anywhere in the environment, such as signs and storefronts. Relative to document recognition, it is challenging because of font variability, minimal language context, and uncontrolled conditions. Much information available to solve this problem is frequently ignored or used sequentially. Similarity between character images is often overlooked as useful information. Because of language priors, a recognizer may assign different labels to identical characters. Directly comparing characters to each other, rather than only a model, helps ensure that similar instances receive the same label. Lexicons improve recognition accuracy but are used post hoc. We introduce a probabilistic model for STR that integrates similarity, language properties, and lexical decision. Inference is accelerated with sparse belief propagation, a bottom-up method for shortening messages by reducing the dependency between weakly supported hypotheses. By fusing information sources in one model, we eliminate unrecoverable errors that result from sequential processing, improving accuracy. In experimental results recognizing text from images of signs in outdoor scenes, incorporating similarity reduces character recognition error by 19 percent, the lexicon reduces word recognition error by 35 percent, and sparse belief propagation reduces the lexicon words considered by 99.9 percent with a 12X speedup and no loss in accuracy.

Entities:  

Year:  2009        PMID: 19696446      PMCID: PMC3021989          DOI: 10.1109/TPAMI.2009.38

Source DB:  PubMed          Journal:  IEEE Trans Pattern Anal Mach Intell        ISSN: 0098-5589            Impact factor:   6.226


  1 in total

1.  Automatic detection and recognition of signs from natural scenes.

Authors:  Xilin Chen; Jie Yang; Jing Zhang; Alex Waibel
Journal:  IEEE Trans Image Process       Date:  2004-01       Impact factor: 10.856

  1 in total
  3 in total

1.  (Computer) Vision without Sight.

Authors:  Roberto Manduchi; James Coughlan
Journal:  Commun ACM       Date:  2012-01       Impact factor: 4.654

2.  Text Extraction from Scene Images by Character Appearance and Structure Modeling.

Authors:  Chucai Yi; Yingli Tian
Journal:  Comput Vis Image Underst       Date:  2013-02-01       Impact factor: 3.876

3.  DeTEXT: A Database for Evaluating Text Extraction from Biomedical Literature Figures.

Authors:  Xu-Cheng Yin; Chun Yang; Wei-Yi Pei; Haixia Man; Jun Zhang; Erik Learned-Miller; Hong Yu
Journal:  PLoS One       Date:  2015-05-07       Impact factor: 3.240

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.