| Literature DB >> 21747737 |
Pannag Sanketi1, Huiying Shen, James M Coughlan.
Abstract
There is a growing body of work addressing the problem of localizing printed text regions occurring in natural scenes, all of it focused on images in which the text to be localized is resolved clearly enough to be read by OCR. This paper introduces an alternative approach to text localization based on the fact that it is often useful to localize text that is identifiable as text but too blurry or small to be read, for two reasons. First, an image can be decimated and processed at a coarser resolution than usual, resulting in faster localization before OCR is performed (at full resolution, if needed). Second, in real-time applications such as a cell phone app to find and read text, text may initially be acquired from a lower-resolution video image in which it appears too small to be read; once the text's presence and location have been established, a higher-resolution image can be taken in order to resolve the text clearly enough to read it.We demonstrate proof of concept of this approach by describing a novel algorithm for binarizing the image and extracting candidate text features, called "blobs," and grouping and classifying the blobs into text and non-text categories. Experimental results are shown on a variety of images in which the text is resolved too poorly to be clearly read, but is still identifiable by our algorithm as text.Entities:
Year: 2011 PMID: 21747737 PMCID: PMC3132180 DOI: 10.1109/WACV.2011.5711546
Source DB: PubMed Journal: Proc IEEE Workshop Appl Comput Vis ISSN: 1550-5790