Literature DB >> 18276966

Geometric rectification of camera-captured document images.

Jian Liang1, Daniel DeMenthon, David Doermann.   

Abstract

Compared to typical scanners, handheld cameras offer convenient, flexible, portable, and non-contact image capture, which enables many new applications and breathes new life into existing ones. However, camera-captured documents may suffer from distortions caused by non-planar document shape and perspective projection, which lead to failure of current OCR technologies. We present a geometric rectification framework for restoring the frontal-flat view of a document from a single camera-captured image. Our approach estimates 3D document shape from texture flow information obtained directly from the image without requiring additional 3D/metric data or prior camera calibration. Our framework provides a unified solution for both planar and curved documents and can be applied in many, especially mobile, camera-based document analysis applications. Experiments show that our method produces results that are significantly more OCR compatible than the original images.

Mesh:

Year:  2008        PMID: 18276966     DOI: 10.1109/TPAMI.2007.70724

Source DB:  PubMed          Journal:  IEEE Trans Pattern Anal Mach Intell        ISSN: 0098-5589            Impact factor:   6.226


  2 in total

1.  Text string detection from natural scenes by structure-based partition and grouping.

Authors:  Chucai Yi; YingLi Tian
Journal:  IEEE Trans Image Process       Date:  2011-03-14       Impact factor: 10.856

2.  Text Detection in Natural Scene Images by Stroke Gabor Words.

Authors:  Chucai Yi; Yingli Tian
Journal:  Proc Int Conf Doc Anal Recognit       Date:  2011
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.