
FastSS: Fast and smooth segmentation of JPEG compressed printed text documents using DC and AC signal analysis.

Bulla Rajesh, Mohammed Javed, P Nagabhushan.

Abstract

With the surge of the COVID-19 pandemic, the world is moving towards digitization and automation more than was presumed. The Internet is becoming one of the most popular mediums for communication, and multimedia (image, audio, and video) combined with data compression techniques plays a pivotal role in handling the huge volume of data generated on a daily basis. Developing novel algorithms for automatic analysis of compressed data without decompression is the need of the hour. JPEG is a popular compression algorithm, widely supported in digital electronics, that achieves compression by dividing the whole image into non-overlapping blocks of 8 × 8 pixels and subsequently transforming each block using the Discrete Cosine Transform (DCT). This research paper proposes to carry out Fast and Smooth Segmentation (FastSS) directly on JPEG compressed printed text document images at the text-line and word level using DC and AC signals. From each 8 × 8 block, the DC and AC signals are analyzed to accomplish fast and smooth segmentation; subsequently, two faster segmentation algorithms (MFastSS) are devised using low-resolution images generated by mapping the DC signal (DC Reduced Image) and the encoded DCT coefficients (ECM Image) separately. The proposed models are tested on various JPEG compressed printed text document images created with varied spacing and fonts. The experimental results demonstrate that direct analysis of compressed streams is computationally efficient, achieving a speed gain of more than 90% compared to processing in the uncompressed domain.
© The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2022.
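
The idea of segmenting text lines on a DC Reduced Image can be illustrated with a minimal sketch. This is not the authors' FastSS or MFastSS algorithm. It assumes an orthonormal 8 × 8 DCT-II with JPEG's level shift of 128, ignores quantization, and simulates the DC values from pixels, whereas a real compressed-domain method would read them from the entropy-decoded JPEG stream. The function names `dc_reduced_image` and `segment_text_lines`, the `background_dc` parameter, and the synthetic page are all hypothetical.

```python
import numpy as np

BLOCK = 8  # JPEG block size


def dc_reduced_image(gray):
    """Return one DC value per 8x8 block of a grayscale image.

    For the orthonormal 8x8 DCT-II, the DC term equals the sum of the
    level-shifted pixels divided by 8, so no full DCT is needed here.
    """
    h, w = gray.shape
    h8, w8 = h - h % BLOCK, w - w % BLOCK  # crop to whole blocks
    blocks = gray[:h8, :w8].astype(np.float64)
    blocks = blocks.reshape(h8 // BLOCK, BLOCK, w8 // BLOCK, BLOCK)
    return (blocks - 128.0).sum(axis=(1, 3)) / BLOCK


def segment_text_lines(dc, background_dc, tol=1e-6):
    """Group consecutive block rows that contain any non-background block.

    Returns a list of (first_block_row, last_block_row) pairs, inclusive.
    """
    has_ink = (np.abs(dc - background_dc) > tol).any(axis=1)
    lines, start = [], None
    for r, ink in enumerate(has_ink):
        if ink and start is None:
            start = r                     # a text line begins
        elif not ink and start is not None:
            lines.append((start, r - 1))  # a text line ends
            start = None
    if start is not None:
        lines.append((start, len(has_ink) - 1))
    return lines


# Synthetic 64x64 white page with two black bands standing in for text lines.
page = np.full((64, 64), 255, dtype=np.uint8)
page[8:16, 4:60] = 0
page[32:48, 4:60] = 0

dc = dc_reduced_image(page)  # 8x8 array, one value per block
# An all-white block has DC = (255 - 128) * 64 / 8 = 1016.
lines = segment_text_lines(dc, background_dc=1016.0)
print(lines)
```

Under the same assumptions, a word-level pass could apply the same run-grouping along the columns of each detected line. Because the DC Reduced Image has one value per 8 × 8 block, the scan touches 1/64 as many values as a pixel-level scan.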


Keywords:  DCT coefficients; Document Image processing; JPEG compressed domain; Printed text-line segmentation; Printed word segmentation

Year:  2022        PMID: 35068992      PMCID: PMC8764884          DOI: 10.1007/s11042-021-11858-0

Source DB:  PubMed          Journal:  Multimed Tools Appl        ISSN: 1380-7501            Impact factor:   2.757


