Literature DB >> 27832092

A No-Reference Adaptive Blockiness Measure for JPEG Compressed Images.

Chaoying Tang1, Biao Wang1.   

Abstract

Digital images have been extensively used in education, research, and entertainment. Many of these images, taken by consumer cameras, are compressed by the JPEG algorithm for effective storage and transmission. Blocking artifact is a well-known problem caused by this algorithm. Effective measurement of blocking artifacts plays an important role in the design, optimization, and evaluation of image compression algorithms. In this paper, we propose a no-reference objective blockiness measure, which is adaptive to high frequency component in an image. Difference of entropies across blocks and variation of block boundary pixel values in edge images are adopted to calculate the blockiness level in areas with low and high frequency component, respectively. Extensive experimental results prove that the proposed measure is effective and stable across a wide variety of images. It is robust to image noise and can be used for real-world image quality monitoring and control. Index Terms-JPEG, no-reference, blockiness measure.

Entities:  

Mesh:

Year:  2016        PMID: 27832092      PMCID: PMC5104446          DOI: 10.1371/journal.pone.0165664

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Although digital images provide great convenience to our daily life, storing and transmitting these images are problematic. Consumer cameras can produce images with size more than 12 mega pixels. Some of them can even produce images with over 20 mega pixels. The standard JPEG algorithm installed in consumer cameras are commonly used to compress these images. JPEG is a discrete cosine transform (DCT) -based compression scheme. The input image is divided into 8×8 blocks, each block being transformed independently to DCT coefficients. The DCT coefficients are then quantized using a scalar quantization matrix. At low bitrates, the DCT-coded images generally suffer from visually annoying blocking artifacts as a result of coarse quantization. It can dramatically degrade the image quality. Over the past decades, research has been carried out to alleviate blocking effects in JPEG-compressed images. Effective measurement of blocking artifacts plays an important role in the development and evaluation of these algorithms. The most widely used point-wise comparison measures, namely mean square error (MSE) and peak signal to noise ratio (PSNR), are not very well matched to perceived visual quality [1]. A lot of effort has been put in the development of effective quality measures. In this paper, we propose a no-reference adaptive blockiness measure for JPEG compressed images. For areas with low frequency component, we use the difference of entropies across blocks as a measure. For areas with high frequency component, we use the variation of block boundary pixel values in edge images as a measure. The final index is the weighted average of the two measures. The rest of this paper is organized as follows. Section 2 gives a short review of the related work. Section 3 introduces our blockiness measure. Section 4 reports experimental results. Section 5 offers some concluding remarks.

Literature Review

Image quality assessment techniques can be divided into two classes: subjective assessment and objective assessment. In subjective assessment, a group of testers are required to rate a batch of images compressed under different ratios. Objective methods try to estimate the compression amount using mathematical techniques. They can be further divided into two types: full-reference and no-reference methods. Full-reference methods give an assessment through a comparison between reference and compressed images. They mainly utilize the sensitivity of human visual system and a combination of multiple features [2]. Wang and Bovik [3] proposed a universal image quality index which models image distortion as a combination of three factors: loss of correlation, luminance distortion, and contrast distortion. In [4], the measure was generalized to the Structural SIMilarity index (SSIM) to characterize the saturation effects of the visual system at low luminance and contrast regions and to assure numerical stability. No-reference methods directly evaluate the compressed images. They are more useful because the reference image is usually unavailable. Moorthy and Bovik [5] presented two strategies—visual fixation-based weighting, and quality-based weighting to weight image quality measurements by visual importance, and demonstrated improvements on the SSIM index in both its multi-scale and single-scale versions. Wang and Bovik [6] proposed an algorithm which evaluates blockiness by computing the FFT along the rows and columns. Pan et al. [7] proposed a method based on the fact that for images with more severe blocking artifacts, more edge pixels are arrayed horizontally and vertically. Lee et al. [8] classified blocks by analyzing block boundaries and converted boundary strength into blockiness score. Li et al. [9] presented a blockiness metric which computes the regularities of pseudo structures. In [10], Li et al. proposed a quality measure for JPEG-compressed images based on Tchebichef kernels. Golestaneh and Chandler [11] presented a measure that gets the number of zero-valued discrete cosine transform (DCT) coefficients within each block. Then it uses a quality map to represent these numbers. Freitas et. al [12] proposed a machine-learning-based method that uses the histograms of local ternary pattern (LTP) as features for the training procedure. Li et al. [13] proposed a deep-neural-network-based algorithm that extracts features using shearlet transform and evolves the features using stacked auto-encoders. Then the differences of evolved features are identified by a softmax classifier. Ren et al. [14] developed a method which uses shearlet transform to localize the distributed discontinuities induced by image degradation, and exhibits the alteration of image quality by the nature scene statistics of shearlet coefficients. A common problem with the current blockiness measures is that none of them have adaptability to image characteristics. Different areas in an image usually have different amount of high frequency component. The compression process has different influences on these areas. This property should be considered in the blockiness measure.

Proposed Algorithm

In this paper, we propose a no-reference blockiness measure which is adaptive to different areas in an image. Low frequency component in images corresponds to pixel values that change slowly over space, while high frequency component means pixel values that change rapidly. Edges in a digital image are the points at which the image brightness changes sharply or has discontinuities. Therefore sharp transients like edges represent high frequency component, while long unchanging spaces correspond to low frequency component. For the areas with low frequency component, the compression process reduces intensity variability inside a block, sometimes even generates blocks of constant gray scale, especially under high compression ratio. The block boundaries produce abrupt change of intensities across blocks. Fig 1 illustrates the influence of JPEG compression on different image areas. Fig 1(A) is an original image of a building, where the square shows an area with low frequency component (see Fig 1(B)). After compression, blocks of constant gray scale are produced (see Fig 1(C)). Comparing Fig 1(B) and 1(C), we find that before compression, the entropies of pixel values inside a block and across blocks are almost the same. However, they are quite different after compression, because of the generated block boundaries. Considering this property, we use the difference between the entropies of pixel values inside a block and across blocks to represent the compression level.
Fig 1

Illustration of the influence of JPEG compression on different image areas.

(a) is an original image of a building, where the square shows an area with low frequency component; (d) is the same original image, where the square shows an area with high frequency component; (b) and (e) are the selected image areas in (a) and (d), respectively; (c) and (f) are the JPEG compressed versions of (b) and (e) at 0.071 bpp, respectively.

Illustration of the influence of JPEG compression on different image areas.

(a) is an original image of a building, where the square shows an area with low frequency component; (d) is the same original image, where the square shows an area with high frequency component; (b) and (e) are the selected image areas in (a) and (d), respectively; (c) and (f) are the JPEG compressed versions of (b) and (e) at 0.071 bpp, respectively. For the areas with high frequency component, the entropy difference is not influenced by the compression process, since the entropy of pixel values inside a block is already high before compression (see Fig 1D–1F). However, the generated blocks do change the edge information. Discontinuities appear around block boundaries in the edge image, and the higher the compression ratio is, the larger the discontinuities are. Therefore, in this case, we use Sobel operator to get horizontal and vertical edge images first. Then we capture the difference of pixel values around block boundaries in the edge images to represent the compression level. Let I be a compressed image, and M and M be the Sobel masks defined as follows: The convolution results between I and M, M are denoted as: where * is the convolution operator, and C and C are the horizontal and vertical edge images, respectively. Then C and C are decomposed into 8×8 non-overlapping blocks, since DCT transform is performed by 8×8 blocks in JPEG compression. For the kth 8×8 block in the image, if it has edges (with high frequency component), four regions in C and C, R, R, R, and R, are considered. R and R denote the pixels just inside the horizontal and vertical boundaries, respectively. R and R denote the pixels just outside the horizontal and vertical boundaries, respectively. Fig 2A–2D illustrate these regions. We define measure s1 for the areas with high frequency component. It is calculated according to the following equations: where N and N are the number of elements in the summation (here N = 32, N = 40). N is the number of blocks in this area. The measure s1 is obtained from the luminance variance of pixels inside and outside block boundaries.
Fig 2

Illustration of the block areas involved in the blockiness measure.

Dark elements represent the regions (a) R, (b) R, (c) R, (d) R, (e) I and (f) I.

Illustration of the block areas involved in the blockiness measure.

Dark elements represent the regions (a) R, (b) R, (c) R, (d) R, (e) I and (f) I. For the tth 8×8 block in the areas without edges (with low frequency component), the regions I and I are considered. I denotes an 8×8 block, while I represents a 10×10 pixel area with the 8×8 block in the center. Fig 2E and 2F illustrate these regions. We define measure s2 for the areas with low frequency component. It is calculated according to the following equations: where En(·) is the function to compute entropy. N is the number of blocks in this area. The measure s2 is obtained from the entropy difference of pixels inside and across blocks. The final blockiness measure is defined as the weighted average of the two measures: where w (i = 1, 2) is the weight of the two measures which is determined as the total numbers of the two different kinds of blocks in an image. Take Fig 1(A) as an example. After compression, N, N, s and s are 637, 1721, 0.220 and 0.812, respectively. In the areas with low frequency component, the average values of En(I) and En(I) are 0.366 and 0.734, respectively. It can be seen that the entropy increases after compression, because of the generated block boundaries in I. The final blockiness measure s is 0.652.

Experimental Results and Discussion

Extensive experiments were conducted to evaluate the performance of the proposed blockiness measure. In the first experiment, the images “sailing2” from LIVE database [15] with different levels of blocking artifacts were adopted. The bitrates of the three compressed images are 0.3929, 0.1710 and 0.1638, respectively. The subjective evaluating results of the images are represented using difference mean opinion score (DMOS). In general, the higher DMOS an image has, the lower the image quality is. Therefore a reasonable blockiness measure should be proportional to the DMOS values. The blockiness measures given by Wang’s [6], Pan’s [7], Lee’s [8], Li’s [9] and the proposed method are summarized in Table 1. It can be seen that images No. 2 and 3 have severe blocking artifacts, and image No. 3 is a little bit more serious. Therefore, the blockiness measure of No. 3 should be larger than that of No. 2. However, in Wang’s, Pan’s and Lee’s results, the score of No. 3 is smaller. It can be seen that these measures cannot accurately evaluate images with similar DMOS values. Li’s and the proposed method achieve correct results. However, the difference between Li’s scores of No. 2 and No. 3 is too large compared with the difference between the corresponding DMOS values. The scores given by the proposed method is more consistent with DMOS values. To evaluate the entropy component in the proposed measure, we calculated measure s2 in Eq (8) for the three images and put them in the last column of Table 1. It can be seen that s2 increases monotonically with the compression ratio, which means that entropy is able to describe compression level.
Table 1

Comparison of blockiness measures for the image“sailing2” from LIVE database with different compression levels

Image No.BitratesDMOSWang [6]Pan [7]Lee [8]Li [9]proposed measures2
10.392944.8135289.050.990.5447.250.60420.7088
20.171059.21103213.291.464.4696.840.69450.9409
30.163860.08103157.941.443.7898.890.71950.9744
In the second experiment, we used six benchmark images with different themes to do evaluation. The image sizes vary from 256×256, 320×240, 512×512, to 480×720. These images were compressed under quality factors of 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, and 95, using the baseline JPEG standard in MATLAB. The corresponding bitrates can be obtained accordingly. We compared our results with a classical no-reference measure, Perra’s [2], and two full-reference measures, Qi index [3] and SSIM index [4]. Because Qi index and SSIM index give higher results for images with better quality, we use their reciprocals as blockiness indices, i.e., higher index corresponds to more serious blocking artifacts. To have a better observation, we normalize their variations to the range of 0 and 1. Fig 3 reports the experimental results. All of them have areas of different amount of high frequency component. The sensibility to the JPEG blockiness distortion was evaluated using the images compressed at different bitrates. The results show that Perra’s [2] does not change monotonically with the increase of bitrates for any of the images. In other words, its value does not represent the blockiness level in an image. Qi index has the property of monotonicity only for the image of ‘lena’ (see Fig 3(A)). For the images of ‘couple’ (Fig 3(B)), ‘bird’ (Fig 3(D)), and ‘statue’ (Fig 3(F)), the index varies drastically up and down for bitrates between 0.5 and 2.5. For the image of ‘landscape’ (Fig 3(C)) and ‘lighthouse’ (Fig 3(E)), it even increases for bitrates higher than 1. SSIM index decreases monotonically with the increase of bitrates for all the images. However, it is a full-reference measure, so it requires the original image as a reference. The proposed measure has a good property of monotonicity for all the images.
Fig 3

Relationship between normalized blockiness measures and bitrates of some benchmark images.

(a) lena, (b) couple, (c) landscape, (d) bird, (e) lighthouse, and (f) statue. For the copyright issue, the original images are not shown here.

Relationship between normalized blockiness measures and bitrates of some benchmark images.

(a) lena, (b) couple, (c) landscape, (d) bird, (e) lighthouse, and (f) statue. For the copyright issue, the original images are not shown here. In the third experiment, we evaluated the influence of noise on the proposed blockiness measure. The six benchmark images were adopted again. In this experiment, we added Gaussian white noise of mean 0 and variance 0.01 into the images, and then compressed them using the baseline JPEG standard in MATLAB. The following procedure was the same as in the second experiment. Fig 4 reports the experimental results. It can be seen that even SSIM does not have correct result for the image of ‘couple’ (Fig 4(B)). The proposed measure still has better performance than other no-reference measures. To evaluate whether the entropy component can correctly represent the compression level for images with different noises, we calculated measure s2 in Eq (8) for images added with Gaussian white noise of different variances and salt & pepper noise with different densities, and compressed with different levels. For the sake of space limit, we show the results of one image (“rushmore” from CSIQ database [16]) in Table 2 and Table 3. The bitrates, the blockiness measure and the corresponding s2 are shown for each variance and each compression level. It can be seen that for any case of the noises, both of the measures increase monotonically with the decrease of bitrates, which means that neither of them is interfered by noise. Therefore, the block differentiation mechanism in the proposed measure which depends on high frequency component will not be influenced by noise. In the fourth experiment, the JPEG images from four image-quality databases, including MICT[17], IVC[18], TID2008 [19], and TID2013 [20], were used to evaluate the overall performance of the proposed measure. The JPEG subsets in MICT, IVC, TID2008, and TID2013 contain 98, 60, 125, and 150 images, respectively. In these databases, 14 of 98, 10 of 60, 25 of 125, and 25 of 150 are reference images. Pearson linear correlation coefficient (PLCC) and root mean square error (RMSE) were used to evaluate the prediction accuracy, and Spearman rank-order correlation coefficient (SROCC) was used to evaluate the prediction monotonicity. These criterions were computed between the predicted scores and the subjective scores, where nonlinear fitting was first conducted to bring them on the same scale [10][21]. For comparison, the results of some popular no-reference blocking artifact metrics were also provided, including Perra’s [2], Wang’s [6], Pan’s [7], Lee’s [8], Li’s [10], Bovik’s [22], Wang’s [23], Liu’s [24], Chen’s [25], Mittal’s [26], Ye’s [27], Liu’s [28], and Liu’s [29]. A good quality measure should achieve high values in PLCC and SRCC, while low values in RMSE. Tables 4 and 5 provide the results, where the best result is marked in boldface. As shown in Table 4, the proposed measure yields the best results of both PLCC and SROCC for MICT, IVC, and TID2013. For TID2008, its SRCC is also the best, and its PLCC is very close to the best result. It can be seen that the proposed measure has a very high correlation with the subjective quality ratings. As for RMSE, the proposed measure also provides the best result for TID2008.
Fig 4

Relationship between normalized blockiness measures and bitrates of some benchmark images interfered by Gaussian noise.

(a) lena, (b) couple, (c) landscape, (d) bird, (e) lighthouse, and (f) statue. For the copyright issue, the original images are not shown here.

Table 2

Evaluation of the proposed measure and the corresponding s2 for the image“rushmore” from CSIQ database with Gaussian white noise of different variances and with different compression levels

Gaussian white noise of variance 0.01
bitrates5.54653.60133.15072.51282.06251.74721.45651.17350.85710.4667
proposed measure0.11360.11370.11370.11370.11500.11540.11800.11850.12380.2812
s20.06000.06040.06090.06130.06240.06330.06480.06680.07830.3210
Gaussian white noise of variance 0.03
bitrates6.13284.05262.99552.75472.20471.68021.47091.25300.81750.5655
proposed measure0.11490.11500.11560.11560.11730.11750.11860.11870.11900.1576
s20.06680.06700.06790.06800.06820.06890.06950.07030.07590.1386
Gaussian white noise of variance 0.05
bitrates6.45324.99354.27703.22582.28452.09601.67781.42461.16270.6384
proposed measure0.11740.11800.11810.11890.11900.11920.11970.12040.12270.1316
s20.06970.07020.07020.07030.07060.07050.07060.07160.07220.0959
Table 3

Evaluation of the proposed measure and the corresponding s2 for the image“rushmore” from CSIQ database with salt and pepper noise of different densities and with different compression levels

salt and pepper noise of density 0.01
bitrates3.63812.56712.02511.67321.43921.23101.01140.88610.75110.4349
proposed measure0.11950.12180.12550.12860.13050.13690.14380.15270.16250.2853
s20.05040.05450.06160.06890.07560.08470.10060.11480.13360.3325
salt and pepper noise of density 0.03
bitrates3.52202.71732.22781.87081.60751.33341.02140.85840.68010.4766
proposed measure0.12130.12390.12760.12970.13490.14180.15270.16250.19060.2853
s20.05260.05850.06550.07120.08120.09150.11480.13360.18190.3325
salt and pepper noise of density 0.05
bitrates3.05831.87571.55261.33131.08661.00720.91110.68910.56350.4167
proposed measure0.12480.12740.12820.12830.12910.13000.13280.13510.14960.2172
s20.05130.05590.06010.06270.06960.07190.07590.09030.11790.2272
Table 4

PLCC and SROCC of blockiness measures for the JPEG images in MICT, IVC, TID2008, and TID2013 databases.

DatabaseMeasurePLCCSROCC
MICTPerra [2]0.74320.7516
Wang [6]0.79140.7689
Pan [7]0.80170.8253
Lee [8]0.76250.8097
Li [10]0.91070.9007
Bovik [22]0.88550.8067
Wang [23]0.91250.8952
Liu [24]0.81260.8136
Chen [25]0.80300.8227
Proposed0.94910.9828
IVCPerra [2]0.78160.8034
Wang [6]0.71120.7172
Pan [7]0.79460.7973
Lee [8]0.89820.8968
Li [10]0.92860.9169
Bovik [22]0.90490.9006
Wang [23]0.95170.9456
Liu [24]0.82950.8284
Chen [25]0.86010.8840
Proposed0.95570.9900
TID2008Perra [2]0.72630.0727
Wang [6]0.68500.6691
Pan [7]0.70620.5357
Lee [8]0.74840.6152
Li [9]0.78530.7401
Li [13]0.95310.9311
Ren [14]0.8970.840
Bovik [22]0.74790.6169
Chen [25]0.75200.6866
Proposed0.95120.9840
TID2013Freitas [12]0.88770.7133
Mittal [26]0.81450.6567
Ye [27]0.84670.6622
Liu [28]0.78050.5922
Liu [29]0.83150.6533
Proposed0.95550.9920
Table 5

RMSE of blockiness measures for the JPEG images in MICT, IVC and TID2008 databases.

DatabaseMeasureRMSE
MICTPerra [2]0.8828
Wang [6]0.8067
Pan [7]0.7887
Lee [8]0.8538
Li [10]0.5452
Bovik [22]0.6131
Wang [23]0.5399
Liu [24]0.7691
Chen [25]0.7864
Proposed0.7408
IVCPerra [2]0.7257
Wang [6]0.8179
Pan [7]0.7063
Lee [8]0.5114
Li [10]0.4318
Bovik [22]0.4952
Wang [23]0.3572
Liu [24]0.6497
Chen [25]0.5935
Proposed0.7019
TID2008Perra [2]1.1708
Wang [6]1.2410
Pan [7]1.2016
Lee [8]1.1298
Li [9]0.9925
Bovik [22]1.1307
Chen [25]1.1227
Proposed0.9199

Relationship between normalized blockiness measures and bitrates of some benchmark images interfered by Gaussian noise.

(a) lena, (b) couple, (c) landscape, (d) bird, (e) lighthouse, and (f) statue. For the copyright issue, the original images are not shown here. The experimental results show that the proposed blockiness measure is effective and stable across a wide variety of images. It has a good estimation of compression level over different image areas. Moreover, it is robust to image noise and can be used for real-world image quality monitoring and control.

Conclusions

In this paper, we proposed a no-reference JPEG blockiness measure which is adaptive to the high frequency component in different image areas. It uses difference of entropies across blocks and variation of block boundary pixel values in edge images to estimate the blockiness level in an image. Extensive experiments were conducted to evaluate the performance of the proposed measure. The results indicated that compared with other blockiness indices, the proposed measure has satisfying and stable performance across a wide variety of images. It is robust to image noise and has positive perspectives in applications such as image quality monitoring and control.
  3 in total

1.  Image quality assessment: from error visibility to structural similarity.

Authors:  Zhou Wang; Alan Conrad Bovik; Hamid Rahim Sheikh; Eero P Simoncelli
Journal:  IEEE Trans Image Process       Date:  2004-04       Impact factor: 10.856

2.  VSNR: a wavelet-based visual signal-to-noise ratio for natural images.

Authors:  Damon M Chandler; Sheila S Hemami
Journal:  IEEE Trans Image Process       Date:  2007-09       Impact factor: 10.856

3.  No-reference image quality assessment in the spatial domain.

Authors:  Anish Mittal; Anush Krishna Moorthy; Alan Conrad Bovik
Journal:  IEEE Trans Image Process       Date:  2012-08-17       Impact factor: 10.856

  3 in total
  1 in total

1.  Similarity measure and domain adaptation in multiple mixture model clustering: An application to image processing.

Authors:  Siow Hoo Leong; Seng Huat Ong
Journal:  PLoS One       Date:  2017-07-07       Impact factor: 3.240

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.