| Literature DB >> 35735972 |
Abstract
With the development of digital imaging techniques, image quality assessment methods are receiving more attention in the literature. Since distortion-free versions of camera images in many practical, everyday applications are not available, the need for effective no-reference image quality assessment algorithms is growing. Therefore, this paper introduces a novel no-reference image quality assessment algorithm for the objective evaluation of authentically distorted images. Specifically, we apply a broad spectrum of local and global feature vectors to characterize the variety of authentic distortions. Among the employed local features, the statistics of popular local feature descriptors, such as SURF, FAST, BRISK, or KAZE, are proposed for NR-IQA; other features are also introduced to boost the performances of local features. The proposed method was compared to 12 other state-of-the-art algorithms on popular and accepted benchmark datasets containing RGB images with authentic distortions (CLIVE, KonIQ-10k, and SPAQ). The introduced algorithm significantly outperforms the state-of-the-art in terms of correlation with human perceptual quality ratings.Entities:
Keywords: image statistics; no-reference image quality assessment; quality-aware features
Year: 2022 PMID: 35735972 PMCID: PMC9224559 DOI: 10.3390/jimaging8060173
Source DB: PubMed Journal: J Imaging ISSN: 2313-433X
Summary about the applied benchmark IQA databases with authentic distortions. DSLR: digital single-lens reflex camera. DSC: digital still camera. SPHN: smartphone.
| Attribute | CLIVE [ | KonIQ-10k [ | SPAQ [ |
|---|---|---|---|
| #Images | 1162 | 10,073 | 11,125 |
| Resolution |
|
|
|
| #Subjects | 8100 | 1,467 | 600 |
| #Annotations | 1400 | 1,200,000 | 186,400 |
| Scale of quality scores | 0–100 | 1–5 | 0–100 |
| Subjective methodology | Crowdsourcing | Crowdsourcing | Laboratory |
| Types of cameras | DSLR/DSC/SPHN | DSLR/DSC/SPHN | SPHN |
| Year of publication | 2017 | 2018 | 2020 |
Figure 1Empirical MOS distributions in the applied IQA databases: (a) CLIVE [45], (b) KonIQ-10k [46], and (c) SPAQ [47].
The applied computer configuration of the experiments.
| Computer model | STRIX Z270H Gaming |
| Operating system | Windows 10 |
| CPU | Intel(R) Core(TM) i7-7700K CPU 4.20 GHz (8 cores) |
| Memory | 15 GB |
| GPU | Nvidia GeForce GTX 1080 |
Figure 2Workflow of the proposed NR-IQA algorithm.
Summary of the applied features. Quality-aware features proposed by this paper are in bold.
| Feature | Input | Feature | Number of |
|---|---|---|---|
|
| SURF [ | mean, median, std, | 5 |
|
| FAST [ | mean, median, std, | 5 |
|
| BRISK [ | mean, median, std, | 5 |
|
| KAZE [ | mean, median, std, | 5 |
|
| ORB [ | mean, median, std, | 5 |
|
| Harris [ | mean, median, std, | 5 |
|
| Minimum Eigenvalue [ | mean, median, std, | 5 |
|
| SURF [ | mean, median, std, | 5 |
|
| FAST [ | mean, median, std, | 5 |
|
| BRISK [ | mean, median, std, | 5 |
|
| KAZE [ | mean, median, std, | 5 |
|
| ORB [ | mean, median, std, | 5 |
|
| Harris [ | mean, median, std, | 5 |
|
| Minimum Eigenvalue [ | mean, median, std, | 5 |
|
| Binary image | Hu invariant moments [ | 7 |
| f78-f87 | RGB image | Perceptual features | 10 |
|
| GL-GM map | histogram variance | 1 |
|
| GL-GM map | histogram variance | 1 |
|
| GL-GM map | histogram variance | 1 |
| f91 | GM map [ | histogram variance | 1 |
| f92 | RO map [ | histogram variance | 1 |
| f93 | RM map [ | histogram variance | 1 |
Mean values of perceptual features in CLIVE [45] with respect to five equal MOS intervals.
|
|
|
|
|
| |
|---|---|---|---|---|---|
| Blur | 0.412 | 0.362 | 0.315 | 0.285 | 0.329 |
| Colorfulness | 0.046 | 0.038 | 0.042 | 0.045 | 0.072 |
| Chroma | 15.510 | 13.681 | 14.995 | 15.409 | 21.977 |
| Color gradient-mean | 92.801 | 116.884 | 154.651 | 189.795 | 196.287 |
| Color gradient-std | 132.693 | 163.876 | 207.837 | 244.420 | 235.855 |
| DCF | 0.217 | 0.211 | 0.197 | 0.220 | 0.192 |
| Michelson contrast | 2.804 | 2.832 | 2.911 | 2.937 | 2.953 |
| RMS contrast | 0.201 | 0.201 | 0.219 | 0.222 | 0.223 |
| GCF | 5.304 | 5.488 | 6.602 | 6.264 | 6.796 |
| Entropy | 6.832 | 6.985 | 7.182 | 7.413 | 7.583 |
Standard deviation values of perceptual features in CLIVE [45] with respect to five equal MOS intervals.
|
|
|
|
|
| |
|---|---|---|---|---|---|
| Blur | 0.109 | 0.096 | 0.075 | 0.067 | 0.093 |
| Colorfulness | 0.050 | 0.033 | 0.037 | 0.039 | 0.049 |
| Chroma | 12.698 | 8.143 | 9.680 | 8.927 | 11.720 |
| Color gradient-mean | 45.480 | 66.164 | 89.762 | 96.283 | 99.800 |
| Color gradient-std | 58.236 | 71.187 | 82.104 | 84.179 | 78.250 |
| DCF | 0.141 | 0.122 | 0.117 | 0.115 | 0.105 |
| Michelson contrast | 0.328 | 0.252 | 0.173 | 0.143 | 0.140 |
| RMS contrast | 0.080 | 0.068 | 0.065 | 0.056 | 0.051 |
| GCF | 1.934 | 1.665 | 1.761 | 1.857 | 1.746 |
| Entropy | 1.019 | 0.966 | 0.748 | 0.532 | 0.227 |
Ablation study on CLIVE [45] database. Median PLCC, SROCC, and KROCC values were measured over 100 random train–test splits.
| SVR | GPR | |||||
|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
| Feature descriptors, RGB image | 0.518 | 0.484 | 0.337 | 0.578 | 0.523 | 0.364 |
| Feature descriptors, filtered image | 0.529 | 0.488 | 0.338 | 0.582 | 0.527 | 0.364 |
| Hu invariant moments | 0.302 | 0.295 | 0.199 | 0.328 | 0.320 | 0.219 |
| Perceptual features | 0.607 | 0.588 | 0.420 | 0.626 | 0.598 | 0.425 |
| GL and gradient statistics | 0.528 | 0.492 | 0.343 | 0.541 | 0.495 | 0.343 |
| All | 0.636 | 0.604 | 0.428 | 0.685 | 0.644 | 0.466 |
Figure 3Comparison of the statistics of local feature descriptors as quality-aware features in CLIVE [45]. Median SROCC values were measured over 100 random train–test splits. (a) RGB image, SVR, (b) RGB image, GPR, (c) filtered image, SVR, (d) filtered image, GPR.
Figure 4Performance in terms of SROCC of the proposed FLG-IQA in cases when a given feature is removed from the proposed feature vector. The performance of the entire feature vector is indicated by ‘X’. Median SROCC values were measured on CLIVE [45] after 100 random train–test splits.
Figure 5Results of the RReliefF algorithm on the features extracted from the images of CLIVE [45]. (a) nearest neighbours, (b) nearest neighbours, (c) nearest neighbours, (d) nearest neighbours.
Comparison to the state-of-the-art in CLIVE [45] and KonIQ-10k [46] databases. Median PLCC, SROCC, and KROCC values were measured over 100 random train–test splits. The best results are in bold and the second-best results are underlined.
| CLIVE [ | KonIQ-10k [ | |||||
|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
| BLIINDS-II [ | 0.473 | 0.442 | 0.291 | 0.574 | 0.575 | 0.414 |
| BMPRI [ | 0.541 | 0.487 | 0.333 | 0.637 | 0.619 | 0.421 |
| BRISQUE [ | 0.524 | 0.497 | 0.345 | 0.707 | 0.677 | 0.494 |
| CurveletQA [ |
|
| 0.421 | 0.730 | 0.718 | 0.495 |
| DIIVINE [ | 0.617 | 0.580 | 0.405 | 0.709 | 0.693 | 0.471 |
| ENIQA [ | 0.596 | 0.564 | 0.376 | 0.761 | 0.745 |
|
| GRAD-LOG-CP [ | 0.607 | 0.604 | 0.383 | 0.705 | 0.696 | 0.501 |
| GWH-GLBP [ | 0.584 | 0.559 | 0.395 | 0.723 | 0.698 | 0.507 |
| NBIQA [ | 0.629 | 0.604 |
|
|
| 0.515 |
| OG-IQA [ | 0.545 | 0.505 | 0.364 | 0.652 | 0.635 | 0.447 |
| PIQE [ | 0.172 | 0.108 | 0.081 | 0.208 | 0.246 | 0.172 |
| SSEQ [ | 0.487 | 0.436 | 0.309 | 0.589 | 0.572 | 0.423 |
| FLG-IQA |
|
|
|
|
|
|
Comparison to the state-of-the-art in the SPAQ [47] database. Median PLCC, SROCC, and KROCC values were measured over 100 random train–test splits. The best results are in bold and the second-best results are underlined.
| Method | PLCC | SROCC | KROCC |
|---|---|---|---|
| BLIINDS-II [ | 0.676 | 0.675 | 0.486 |
| BMPRI [ | 0.739 | 0.734 | 0.506 |
| BRISQUE [ | 0.726 | 0.720 | 0.518 |
| CurveletQA [ | 0.793 | 0.774 | 0.503 |
| DIIVINE [ | 0.774 | 0.756 | 0.514 |
| ENIQA [ |
|
|
|
| GRAD-LOG-CP [ | 0.786 | 0.782 | 0.572 |
| GWH-GLBP [ | 0.801 | 0.796 | 0.542 |
| NBIQA [ | 0.802 | 0.793 | 0.539 |
| OG-IQA [ | 0.726 | 0.724 | 0.594 |
| PIQE [ | 0.211 | 0.156 | 0.091 |
| SSEQ [ | 0.745 | 0.742 | 0.549 |
| FLG-IQA |
|
|
|
Comparison to the state-of-the-art. Direct and weighted average PLCC, SROCC, and KROCC are reported based on the results measured in CLIVE [45], KonIQ-10k [46], and SPAQ [47]. The best results are in bold and the second-best results are underlined.
| Direct Average | Weighted Average | |||||
|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
| BLIINDS-II [ | 0.574 | 0.564 | 0.397 | 0.620 | 0.618 | 0.443 |
| BMPRI [ | 0.639 | 0.613 | 0.420 | 0.683 | 0.669 | 0.459 |
| BRISQUE [ | 0.652 | 0.631 | 0.452 | 0.707 | 0.689 | 0.498 |
| CurveletQA [ | 0.720 | 0.704 | 0.473 | 0.756 | 0.741 | 0.495 |
| DIIVINE [ | 0.700 | 0.676 | 0.463 | 0.737 | 0.718 | 0.489 |
| ENIQA [ | 0.723 | 0.704 |
| 0.778 |
|
|
| GRAD-LOG-CP [ | 0.699 | 0.694 | 0.485 | 0.740 | 0.734 | 0.530 |
| GWH-GLBP [ | 0.703 | 0.684 | 0.481 | 0.755 | 0.740 | 0.519 |
| NBIQA [ |
|
| 0.494 |
| 0.763 | 0.522 |
| OG-IQA [ | 0.641 | 0.621 | 0.468 | 0.683 | 0.673 | 0.516 |
| PIQE [ | 0.197 | 0.170 | 0.115 | 0.208 | 0.194 | 0.127 |
| SSEQ [ | 0.607 | 0.583 | 0.427 | 0.661 | 0.650 | 0.480 |
| FLG-IQA |
|
|
|
|
|
|
Figure 6Ground truth scores versus predicted scores in (a) CLIVE [45] and (b) KonIQ-10k [46] test sets.
Results of the significance tests. Symbol denotes that the proposed FLG-IQA algorithm is significantly ( confidence interval) better (worse) than the NR-IQA algorithm in the row on the IQA benchmark database in the column.
| Method | CLIVE [ | KonIQ-10k [ | SPAQ [ |
|---|---|---|---|
| BLIINDS-II [ | 1 | 1 | 1 |
| BMPRI [ | 1 | 1 | 1 |
| BRISQUE [ | 1 | 1 | 1 |
| CurveletQA [ | 1 | 1 | 1 |
| DIIVINE [ | 1 | 1 | 1 |
| ENIQA [ | 1 | 1 | 1 |
| GRAD-LOG-CP [ | 1 | 1 | 1 |
| GWH-GLBP [ | 1 | 1 | 1 |
| NBIQA [ | 1 | 1 | 1 |
| OG-IQA [ | 1 | 1 | 1 |
| PIQE [ | 1 | 1 | 1 |
| SSEQ [ | 1 | 1 | 1 |
Results of the cross-database test. The examined and the proposed methods were trained on KonIQ-10k [46] and tested on CLIVE [45]. The best results are in bold and the second-best results are underlined.
| Method | PLCC | SROCC | KROCC |
|---|---|---|---|
| BLIINDS-II [ | 0.107 | 0.090 | 0.063 |
| BMPRI [ | 0.453 | 0.389 | 0.298 |
| BRISQUE [ |
| 0.460 | 0.310 |
| CurveletQA [ | 0.496 | 0.505 |
|
| DIIVINE [ | 0.479 | 0.434 | 0.299 |
| ENIQA [ | 0.428 | 0.386 | 0.272 |
| GRAD-LOG-CP [ | 0.427 | 0.384 | 0.261 |
| GWH-GLBP [ | 0.480 | 0.479 | 0.328 |
| NBIQA [ | 0.503 |
| 0.284 |
| OG-IQA [ | 0.442 | 0.427 | 0.289 |
| SSEQ [ | 0.270 | 0.256 | 0.170 |
| FLG-IQA |
|
|
|
Figure 7Normalized ground truth scores versus normalized predicted score scatter plot of FLG-IQA in the cross-database test.