| Literature DB >> 35062460 |
Hua Zhang1,2, Xinwen Hu1, Ruoyun Gou1, Lingjun Zhang1,3, Bolun Zheng1, Zhuonan Shen1.
Abstract
The human visual system (HVS), affected by viewing distance when perceiving the stereo image information, is of great significance to study of stereoscopic image quality assessment. Many methods of stereoscopic image quality assessment do not have comprehensive consideration for human visual perception characteristics. In accordance with this, we propose a Rich Structural Index (RSI) for Stereoscopic Image objective Quality Assessment (SIQA) method based on multi-scale perception characteristics. To begin with, we put the stereo pair into the image pyramid based on Contrast Sensitivity Function (CSF) to obtain sensitive images of different resolution. Then, we obtain local Luminance and Structural Index (LSI) in a locally adaptive manner on gradient maps which consider the luminance masking and contrast masking. At the same time we use Singular Value Decomposition (SVD) to obtain the Sharpness and Intrinsic Structural Index (SISI) to effectively capture the changes introduced in the image (due to distortion). Meanwhile, considering the disparity edge structures, we use gradient cross-mapping algorithm to obtain Depth Texture Structural Index (DTSI). After that, we apply the standard deviation method for the above results to obtain contrast index of reference and distortion components. Finally, for the loss caused by the randomness of the parameters, we use Support Vector Machine Regression based on Genetic Algorithm (GA-SVR) training to obtain the final quality score. We conducted a comprehensive evaluation with state-of-the-art methods on four open databases. The experimental results show that the proposed method has stable performance and strong competitive advantage.Entities:
Keywords: cyclopean map; depth information; image pyramid; structural index; visual sensitivity
Mesh:
Year: 2022 PMID: 35062460 PMCID: PMC8780543 DOI: 10.3390/s22020499
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Framework diagram of the proposed method.
Figure 2Image pairs with different scales. (a) Original image pair. (b) down-sampled image pair by 2× times. (c) down-sampled image pair by 4× times. (d) down-sampled image pair by 5× times. Three representative regions are highlighted with red rectangles in each image.
Figure 3Workflow diagram of LSI.
Figure 4Comparison of two methods of gradient. From (top) to (bottom), original maps, normal gradient maps and locally weighted gradient maps.
Figure 5Workflow diagram of SISI.
Figure 6Workflow diagram of DTSI.
Figure 7Curves of gradient similarity deviation values in multi-scale model.
Figure 8Curves of singular value similarity deviation values in multi-scale model.
Performance of SIQA on four public databases, measuring the effect of cyclopean map and depth perception features on SIQA, respectively.
|
|
|
|
| |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| PLCC | SROCC | RMSE | PLCC | SROCC | RMSE | PLCC | SROCC | RMSE | PLCC | SROCC | RMSE | |
| LIVE Phase-I | 0.9412 | 0.9278 | 5.2598 | 0.9389 | 0.9201 | 5.7892 | 0.8545 | 0.8458 | 8.5975 | 0.9512 | 0.9429 | 5.0028 |
| LIVE Phase-II 3D | 0.9325 | 0.9317 | 5.8256 | 0.9263 | 0.9136 | 5.2715 | 0.7789 | 0.7369 | 10.2548 | 0.9431 | 0.9452 | 4.2859 |
| WaterlooIVC Phase-I | 0.9458 | 0.9389 | 5.0214 | 0.9404 | 0.9321 | 5.5825 | 0.7782 | 0.7654 | 9.8975 | 0.9546 | 0.9478 | 4.2859 |
| MCL | 0.9124 | 0.9147 | 1.2925 | 0.9077 | 0.9101 | 1.2356 | 0.7625 | 0.7855 | 1.5478 | 0.9219 | 0.9259 | 1.0026 |
Overall performance comparison of the proposed SIQA method and six methods on LIVE Phase-I and LIVE Phase-II 3D databases.
| LIVE Phase-I | LIVE Phase-II | |||||
|---|---|---|---|---|---|---|
| PLCC | SROCC | RMSE | PLCC | SROCC | RMSE | |
| Jiang [ | 0.9460 | 0.9378 | 5.3160 | 0.9261 | 0.9257 | 4.2627 |
| Yue [ | 0.9370 | 0.9140 | 5.6521 | 0.9140 | 0.9060 | 4.4490 |
| Khan [ | 0.9272 | 0.9163 | - | 0.9323 | 0.9272 | - |
| Shao [ | 0.9389 | 0.9308 | 5.6459 | 0.9263 | 0.9282 | 4.1996 |
| Geng [ | 0.9430 | 0.9320 | 5.5140 | 0.9210 | 0.9190 | 5.4001 |
| Ma [ | 0.9409 | 0.9340 | 5.2110 | 0.9300 | 0.9218 |
|
| proposed |
|
|
|
|
| 4.2859 |
Overall performance comparison of the proposed SIQA method and six methods on WaterlooIVC Phase-I 3D.
| PLCC | SROCC | RMSE | |
|---|---|---|---|
| Khan [ | 0.9344 | 0.9253 | - |
| Ma [ | 0.9252 | 0.9117 | 5.8766 |
| Yue [ | 0.9261 | 0.9192 |
|
| Yang [ | 0.9439 | 0.9246 | - |
| Geng [ | 0.8460 | 0.8101 | 9.4691 |
| Proposed |
|
| 4.6836 |
Overall performance comparison of the proposed SIQA method and six methods on MCL 3D.
| PLCC | SROCC | RMSE | |
|---|---|---|---|
| Zhou [ | 0.8850 | 0.8520 | 1.1770 |
| Shao [ | 0.9138 | 0.9040 | 1.0233 |
| Khan [ | 0.9113 | 0.9058 | - |
| Liu [ | 0.9044 | 0.9087 | 1.1137 |
| Chen [ | 0.8278 | 0.8300 | 1.4596 |
| Proposed |
|
|
|
Figure 9Scatter plots of overall predicted quality scores against the subjective scores of the proposed method on four Database. (a) LIVE Phase–I. (b) LIVE Phase–II. (c) WaterlooIVC Phase–I. (d) MCL.
In LIVE Phase-I and LIVE Phase-II 3D databases, performance comparison of the proposed SIQA method and six methods on different types of distortion, and the evaluation index is PLCC.
| LIVE Phase-I | LIVE Phase-II | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| JP2K | JPEG | Gblur | WN | FF | JP2K | JPEG | Gblur | WN | FF | |
| Jiang [ | 0.9408 | 0.6975 | 0.9578 | 0.9516 | 0.8554 | 0.8463 | 0.8771 | 0.9845 | 0.9593 | 0.9601 |
| Yue [ | 0.9350 | 0.7440 | 0.9710 | 0.9620 | 0.8540 |
| 0.8430 | 0.9730 |
| 0.9230 |
| Khan [ | 0.9508 | 0.7110 | 0.9593 | 0.9470 | 0.8583 | 0.9270 | 0.8925 | 0.9778 | 0.9699 | 0.8987 |
| Shao [ | 0.9366 | 0.6540 | 0.9542 | 0.9441 | 0.8304 | 0.8768 | 0.8506 | 0.9445 | 0.9339 | 0.9330 |
| Geng [ | 0.9420 | 0.7190 | 0.9620 |
| 0.8670 | 0.8510 | 0.8350 | 0.9790 | 0.9490 | 0.9480 |
| Ma [ | 0.9610 | 0.7746 | 0.9711 | 0.9412 |
| 0.9670 | 0.9350 | 0.9384 | 0.9341 | 0.9489 |
| Proposed |
|
|
| 0.9558 | 0.8856 | 0.9327 |
|
| 0.9707 |
|
In LIVE Phase-I and LIVE Phase-II 3D databases, performance comparison of the proposed SIQA method and six methods on different types of distortion, and the evaluation index is SROCC.
| LIVE Phase-I | LIVE Phase-II | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| JP2K | JPEG | Gblur | WN | FF | JP2K | JPEG | Gblur | WN | FF | |
| Jiang [ | 0.9027 | 0.6628 |
| 0.9529 | 0.8079 | 0.8497 | 0.8547 | 0.9383 | 0.9563 |
|
| Yue [ | 0.8320 | 0.5950 | 0.8570 | 0.9320 | 0.7790 | 0.9590 | 0.7690 | 0.8680 |
| 0.9130 |
| Khan [ | 0.9074 | 0.6062 | 0.9295 | 0.9386 | 0.8092 | 0.9133 | 0.8670 | 0.8854 | 0.9584 | 0.8646 |
| Shao [ | 0.9000 | 0.6339 | 0.9242 | 0.9430 | 0.7807 | 0.8747 | 0.8340 | 0.9241 | 0.9325 | 0.9409 |
| Geng [ | 0.9050 | 0.6530 | 0.9310 |
| 0.8160 | 0.8360 | 0.8410 | 0.9210 | 0.9390 | 0.9160 |
| Ma [ | 0.9140 | 0.6659 | 0.9030 | 0.9037 |
| 0.9328 | 0.8968 | 0.8992 | 0.8893 | 0.9167 |
| Proposed |
|
| 0.9252 | 0.9335 | 0.8156 |
|
|
| 0.9319 | 0.9174 |
In MCL 3D database, performance comparison of the proposed SIQA method and three methods on different types of distortion, and the evaluation index is PLCC.
| Zhou [ | Shao [ | Khan [ | Liu [ | Proposed | |
|---|---|---|---|---|---|
| JPEG | 0.8260 | 0.7016 |
| 0.9404 | 0.9432 |
| JP2K | 0.8760 | 0.8571 | 0.9640 | 0.9219 |
|
| WN | 0.9140 | 0.6748 | 0.9561 | 0.9135 |
|
| Gblur | 0.9340 | 0.9013 | 0.9270 | 0.9479 |
|
| Sblur | 0.9410 | 0.8640 | 0.9409 | 0.9530 |
|
| Tloss |
| 0.5814 | 0.8722 | 0.7618 | 0.8571 |
In MCL 3D database, performance comparison of the proposed SIQA method and three methods on different types of distortion, and the evaluation index is SROCC.
| Zhou [ | Shao [ | Khan [ | Liu [ | Proposed | |
|---|---|---|---|---|---|
| JPEG | 0.7760 | 0.7992 | 0.8877 | 0.8506 |
|
| JP2K | 0.8520 | 0.8415 | 0.9317 | 0.9011 |
|
| WN | 0.9040 | 0.6404 |
| 0.9256 | 0.9273 |
| Gblur | 0.9160 | 0.8993 | 0.9131 |
| 0.9504 |
| Sblur | 0.9330 | 0.8532 | 0.9348 | 0.9577 |
|
| Tloss | 0.8450 | 0.5674 | 0.8744 | 0.7909 |
|