| Literature DB >> 25254237 |
Qiuwen Zhang1, Nana Li1, Yong Gan1.
Abstract
High efficiency video coding- (HEVC-) based 3D video coding (3D-HEVC) developed by joint collaborative team on 3D video coding (JCT-3V) for multiview video and depth map is an extension of HEVC standard. In the test model of 3D-HEVC, variable coding unit (CU) size decision and disparity estimation (DE) are introduced to achieve the highest coding efficiency with the cost of very high computational complexity. In this paper, a fast mode decision algorithm based on variable size CU and DE is proposed to reduce 3D-HEVC computational complexity. The basic idea of the method is to utilize the correlations between depth map and motion activity in prediction mode where variable size CU and DE are needed, and only in these regions variable size CU and DE are enabled. Experimental results show that the proposed algorithm can save about 43% average computational complexity of 3D-HEVC while maintaining almost the same rate-distortion (RD) performance.Entities:
Mesh:
Year: 2014 PMID: 25254237 PMCID: PMC4164800 DOI: 10.1155/2014/392505
Source DB: PubMed Journal: ScientificWorldJournal ISSN: 1537-744X
Figure 1The simple example for recursive CU.
Statistical analysis of depth level distribution for three treeblocks types.
| Sequences | Treeblocks in near region mode | Treeblocks in middle region mode | Treeblocks in far region mode | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Level 0 (%) | Level 1 (%) | Level 2 (%) | Level 3 (%) | Level 0 (%) | Level 1 (%) | Level 2 (%) | Level 3 (%) | Level 0 (%) | Level 1 (%) | Level 2 (%) | Level 3 (%) | |
| Kendo | 67.4 | 23.9 | 7.6 | 2.1 | 3.6 | 29.2 | 36.8 | 30.4 | 91.2 | 7.2 | 1.3 | 0.3 |
| Balloons | 75.6 | 18.2 | 5.8 | 0.4 | 4.6 | 31.9 | 37.2 | 26.3 | 89.3 | 6.3 | 3.2 | 1.2 |
| Newspaper | 58.9 | 29.4 | 8.6 | 3.1 | 2.1 | 26.8 | 37.7 | 33.4 | 92.4 | 6.1 | 1.2 | 0.3 |
| Shark | 70.2 | 19.5 | 7.5 | 2.8 | 1.9 | 23.9 | 36.1 | 38.1 | 93.7 | 4.9 | 1.3 | 0.1 |
| Undo_Dancer | 81.2 | 15.3 | 3.2 | 0.3 | 3.2 | 30.9 | 37.3 | 28.6 | 91.5 | 5.8 | 2.1 | 0.6 |
| GT_Fly | 70.4 | 21.8 | 6.2 | 1.6 | 4.1 | 37.2 | 29.5 | 29.2 | 87.2 | 8.2 | 3.6 | 1.0 |
| Poznan_Street | 65.6 | 24.3 | 7.5 | 2.6 | 2.8 | 27.2 | 32.9 | 37.1 | 92.8 | 4.6 | 1.7 | 0.9 |
| Poznan_Hall2 | 72.4 | 19.3 | 5.9 | 2.6 | 4.5 | 36.8 | 34.4 | 24.3 | 85.1 | 9.4 | 3.7 | 1.8 |
|
| ||||||||||||
| Average | 70.2 | 21.5 | 6.5 | 1.9 | 3.4 | 30.5 | 35.2 | 30.9 | 90.4 | 6.6 | 2.3 | 0.8 |
Candidate depth levels for three treeblocks types.
| Treeblock type | Candidate depth levels | Depth range, [Depthmin, Depthmax] |
|---|---|---|
| Near region mode | 0, 1 | [0,1] |
| Middle region mode | 1, 2, 3 | [1,2, 3] |
| Far region mode | 0 | [0,0] |
Figure 2Flowchart of the proposed fast CU size decision algorithm.
Analysis of view prediction and temporal prediction distributions for three treeblocks types.
| Sequences | Treeblocks in near region mode | Treeblocks in middle region mode | Treeblocks in far region mode | |||
|---|---|---|---|---|---|---|
| T (%) | V (%) | T (%) | V (%) | T (%) | V (%) | |
| Kendo | 87.4 | 12.6 | 62.3 | 37.7 | 98.1 | 1.9 |
| Balloons | 82.9 | 17.1 | 71.2 | 29.8 | 96.3 | 3.7 |
| Newspaper | 91.2 | 8.8 | 76.5 | 23.5 | 99.5 | 0.5 |
| Shark | 83.6 | 16.4 | 72.1 | 27.9 | 97.4 | 2.6 |
| Undo_Dancer | 91.3 | 8.7 | 64.2 | 35.8 | 96.8 | 3.2 |
| GT_Fly | 87.7 | 12.3 | 62.3 | 37.7 | 94.3 | 5.7 |
| Poznan_Street | 85.6 | 14.4 | 67.6 | 32.4 | 96.1 | 3.9 |
| Poznan_Hall2 | 90.2 | 9.8 | 77.3 | 22.7 | 94.9 | 5.1 |
|
| ||||||
| Average | 87.4 | 12.5 | 69.2 | 30.9 | 96.7 | 3.3 |
“T” and “V” represent temporal prediction and view prediction, respectively.
Figure 3Flowchart of the proposed selective disparity estimation algorithm.
Results of each individual algorithm compared to 3DV-HTM.
| Sequences | Texture | Rendered | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| FCUS | SDE | FCUS | SDE | |||||||||
| BDBR (%) | BDPSNR (dB) | Dtime (%) | BDBR (%) | BDPSNR (dB) | Dtime (%) | BDBR (%) | BDPSNR (dB) | Dtime (%) | BDBR (%) | BDPSNR (dB) | Dtime (%) | |
| Kendo | 1.23 | −0.03 | −38 | 0.82 | −0.02 | −12 | 0.22 | −0.02 | −38 | 0.12 | −0.01 | −12 |
| Balloons | 1.41 | −0.05 | −32 | 0.35 | −0.01 | −8 | 0.37 | −0.04 | −32 | 0.32 | −0.00 | −8 |
| Newspaper | 0.78 | −0.02 | −39 | 0.96 | −0.04 | −17 | 0.16 | −0.03 | −39 | 0.26 | −0.02 | −17 |
| Shark | 2.17 | −0.06 | −42 | 1.13 | −0.02 | −21 | 0.43 | −0.02 | −42 | 0.33 | −0.01 | −21 |
| Undo_Dancer | 0.89 | −0.08 | −29 | 0.47 | −0.01 | −6 | 0.19 | −0.03 | −29 | 0.21 | −0.02 | −6 |
| GT_Fly | 1.54 | −0.02 | −34 | 1.34 | −0.05 | −11 | 0.67 | −0.04 | −34 | 0.14 | −0.01 | −11 |
| Poznan_Street | 0.82 | −0.07 | −26 | 0.96 | −0.06 | −16 | 0.32 | −0.01 | −26 | 0.32 | −0.03 | −16 |
| Poznan_Hall2 | 0.94 | −0.04 | −44 | 0.72 | −0.03 | −9 | 0.43 | −0.01 | −44 | 0.21 | −0.02 | −9 |
|
| ||||||||||||
| Average | 1.22 | −0.046 | −35 | 0.84 | −0.03 | −13 | 0.35 | −0.025 | −35 | 0.24 | −0.015 | −13 |
Results of the proposed overall algorithm compared with 3DV-HTM.
| Sequences | Overall algorithm | |||||
|---|---|---|---|---|---|---|
| Texture | Rendered | |||||
| BDBR (%) | BDPSNR (dB) | Dtime (%) | BDBR (%) | BDPSNR (dB) | Dtime (%) | |
| Kendo | 1.92 | −0.04 | −45 | 1.21 | −0.03 | −45 |
| Balloons | 1.78 | −0.06 | −39 | 0.84 | −0.02 | −39 |
| Newspaper | 1.27 | −0.03 | −46 | 1.27 | −0.04 | −46 |
| Shark | 2.34 | −0.06 | −48 | 1.29 | −0.03 | −48 |
| Undo_Dancer | 1.12 | −0.09 | −37 | 0.58 | −0.02 | −37 |
| GT_Fly | 1.54 | −0.03 | −41 | 1.82 | −0.06 | −41 |
| Poznan_Street | 1.45 | −0.08 | −36 | 1.29 | −0.07 | −36 |
| Poznan_Hall2 | 1.76 | −0.05 | −53 | 0.96 | −0.04 | −53 |
|
| ||||||
| Average | 1.65 | −0.055 | −43 | 1.16 | −0.039 | −43 |
Figure 4Rate-distortion curves comparison. The x-axis denotes total bitrate to code three texture videos and three depth maps. The y-axis denotes average PSNR of the rendered views.
Comparing the proposed overall algorithm with a state-of-the-art fast algorithm in [14].
| Sequences | Texture | Rendered | ||||
|---|---|---|---|---|---|---|
| BDBR (%) | BDPSNR (dB) | Dtime (%) | BDBR (%) | BDPSNR (dB) | Dtime (%) | |
| Kendo | 1.35 | −0.09 | −12 | 1.12 | −0.08 | −12 |
| Balloons | 1.54 | −0.12 | −8 | 1.38 | −0.11 | −8 |
| Newspaper | 3.12 | −0.19 | −21 | 2.59 | −0.16 | −21 |
| Shark | 1.69 | −0.14 | −15 | 1.34 | −0.11 | −15 |
| Undo_Dancer | 2.71 | −0.18 | −3 | 2.16 | −0.14 | −3 |
| GT_Fly | 1.89 | −0.15 | −5 | 1.54 | −0.12 | −5 |
| Poznan_Street | 1.63 | −0.13 | −6 | 1.21 | −0.09 | −6 |
| Poznan_Hall2 | 1.72 | −0.14 | −22 | 1.53 | −0.12 | −22 |
|
| ||||||
| Average | 1.96 | −0.14 | −12 | 1.61 | −0.12 | −12 |