| Literature DB >> 24963512 |
Qiuwen Zhang1, Nana Li1, Qinggang Wu1.
Abstract
The emerging international standard of high efficiency video coding based 3D video coding (3D-HEVC) is a successor to multiview video coding (MVC). In 3D-HEVC depth intracoding, depth modeling mode (DMM) and high efficiency video coding (HEVC) intraprediction mode are both employed to select the best coding mode for each coding unit (CU). This technique achieves the highest possible coding efficiency, but it results in extremely large encoding time which obstructs the 3D-HEVC from practical application. In this paper, a fast mode decision algorithm based on the correlation between texture video and depth map is proposed to reduce 3D-HEVC depth intracoding computational complexity. Since the texture video and its associated depth map represent the same scene, there is a high correlation among the prediction mode from texture video and depth map. Therefore, we can skip some specific depth intraprediction modes rarely used in related texture CU. Experimental results show that the proposed algorithm can significantly reduce computational complexity of 3D-HEVC depth intracoding while maintaining coding efficiency.Entities:
Mesh:
Year: 2014 PMID: 24963512 PMCID: PMC4052507 DOI: 10.1155/2014/620142
Source DB: PubMed Journal: ScientificWorldJournal ISSN: 1537-744X
Statistical analysis of intramode distributions in depth map coding.
| Sequences | QP = 34 | QP = 39 | QP = 42 | QP = 45 |
|---|---|---|---|---|
| Kendo | 1.0% | 0.8% | 0.6% | 0.5% |
| Balloons | 1.2% | 1.1% | 0.9% | 0.7% |
| Newspaper | 1.1% | 0.9% | 0.7% | 0.9% |
| Shark | 1.9% | 1.6% | 1.5% | 1.3% |
| Undo_Dancer | 0.8% | 0.7% | 0.5% | 0.4% |
| GT_Fly | 2.1% | 1.9% | 1.6% | 1.2% |
| Poznan_Street | 0.7% | 0.6% | 0.6% | 0.5% |
| Poznan_Hall2 | 0.2% | 0.2% | 0.1% | 0.1% |
|
| ||||
| Average | 1.1% | 1.0% | 0.8% | 0.7% |
Experimental condition: temporal length of a GOP = 8, basis quantization parameters (QP) = 34, 39, 42, and 45, treeblock size = 64, and CABAC (context-adaptive binary arithmetic coding) is used for entropy coding.
Statistical analysis different PU size distributions in DMM.
| Sequences | 64 × 64 | 32 × 32 | 16 × 16 | 8 × 8 | 4 × 4 |
|---|---|---|---|---|---|
| Kendo | None | 4.1 | 6.6 | 21.1 | 68.2 |
| Balloons | None | 4.4 | 7.1 | 15.2 | 73.3 |
| Newspaper | None | 1.6 | 3.4 | 12.5 | 82.5 |
| Shark | None | 4.5 | 8.8 | 27.3 | 59.4 |
| Undo_Dancer | None | 3.8 | 4.2 | 18.9 | 73.1 |
| GT_Fly | None | 5.4 | 9.9 | 25.2 | 59.5 |
| Poznan_Street | None | 1.6 | 3.2 | 10.5 | 84.7 |
| Poznan_Hall2 | None | 2.4 | 3.8 | 19.6 | 74.2 |
|
| |||||
| Average | None | 3.5 | 5.9 | 18.8 | 71.9 |
Statistical analysis for accuracies of ET methods in intradepth map coding.
| Sequences | ET based on the homogeneity checking | ET based on the texture video-depth map correlation |
|---|---|---|
| Kendo | 82.3% | 91.1% |
| Balloons | 79.5% | 88.3% |
| Newspaper | 89.2% | 95.8% |
| Shark | 83.5% | 92.2% |
| Undo_Dancer | 76.1% | 86.8% |
| GT_Fly | 83.8% | 90.3% |
| Poznan_Street | 72.2% | 84.6% |
| Poznan_Hall2 | 87.3% | 93.9% |
|
| ||
| Average | 81.7% | 90.4% |
Statistical analysis of accuracy for proposed algorithm.
| Sequences | Treeblocks in complex region | |||
|---|---|---|---|---|
| QP = 34 | QP = 39 | QP = 42 | QP = 45 | |
| Kendo | 86% | 92% | 94% | 97% |
| Balloons | 78% | 85% | 90% | 94% |
| Newspaper | 83% | 87% | 93% | 96% |
| Shark | 88% | 94% | 96% | 99% |
| Undo_Dancer | 87% | 93% | 95% | 98% |
| GT_Fly | 86% | 92% | 94% | 98% |
| Poznan_Street | 82% | 86% | 90% | 94% |
| Poznan_Hall2 | 84% | 89% | 92% | 95% |
|
| ||||
| Average | 84% | 90% | 93% | 96% |
Figure 1Texture video and corresponding depth map.
Statistical analysis of accuracy for two treeblocks type.
| Sequences | Treeblocks in homogeneous region | Treeblocks in complex region | ||||||
|---|---|---|---|---|---|---|---|---|
| QP = 34 | QP = 39 | QP = 42 | QP = 45 | QP = 34 | QP = 39 | QP = 42 | QP = 45 | |
| Kendo | 89% | 94% | 97% | 99% | 62% | 71% | 79% | 84% |
| Balloons | 79% | 88% | 94% | 97% | 58% | 65% | 73% | 80% |
| Newspaper | 88% | 93% | 95% | 98% | 63% | 71% | 74% | 86% |
| Shark | 93% | 95% | 98% | 100% | 82% | 90% | 93% | 96% |
| Undo_Dancer | 92% | 94% | 98% | 99% | 85% | 92% | 94% | 97% |
| GT_Fly | 95% | 97% | 98% | 100% | 79% | 87% | 92% | 94% |
| Poznan_Street | 90% | 93% | 96% | 98% | 72% | 76% | 80% | 87% |
| Poznan_Hall2 | 92% | 96% | 97% | 98% | 69% | 75% | 82% | 89% |
|
| ||||||||
| Average | 90% | 94% | 97% | 99% | 71% | 78% | 83% | 89% |
Encoder parameters settings.
| Codec | HTM4.1 |
|---|---|
| Number of frames | Full length |
| GOP size | 8 (Intrapicture is every 24 frames) |
| DMM | On |
| Motion search range | 64 |
| MaxCU size | 64 × 64 |
| MaxCU depth level | 4 |
| Depth QP values | 34, 39, 42, 45 |
| Texture QP values | 25, 30, 35, 40 |
Results of each individual algorithm compared to 3D-HEVC.
| Sequences | C2 | C3 | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ETMD | AD-CUDRD | ETMD | AD-CUDRD | |||||||||
| BDBR (%) | BDPSNR (dB) | Dtime (%) | BDBR (%) | BDPSNR (dB) | Dtime (%) | BDBR (%) | BDPSNR (dB) | Dtime (%) | BDBR (%) | BDPSNR (dB) | Dtime (%) | |
| Kendo | 1.22 | −0.04 | −35 | 0.79 | −0.02 | −31 | 1.15 | −0.03 | −36 | 0.52 | −0.01 | −32 |
| Balloons | 1.38 | −0.05 | −31 | 0.88 | −0.02 | −28 | 1.32 | −0.04 | −32 | 0.73 | −0.01 | −29 |
| Newspaper | 1.57 | −0.07 | −37 | 1.09 | −0.04 | −25 | 1.35 | −0.06 | −39 | 0.82 | −0.03 | −25 |
| Shark | 0.76 | −0.02 | −32 | 0.46 | −0.01 | −37 | 0.63 | −0.02 | −33 | 0.28 | −0.01 | −39 |
| Undo_Dancer | 0.65 | −0.03 | −34 | 0.39 | −0.02 | −39 | 0.54 | −0.03 | −35 | 0.23 | −0.02 | −41 |
| GT_Fly | 0.89 | −0.03 | −40 | 0.62 | −0.01 | −35 | 0.72 | −0.02 | −41 | 0.41 | −0.01 | −36 |
| Poznan_Street | 1.23 | −0.06 | −31 | 1.23 | −0.03 | −26 | 1.31 | −0.05 | −31 | 0.89 | −0.02 | −26 |
| Poznan_Hall2 | 2.12 | −0.09 | −39 | 1.38 | −0.06 | −23 | 1.92 | −0.08 | −40 | 1.02 | −0.04 | −23 |
|
| ||||||||||||
| Average | 1.22 | −0.049 | −34.9 | 0.86 | −0.026 | −30.5 | 1.12 | −0.041 | −35.9 | 0.61 | −0.019 | −31.4 |
The proposed overall algorithm compared with a state-of-the-art fast intramode decision algorithm in [10].
| Sequences | C2 | C3 | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Overall algorithm | FDMMS | Overall algorithm | FDMMS | |||||||||
| BDBR (%) | BDPSNR (dB) | Dtime (%) | BDBR (%) | BDPSNR (dB) | Dtime (%) | BDBR (%) | BDPSNR (dB) | Dtime (%) | BDBR (%) | BDPSNR (dB) | Dtime (%) | |
| Kendo | 1.28 | −0.04 | −45 | 0.51 | −0.01 | −29 | 1.21 | −0.04 | −47 | 0.52 | −0.01 | −30 |
| Balloons | 1.53 | −0.05 | −43 | 0.54 | −0.02 | −30 | 1.47 | −0.05 | −44 | 0.49 | −0.01 | −30 |
| Newspaper | 1.64 | −0.08 | −46 | 0.89 | −0.03 | −22 | 1.62 | −0.07 | −48 | 0.79 | −0.02 | −23 |
| Shark | 0.79 | −0.04 | −44 | 0.36 | −0.01 | −25 | 0.72 | −0.03 | −45 | 0.35 | −0.01 | −26 |
| Undo_Dancer | 0.64 | −0.04 | −45 | 0.39 | −0.02 | −24 | 0.61 | −0.04 | −46 | 0.37 | −0.02 | −24 |
| GT_Fly | 0.88 | −0.03 | −46 | 0.42 | −0.01 | −23 | 0.86 | −0.03 | −49 | 0.38 | −0.01 | −23 |
| Poznan_Street | 1.97 | −0.07 | −41 | 0.35 | −0.02 | −28 | 1.91 | −0.06 | −42 | 0.27 | −0.02 | −29 |
| Poznan_Hall2 | 2.33 | −0.11 | −47 | 0.71 | −0.03 | −33 | 2.32 | −0.11 | −48 | 0.64 | −0.03 | −34 |
|
| ||||||||||||
| Average | 1.38 | −0.057 | −44.6 | 0.52 | −0.019 | −26.7 | 1.34 | −0.053 | −46.1 | 0.48 | −0.016 | −27.4 |
Figure 2Experimental results of “Poznan_Hall2” (1920 × 1088) under different QPs combinations for texture video and depth map (25, 34), (30, 39), (35, 42), and (40, 45).