| Literature DB >> 26963813 |
Pallab Kanti Podder1, Manoranjan Paul1, Manzur Murshed2.
Abstract
The emerging High Efficiency Video Coding (HEVC) standard introduces a number of innovative and powerful coding tools to acquire better compression efficiency compared to its predecessor H.264. The encoding time complexities have also increased multiple times that is not suitable for realtime video coding applications. To address this limitation, this paper employs a novel coding strategy to reduce the time complexity in HEVC encoder by efficient selection of appropriate block-partitioning modes based on human visual features (HVF). The HVF in the proposed technique comprise with human visual attention modelling-based saliency feature and phase correlation-based motion features. The features are innovatively combined through a fusion process by developing a content-based adaptive weighted cost function to determine the region with dominated motion/saliency (RDMS)- based binary pattern for the current block. The generated binary pattern is then compared with a codebook of predefined binary pattern templates aligned to the HEVC recommended block-paritioning to estimate a subset of inter-prediction modes. Without exhaustive exploration of all modes available in the HEVC standard, only the selected subset of modes are motion estimated and motion compensated for a particular coding unit. The experimental evaluation reveals that the proposed technique notably down-scales the average computational time of the latest HEVC reference encoder by 34% while providing similar rate-distortion (RD) performance for a wide range of video sequences.Entities:
Mesh:
Year: 2016 PMID: 26963813 PMCID: PMC4786253 DOI: 10.1371/journal.pone.0150673
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Process diagram of the proposed mode selection technique.
Subset of mode selection at 16×16 and 8×8 coding depth level based on the patterns of RDMS.
| Pattern of RDMS at 16×16 and 8×8 Block Level | Selected Subset of Inter-modes | |||||
|---|---|---|---|---|---|---|
| 16×8, 8×16 or 8×8 | ||||||
| 16×8, 8×16 or 8×8 | ||||||
| 16×8 and 8×16 | ||||||
| 16×8 and 8×16 | ||||||
| 16×8 and 8×16 | ||||||
| 16×12, 16×4, 12×16 or 4×16 | ||||||
Performance comparison of different fast mode selection algorithms compared to HEVC encoder in terms of BD-Bit Rate (BD-BR), BD-PSNR and computational time.
| Algorithms | BD-BR(%) | BD-PSNR (dB) | Average Time Savings (%) | Remarks |
|---|---|---|---|---|
| 5.70 | - 0.80 | 50 | 03 videos | |
| 0.60 | - 0.01 | 38 | 15 videos | |
| 0.50 | - 0.08 | 30 | 17 videos | |
| 0.32 | - 0.11 | 35 | 19 videos | |
| 1.20 | - | 46 | 18 videos | |
| 0.13 | - 0.01 | 32 | 12 videos (SVS) | |
| 0.10 | -0.01 | 34 | 14 videos(SCVS) |
Mode selection at 32×32 block level based on the codebook of predefined binary pattern templates.
| RDMS Based Templates at 32×32 Block Level | Selection of Modes at 32×32 Block Level |
|---|---|
| Skip or Inter 32×32 | |
| Intra 16×16 or Inter 16×16 | |
| Inter {32×16 or 16×16} | |
| Inter {32×8 or 16×16} | |
| Inter {32×8} | |
| Inter {32×24 or 16×16} | |
| Inter {32×24} | |
| Inter {16×32 or 16×16} | |
| Inter {8×32 or 16×16} | |
| Inter {8×32} | |
| Inter {24×32 or 16×16} | |
| Inter {24×32} |
A theoretical analysis- percentage of time savings by the proposed method (against HM) for different sequence types based on average number of Inter-modes selected per CU.
| Name of the Sequences | Average no. of inter-modes selected per CU by HM | Average no. of inter-modes selected per CU by proposed method | Average Percentage (%) of time saving |
|---|---|---|---|
| 20.15 | 11.07 | 45.06 | |
| 18.05 | 10.26 | 43.15 | |
| Average time saving for SD type video | 44.10 | ||
| 20.68 | 12.34 | 40.32 | |
| 19.21 | 11.39 | 40.70 | |
| Average time saving for HD type video | 40.51 | ||
| 20.66 | 13.16 | 36.30 | |
| 18.12 | 12.04 | 33.55 | |
| Average time saving for MV type video | 34.92 | ||
| Average time saving for the standard video sequences (SVS) | 39.85 | ||
| 21.05 | 12.23 | 41.90 | |
| 18.56 | 12.18 | 34.37 | |
| 19.78 | 11.29 | 42.92 | |
| 18.96 | 10.13 | 46.57 | |
| Average time saving for the standard class video sequences (SCVS) | 41.44 | ||
Performance comparison of proposed technique compared to HM12.1 using BD-BR and BD-PSNR for the SVS.
| Sequence Resolutions | Name of the Sequences | BD-PSNR(dB) | BD-BR(%) |
|---|---|---|---|
| -0.01 | 0.08 | ||
| -0.01 | 0.14 | ||
| -0.01 | 0.12 | ||
| -0.01 | 0.11 | ||
| -0.01 | 0.10 | ||
| -0.01 | 0.14 | ||
| -0.01 | 0.11 | ||
| -0.01 | 0.15 | ||
| -0.02 | 0.21 | ||
| -0.02 | 0.16 | ||
| -0.01 | 0.14 | ||
| -0.02 | 0.16 | ||
| -0.01 | 0.13 | ||
| -0.01 | 0.11 | ||
| -0.01 | 0.01 | ||
| -0.01 | 0.13 | ||
Performance comparison of proposed technique compared to HM12.1 using BD-BR and BD-PSNR for the SCVS.
| Class | Sequence Resolutions | Name of the Sequences | BD-PSNR(dB) | BD-BR(%) |
|---|---|---|---|---|
| 2560×1600 | -0.01 | 0.14 | ||
| 1920×1080 | -0.01 | 0.11 | ||
| 1920×1080 | 0.00 | 0.04 | ||
| 1920×1080 | -0.02 | 0.18 | ||
| 1920×1080 | 0.00 | 0.07 | ||
| 832×480 | -0.01 | 0.10 | ||
| 832×480 | 0.00 | 0.01 | ||
| 416×240 | BasketballPass | -0.02 | 0.17 | |
| 416×240 | BlowingBubbles | -0.01 | 0.09 | |
| 416×240 | BQSquare | -0.01 | 0.13 | |
| 416×240 | Flowervase | 0.00 | 0.07 | |
| 1280×720 | Vidyo1 | -0.01 | 0.10 | |
| 1280×720 | Vidyo3 | -0.01 | 0.15 | |
| 1280×720 | Vidyo4 | -0.01 | 0.09 | |
| -0.01 | 0.10 | |||