| Literature DB >> 35898027 |
Liqiang He1, Shuhua Xiong1, Ruolan Yang1, Xiaohai He1, Honggang Chen1.
Abstract
Despite the fact that Versatile Video Coding (VVC) achieves a superior coding performance to High-Efficiency Video Coding (HEVC), it takes a lot of time to encode video sequences due to the high computational complexity of the tools. Among these tools, Multiple Transform Selection (MTS) require the best of several transforms to be obtained using the Rate-Distortion Optimization (RDO) process, which increases the time spent video encoding, meaning that VVC is not suited to real-time sensor application networks. In this paper, a low-complexity multiple transform selection, combined with the multi-type tree partition algorithm, is proposed to address the above issue. First, to skip the MTS process, we introduce a method to estimate the Rate-Distortion (RD) cost of the last Coding Unit (CU) based on the relationship between the RD costs of transform candidates and the correlation between Sub-Coding Units' (sub-CUs') information entropy under binary splitting. When the sum of the RD costs of sub-CUs is greater than or equal to their parent CU, the RD checking of MTS will be skipped. Second, we make full use of the coding information of neighboring CUs to terminate MTS early. The experimental results show that, compared with the VVC, the proposed method achieves a 26.40% reduction in time, with a 0.13% increase in Bjøontegaard Delta Bitrate (BDBR).Entities:
Keywords: CU partition; fast intra-coding; multiple transform selection; real-time sensor networks; versatile video coding
Mesh:
Substances:
Year: 2022 PMID: 35898027 PMCID: PMC9331267 DOI: 10.3390/s22155523
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.847
The mapping relationship between MTS indexes and transforms.
| MTS Candidate Indexes | Horizontal | Vertical |
|---|---|---|
| 0 | DST-VII | DST-VII |
| 1 | DST-VII | DCT-VIII |
| 2 | DCT-VIII | DST-VII |
| 3 | DCT-VIII | DCT-VIII |
The probability of using the same optimal transform in two adjacent sub-CUs under binary splitting.
| Video Sequences | Probability |
|---|---|
| BasketballPass | 94.7% |
| RaceHorsesC | 93.6% |
| Johnny | 96.7% |
| BasketballDrive | 95.5% |
| Average | 95.1% |
Figure 1The relationship between RD cost ratio and entropy ratio of adjacent sub-CUs under binary splitting.
Figure 2Illustration of location between the current CU and neighbouring CUs.
The probability of being able to use the same optimal transform between the current CU and its neighboring CUs.
| Video Sequences |
|
|
|---|---|---|
| BasketballPass | 88.5% | 78.6% |
| RaceHorsesC | 80.6% | 82.7% |
| Johnny | 82.3% | 79.3% |
| BasketballDrive | 85.6% | 80.5% |
| Average | 84.3% | 80.3% |
The environments and conditions of simulation.
| Items | Descriptions |
|---|---|
| Software | VTM-3.0 |
| Configuration File | encoder intra vtm.cfg |
| Video Sequence Size | 416 × 240, 832 × 480, |
| 1280 × 720, 1920 × 1080 | |
| Number of Encoded Frames | 30 |
| Quantization Parameter (QP) | 22, 27, 32 and 37 |
| Sampling of Luminance to Chrominance | 4:2:0 |
Detailed characteristics of the experimental video sequences.
| Class | Sequences | Size | Bit-Depth | Frame Rate |
|---|---|---|---|---|
| BasketballDrive | 1920 × 1280 | 8 | 50 | |
| B | BQTerrace | 1920 × 1280 | 8 | 60 |
| Cactus | 1920 × 1280 | 8 | 50 | |
| BasketballDrill | 832 × 480 | 8 | 50 | |
| C | BQMall | 832 × 480 | 8 | 60 |
| PartyScene | 832 × 480 | 8 | 50 | |
| BasketballPass | 416 × 240 | 8 | 50 | |
| D | BlowingBubbles | 416 × 240 | 8 | 50 |
| RaceHorses | 416 × 240 | 8 | 30 | |
| FourPeople | 1280 × 720 | 8 | 60 | |
| E | Johhny | 1280 × 720 | 8 | 60 |
| KristenAndSara | 1280 × 720 | 8 | 60 | |
| Slideshow | 1280 × 720 | 8 | 20 | |
| F | SlideEditing | 1280 × 720 | 8 | 30 |
| BasketballDrillText | 832 × 480 | 8 | 50 |
The proposed algorithm compared to the original VVC experimental results.
| Class | Sequences | BDBR/% | BD-PSNR/db | |
|---|---|---|---|---|
| BasketballDrive | 0.08 | −0.007 | 29.16 | |
| B | BQTerrace | 0.10 | −0.004 | 28.02 |
| Cactus | 0.15 | −0.008 | 26.06 | |
| BasketballDrill | 0.12 | −0.007 | 27.42 | |
| C | BQMall | 0.09 | −0.003 | 24.53 |
| PartyScene | 0.10 | −0.009 | 29.30 | |
| BasketballPass | 0.15 | −0.007 | 24.18 | |
| D | BlowingBubbles | 0.12 | −0.008 | 27.09 |
| RaceHorses | 0.14 | −0.007 | 25.13 | |
| FourPeople | 0.16 | −0.006 | 26.89 | |
| E | Johhny | 0.14 | −0.007 | 25.47 |
| KristenAndSara | 0.11 | −0.005 | 29.40 | |
| Slideshow | 0.18 | −0.009 | 24.64 | |
| F | SlideEditing | 0.14 | −0.008 | 22.35 |
| BasketballDrillText | 0.13 | −0.008 | 26.42 | |
| Average | - | 0.13 | −0.007 | 26.40 |
The proposed algorithm compared to the state-of-the-art experimental results.
| Sequences | Fu et al. [ | Zhang et al. [ | Proposed | |||
|---|---|---|---|---|---|---|
|
|
|
|
|
|
| |
| Cactus | 0.18 | 23 | −0.01 | 10 | 0.15 | 26.06 |
| BQTerrace | 0.12 | 25 | 8 | 0.10 | 28.02 | |
| BasketballDrive | 0.09 | 23 | 8 | 0.08 | 29.16 | |
| BQMall | 0.11 | 24 | 0.02 | 3 | 0.09 | 24.53 |
| PartyScene | 0.16 | 25 | 5 | 0.10 | 29.30 | |
| BasketballDrill | 0.14 | 21 | 9 | 0.12 | 27.42 | |
| BasketballPass | 0.19 | 23 | 0.06 | 7 | 0.15 | 24.18 |
| BlowingBubbles | 0.17 | 24 | 6 | 0.12 | 27.09 | |
| RaceHorses | 0.16 | 23 | 1 | 0.14 | 25.13 | |
| FourPeople | 0.22 | 23 | 0.03 | 7 | 0.16 | 26.89 |
| KristenAndSara | 0.19 | 23 | 10 | 0.11 | 29.40 | |
| Johnny | 0.2 | 22 | 9 | 0.14 | 25.47 | |
| Average | 0.16 | 23.30 | 0.03 | 6.92 | 0.12 | 26.89 |
Figure 3The R-D curves of sequences “BlowingBubbles” (Class D) and “BasketballDrill” (Class C) under AI configuration. (a) BlowingBubbles; (b) BasketballDrill.
Figure 4Subjective quality comparison of the first decoding frame of “BasketballPass” from Class D. (a) original VVC; (b) proposed algorithm.