| Literature DB >> 36268535 |
Naouel Haggui1,2, Wassim Hamidouche1, Fatma Belghith2, Nouri Masmoudi2, Jean-François Nezan1.
Abstract
The emergence of the new video coding standard, Versatile Video Coding (VVC), has resulted in a 40-50% coding gain over its predecessor HEVC for the same visual quality. However, this is accompanied by a sharp increase in computational complexity. The emergence of the VVC standard and the increase in video resolution have exceeded the capacity of single-core architectures. This fact has led researchers to use multicore architectures for the implementation of video standards and to use the parallelism of these architectures for real-time applications. With the strong growth in both areas, video coding and multicore architecture, there is a great need for a design methodology that facilitates the exploration of heterogeneous multicore architectures, which automatically generates optimized code for these architectures in order to reduce time to market. In this context, this paper aims to use the methodology based on data flow modeling associated with the PREESM software. This paper shows how the software has been used to model a complete standard VVC video decoder using Parameterized and Interfaced Synchronous Dataflow (PiSDF) model. The proposed model takes advantage of the parallelism strategies of the OpenVVC decoder and in particular the tile-based parallelism. Experimental results show that the speed of the VVC decoder in PiSDF is slightly higher than the OpenVVC decoder handwritten in C/C++ languages, by up to 11% speedup on a 24-core processor. Thus, the proposed decoder outperforms the state-of-the-art dataflow decoders based on the RVC-CAL model.Entities:
Keywords: Dataflow modeling; OpenVVC; PiSDF; Tiles; VVC
Year: 2022 PMID: 36268535 PMCID: PMC9569024 DOI: 10.1007/s11265-022-01819-7
Source DB: PubMed Journal: J Signal Process Syst ISSN: 1939-8115
Figure 1VVC decoder block diagram.
Figure 2PiSDF model for the OpenVVC Decoder.
Figure 3The inside of the actor.
Benchmarks of UHD (38402160) sequences used in the experiment.
| BD-Rate | ||
|---|---|---|
| Sequence | 12 Tiles | 24 Tiles |
| DaylightRoad2 | 0.88% | 1.26% |
| Campfire | 0.84% | 1.33% |
| CatRobot | 0.91% | 1.76% |
| ParkRunning | 0.29% | 0.48% |
| Tango2 | 1.79% | 2.84% |
| FoodMarket | 1.32% | 2.13% |
Benchmarks of FHD (1920x1080) sequences used in the experiment.
| BD-Rate | ||
|---|---|---|
| Sequence | 12 Tiles | 24 Tiles |
| BQTerrace | 0.77% | 1.27% |
| Cactus | 1.08% | 1.79% |
| BasketballDrive | 1.80% | 3.08% |
| MarketPlace | 1.10% | 1.84% |
| RitualDance | 1.49% | 2.45% |
Figure 4Speed-up of the proposed model and of the OpenVVC decoder while using UHD sequences encoded with 12 tiles.
Figure 5Speed-up of the proposed model and of the OpenVVC decoder while using FHD sequences encoded with 12 tiles.
Figure 6Speed-up of the proposed model and of the OpenVVC decoder while using UHD sequences encoded with 24 tiles.
Figure 7Speed-up of the proposed model and of the OpenVVC decoder while using FHD sequences encoded with 24 tiles.
Figure 8Tiles partitioning for Class A ().
Figure 9Per tile decoding time in percentage for a frame of BQTerrace sequence.
Decoding time on single core of UHD sequences, QP = 22.
| Sequence | Decoding time (s) | ||
|---|---|---|---|
| VTM12.0 | OpenVVC | Dataflow model | |
| DaylightRoad2 | 42.62 ± 0.205 | 15.39 ± 0.375 | 15.10 ± 0.042 |
| Campfire | 34.32 ± 0.571 | 11.55 ± 0.405 | 11.65 ± 0.67 |
| CatRobot | 34.65 ± 0.265 | 11.15 ± 0.091 | 10.95 ± 0.04 |
| ParkRunning | 35.86 ± 0.078 | 12.97 ± 0.078 | 12.71 ± 0.046 |
| Tango2 | 23.66 ± 0.054 | 8.32 ± 0.053 | 8.07 ± 0.082 |
| FoodMarket | 21.49 ± 0.06 | 7.32 ± 0.059 | 7.19 ± 0.037 |