Literature DB >> 31502973

Tunable VVC Frame Partitioning based on Lightweight Machine Learning.

Thomas Amestoy, Alexandre Mercat, Wassim Hamidouche, Daniel Menard, Cyril Bergeron.   

Abstract

Block partition structure is a critical module in video coding scheme to achieve significant gap of compression performance. Under the exploration of the future video coding standard, named Versatile Video Coding (VVC), a new Quad Tree Binary Tree (QTBT) block partition structure has been introduced. In addition to the QT block partitioning defined in High Efficiency Video Coding (HEVC) standard, new horizontal and vertical BT partitions are enabled, which drastically increases the encoding time compared to HEVC. In this paper, we propose a lightweight and tunable QTBT partitioning scheme based on a Machine Learning (ML) approach. The proposed solution uses Random Forest classifiers to determine for each coding block the most probable partition modes. To minimize the encoding loss induced by misclassification, risk intervals for classifier decisions are introduced in the proposed solution. By varying the size of risk intervals, tunable trade-off between encoding complexity reduction and coding loss is achieved. The proposed solution implemented in the JEM-7.0 software offers encoding complexity reductions ranging from 30average for only 0.7% to 3.0% Bjxntegaard Delta Rate (BDBR) increase in Random Access (RA) coding configuration, with very slight overhead induced by Random Forest. The proposed solution based on Random Forest classifiers is also efficient to reduce the complexity of the Multi-Type Tree (MTT) partitioning scheme under the VTM-5.0 software, with complexity reductions ranging from 25% to 61% in average for only 0.4% to 2.2% BD-BR increase.

Entities:  

Year:  2019        PMID: 31502973     DOI: 10.1109/TIP.2019.2938670

Source DB:  PubMed          Journal:  IEEE Trans Image Process        ISSN: 1057-7149            Impact factor:   10.856


  3 in total

1.  Fast Sample Adaptive Offset Jointly Based on HOG Features and Depth Information for VVC in Visual Sensor Networks.

Authors:  Ruyan Wang; Liuwei Tang; Tong Tang
Journal:  Sensors (Basel)       Date:  2020-11-26       Impact factor: 3.576

Review 2.  Machine Learning for Multimedia Communications.

Authors:  Nikolaos Thomos; Thomas Maugey; Laura Toni
Journal:  Sensors (Basel)       Date:  2022-01-21       Impact factor: 3.576

Review 3.  Complexity Analysis of a Versatile Video Coding Decoder over Embedded Systems and General Purpose Processors.

Authors:  Anup Saha; Miguel Chavarrías; Fernando Pescador; Ángel M Groba; Kheyter Chassaigne; Pedro L Cebrián
Journal:  Sensors (Basel)       Date:  2021-05-11       Impact factor: 3.576

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.