Literature DB >> 32324554

Study of Subjective and Objective Quality Assessment of Audio-Visual Signals.

Xiongkuo Min, Guangtao Zhai, Jiantao Zhou, Mylene C Q Farias, Alan Conrad Bovik.   

Abstract

The topics of visual and audio quality assessment (QA) have been widely researched for decades, yet nearly all of this prior work has focused only on single-mode visual or audio signals. However, visual signals rarely are presented without accompanying audio, including heavy-bandwidth video streaming applications. Moreover, the distortions that may separately (or conjointly) afflict the visual and audio signals collectively shape user-perceived quality of experience (QoE). This motivated us to conduct a subjective study of audio and video (A/V) quality, which we then used to compare and develop A/V quality measurement models and algorithms. The new LIVE-SJTU Audio and Video Quality Assessment (A/V-QA) Database includes 336 A/V sequences that were generated from 14 original source contents by applying 24 different A/V distortion combinations on them. We then conducted a subjective A/V quality perception study on the database towards attaining a better understanding of how humans perceive the overall combined quality of A/V signals. We also designed four different families of objective A/V quality prediction models, using a multimodal fusion strategy. The different types of A/V quality models differ in both the unimodal audio and video quality prediction models comprising the direct signal measurements and in the way that the two perceptual signal modes are combined. The objective models are built using both existing state-of-the-art audio and video quality prediction models and some new prediction models, as well as quality-predictive features delivered by a deep neural network. The methods of fusing audio and video quality predictions that are considered include simple product combinations as well as learned mappings. Using the new subjective A/V database as a tool, we validated and tested all of the objective A/V quality prediction models. We will make the database publicly available to facilitate further research.

Entities:  

Year:  2020        PMID: 32324554     DOI: 10.1109/TIP.2020.2988148

Source DB:  PubMed          Journal:  IEEE Trans Image Process        ISSN: 1057-7149            Impact factor:   10.856


  3 in total

1.  Cross-Domain Feature Similarity Guided Blind Image Quality Assessment.

Authors:  Chenxi Feng; Long Ye; Qin Zhang
Journal:  Front Neurosci       Date:  2022-01-14       Impact factor: 4.677

2.  Editorial: Computational Neuroscience for Perceptual Quality Assessment.

Authors:  Xiongkuo Min; Ke Gu; Lu Zhang; Vinit Jakhetiya; Guangtao Zhai
Journal:  Front Neurosci       Date:  2022-03-28       Impact factor: 4.677

3.  Neural Network Model for Perceptual Evaluation of Product Modelling Design Based on Multimodal Image Recognition.

Authors:  Jie Wu; Long Jia
Journal:  Comput Intell Neurosci       Date:  2022-08-09
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.