Abstract:In recent years, User Generated Content (UGC) has grown dramatically in video sharing applications. It is necessary for service-providers to use video quality assessment (VQA) to monitor and control users' Quality of Experience when watching UGC videos. However, most existing UGC VQA studies only focus on the visual distortions of videos, ignoring that the perceptual quality also depends on the accompanying audio signals. In this paper, we conduct a comprehensive study on UGC audio-visual quality assessment (AVQA) from both subjective and objective perspectives. Specially, we construct the first UGC AVQA database named SJTU-UAV database, which includes 520 in-the-wild UGC audio and video (A/V) sequences collected from the YFCC100m database. A subjective AVQA experiment is conducted on the database to obtain the mean opinion scores (MOSs) of the A/V sequences. To demonstrate the content diversity of the SJTU-UAV database, we give a detailed analysis of the SJTU-UAV database as well as other two synthetically-distorted AVQA databases and one authentically-distorted VQA database, from both the audio and video aspects. Then, to facilitate the development of AVQA fields, we construct a benchmark of AVQA models on the proposed SJTU-UAV database and other two AVQA databases, of which the benchmark models consist of AVQA models designed for synthetically distorted A/V sequences and AVQA models built through combining the popular VQA methods and audio features via support vector regressor (SVR). Finally, considering benchmark AVQA models perform poorly in assessing in-the-wild UGC videos, we further propose an effective AVQA model via jointly learning quality-aware audio and visual feature representations in the temporal domain, which is seldom investigated by existing AVQA models. Our proposed model outperforms the aforementioned benchmark AVQA models on the SJTU-UAV database and two synthetically distorted AVQA databases. The SJTU-UAV database and the c- de of the proposed model will be released to facilitate further research.

Perceptual Quality Assessment of Internet Videos.

From QoS to QoE: A Tutorial on Video Quality Assessment.

Predicting the Quality of Compressed Videos With Pre-Existing Distortions

Visual Quality Assessment for Web Videos

Video Quality Assessment: A Comprehensive Survey

User-generated Video Quality Assessment: A Subjective and Objective Study

Subjective and Objective Audio-Visual Quality Assessment for User Generated Content

UGC-VIDEO: Perceptual Quality Assessment of User-Generated Videos

Capturing Co-existing Distortions in User-Generated Content for No-reference Video Quality Assessment

Perceptual Quality Assessment of Virtual Reality Videos in the Wild

UGC-VQA: Benchmarking Blind Video Quality Assessment for User Generated Content

Video Quality Assessment Based on Measuring Perceptual Noise from Spatial and Temporal Perspectives.

Audio-Visual Quality Assessment for User Generated Content: Database and Method

StableVQA: A Deep No-Reference Quality Assessment Model for Video Stability

Towards Explainable In-the-Wild Video Quality Assessment: A Database and a Language-Prompted Approach

Subjective and Objective Quality Assessment of Colonoscopy Videos

KVQ: Kwai Video Quality Assessment for Short-form Videos

Deep Video Quality Assessment Using Constrained Multi-Task Regression and Spatio-temporal Feature Fusion.

XGC-VQA: A unified video quality assessment model for User, Professionally, and Occupationally-Generated Content

Perceptual Video Quality Assessment: A Survey