Abstract:<p>With the rapid development of communication networks, effective video quality assessment (VQA) models that can guide video transmission and compression technologies are in high demand. This paper proposes a general-purpose full-reference VQA method that combines DenseNet with spatial pyramid pooling and RankNet, both to extract high-level distortion representations and global spatial information from samples and to characterize the temporal correlation among frames. First, a pretrained DenseNet is modified and fine-tuned to extract high-level features of distorted videos. Then, spatial pyramid pooling is added to the DenseNet module so that it can process inputs of arbitrary spatial resolution. Inputs with the same spatial resolution as the original distorted video are thus processed by the trained DenseNet to produce frame-level quality scores, which directly take the global spatial information of the video into account. Finally, learning to rank is introduced to explore the high-level temporal correlation of distorted videos, with RankNet serving as the temporal pooling function. Experimental results on two public VQA databases show that the proposed algorithm's predictions are consistent with human visual perception.</p>
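The abstract's claim that spatial pyramid pooling lets the network accept frames of arbitrary resolution can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function name `spatial_pyramid_pool`, the pyramid levels `(1, 2, 4)`, and the use of max pooling are all assumptions chosen for illustration. The idea is that each level splits the feature map into an n × n grid of bins and pools each bin, so the output length depends only on the channel count and the levels, not on the input height or width.

```python
import numpy as np

def spatial_pyramid_pool(feature_map, levels=(1, 2, 4)):
    """Max-pool a (C, H, W) feature map into a fixed-length vector.

    Hypothetical helper: the levels and the max operator are assumptions,
    not details taken from the paper.
    """
    channels, height, width = feature_map.shape
    pooled = []
    for n in levels:
        for i in range(n):
            # Integer floor/ceil bin edges, so bins cover the whole map
            # and are never empty, even when H is not divisible by n.
            r0 = (i * height) // n
            r1 = ((i + 1) * height + n - 1) // n
            for j in range(n):
                c0 = (j * width) // n
                c1 = ((j + 1) * width + n - 1) // n
                # One value per channel for this spatial bin.
                pooled.append(feature_map[:, r0:r1, c0:c1].max(axis=(1, 2)))
    # Output length is C * sum(n * n for n in levels), independent of H and W.
    return np.concatenate(pooled)

# Two feature maps with different spatial sizes yield vectors of equal length.
rng = np.random.default_rng(0)
small = spatial_pyramid_pool(rng.standard_normal((3, 17, 23)))
large = spatial_pyramid_pool(rng.standard_normal((3, 48, 64)))
```

Because both vectors have the same length, a fixed-size regression layer can follow the pooling step regardless of the frame resolution, which is what allows full-resolution frames to be scored without cropping or resizing.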

RIRNet: Recurrent-In-Recurrent Network for Video Quality Assessment

Human Visual Perception Based Image Quality Assessment for Video Prediction

RMT-BVQA: Recurrent Memory Transformer-based Blind Video Quality Assessment for Enhanced Video Content

Video quality assessment with dense features and ranking pooling

ReLaX-VQA: Residual Fragment and Layer Stack Extraction for Enhancing Video Quality Assessment

Blind Video Quality Assessment for Ultra-High-Definition Video Based on Super-Resolution and Deep Reinforcement Learning

Capturing Co-existing Distortions in User-Generated Content for No-reference Video Quality Assessment

HVS Revisited: A Comprehensive Video Quality Assessment Framework

Attention-Guided Neural Networks for Full-Reference and No-Reference Audio-Visual Quality Assessment

Blind Video Quality Prediction by Uncovering Human Video Perceptual Representation

Video Quality Assessment: A Comprehensive Survey

FAVER: Blind quality prediction of variable frame rate videos

Channel Recombination and Projection Network for Blind Image Quality Measurement

Highly Efficient No-reference 4K Video Quality Assessment with Full-Pixel Covering Sampling and Training Strategy

FOVQA: Blind Foveated Video Quality Assessment

No-reference quality assessment of variable frame-rate videos using temporal bandpass statistics

Evaluation of Retinal Image Quality Assessment Networks in Different Color-spaces

VQ-NeRV: A Vector Quantized Neural Representation for Videos

Convolutional Neural Networks for Video Quality Assessment

RankDVQA: Deep VQA based on Ranking-inspired Hybrid Training

NeRF-NQA: No-Reference Quality Assessment for Scenes Generated by NeRF and Neural View Synthesis Methods