Abstract:Omnidirectional images provide an immersive viewing experience in a Virtual Reality (VR) environment, surpassing the limitations of traditional 2D media beyond the conventional screen. This VR technology allows users to interact with visual information in an exciting and engaging manner. However, the storage and transmission requirements for 360-degree panoramic images are substantial, leading to the establishment of compression frameworks. Unfortunately, these frameworks introduce projection distortion and compression artifacts. With the rapid growth of VR applications, it becomes crucial to investigate the quality of the perceptible omnidirectional experience and evaluate the extent of visual degradation caused by compression. In this regard, viewport plays a significant role in omnidirectional image quality assessment (OIQA), as it directly affects the user's perceived quality and overall viewing experience. Extracting viewports compatible with users viewing behavior plays a crucial role in OIQA. Different users may focus on different regions, and the model's performance may be sensitive to the chosen viewport extraction strategy. Improper selection of viewports could lead to biased quality predictions. Instead of assessing the entire image, attention can be directed to areas that are more importance to the overall quality. Feature extraction is vital in OIQA as it plays a significant role in representing image content that aligns with human perception. Taking this into consideration, the proposed ATtention enabled VIewport Selection (ATVIS-OIQA) employs attention based view port selection with Vision Transformers(ViT) for feature extraction. Furthermore, the spatial relationship between the viewports is established using graph convolution, enabling intuitive prediction of the objective visual quality of omnidirectional images. The effectiveness of the proposed model is demonstrated by achieving state-of-the-art results on publicly available benchmark datasets, namely OIQA and CVIQD.

Viewport Proposal CNN for 360° Video Quality Assessment

Viewport-based CNN: A Multi-task Approach for Assessing 360° Video Quality

360° video quality assessment based on saliency-guided viewport extraction

Panoramic Video Quality Assessment Based on Non-Local Spherical CNN

Viewport-Sphere-Branch Network for Blind Quality Assessment of Stitched 360° Omnidirectional Images

A Spherical Convolution Approach for Learning Long Term Viewport Prediction in 360 Immersive Video

MC360IQA: THE MULTI-CHANNEL CNN FOR BLIND 360-DEGREE IMAGE QUALITY ASSESSMENT

C3DVQA: Full-Reference Video Quality Assessment with 3D Convolutional Neural Network

Saliency Prediction Network for $360^\circ$ Videos

Saliency and Depth-aware Full Reference 360-degree Image Quality Assessment

VSOIQE: A Novel Viewport-Based Stitched 360° Omnidirectional Image Quality Evaluator

Blind Omnidirectional Image Quality Assessment with Viewport Oriented Graph Convolutional Networks

Omnidirectional Video Quality Assessment With Causal Intervention

VMP360

Predicting 360° Video Saliency: A ConvLSTM Encoder-Decoder Network with Spatio-temporal Consistency

Stereoscopic Video Quality Assessment Based on 3D Convolutional Neural Networks

Multi-feature 360 Video Quality Estimation

CaV3: Cache-assisted Viewport Adaptive Volumetric Video Streaming

HVS Revisited: A Comprehensive Video Quality Assessment Framework

Attention enabled viewport selection with graph convolution for omnidirectional visual quality assessment

Towards Low Latency Multi-viewpoint 360° Interactive Video: A Multimodal Deep Reinforcement Learning Approach