Abstract:Tile-based rate adaption can improve the quality of experience (QoE) for adaptive 360-degree video streaming under constrained network conditions, which, however, is a challenging problem due to the requirements of accurate prediction for users’ viewports and optimal bitrate allocation for tiles. In this paper, we propose a strategy that deploys reinforcement learning-based Rate Adaptation with adaptive Prediction and Tiling for 360-degree video streaming, named RAPT360, to address these challenges. Specifically, to improve the accuracy of the state-of-the-art viewport prediction approaches, we fit the time-varying Laplace distribution-based probability density function of the prediction error for different prediction lengths. On the basis of that, we develop a viewport identification method to determine the viewport area of a user depending on the buffer occupancy, where the obtained viewport can cover the real viewport with any given probability confidence level. We then propose a viewport-aware adaptive tiling scheme to improve the bandwidth efficiency, where three types of tile granularities are allocated according to the shape and position of the 2-D projection of that viewport. By establishing an adaptive streaming model and QoE metric specific to 360-degree videos, we finally formulate the rate adaptation problem for tile-based 360-degree video streaming as a non-linear discrete optimization problem that targets at maximizing the long-term user QoE under a bandwidth-constrained network. To efficiently solve this problem, we model the rate adaptation logic as a Markov decision process (MDP) and employ the deep reinforcement learning (DRL)-based algorithm to dynamically learn the optimal bitrate allocation of tiles. Extensive experimental results show that RAPT360 achieves a performance gain of at least 1.47 dB on average chunk QoE, including a video quality improvement of at least 1.33 dB, in comparison to the existing strategies for tile-based adaptive 360-degree video streaming.

Deep Reinforcement Learning-based Rate Adaptation for Adaptive 360-Degree Video Streaming

QoE-Aware Dynamic Video Rate Adaptation

Deep-Reinforcement-Learning-based User-Preference-Aware Rate Adaptation for Video Streaming

Reinforcement Learning Based Rate Adaptation for 360-Degree Video Streaming

RAPT360: Reinforcement Learning-Based Rate Adaptation for 360-Degree Video Streaming with Adaptive Prediction and Tiling

360HRL: Hierarchical Reinforcement Learning Based Rate Adaptation for 360-Degree Video Streaming

DRL360: 360-Degree Video Streaming with Deep Reinforcement Learning

Deep Reinforcement Learning Based Adaptive 360-Degree Video Streaming with Field of View Joint Prediction

Perceptual Quality Aware Adaptive 360-Degree Video Streaming with Deep Reinforcement Learning

Deep Reinforcement Learning-Driven Intelligent Panoramic Video Bitrate Adaptation

Multiuser Video Streaming Rate Adaptation: A Physical Layer Resource-Aware Deep Reinforcement Learning Approach

Adaptive Streaming Algorithm Based on Reinforcement Learning

Federated Deep Reinforcement Learning-based Bitrate Adaptation for Dynamic Adaptive Streaming over HTTP

A Hierarchical Buffer Management Approach to Rate Adaptation for 360-Degree Video Streaming

Reinforcement Learning Based Adaptive Bitrate Algorithm For Transmitting Panoramic Videos

Off-Policy - Soft Actor-Critic-based Adaptive Streaming for 360-degree Video in Heterogeneous Wireless Networks.

MADRL-Based Rate Adaptation for 360° Video Streaming with Multi-Viewpoint Prediction

MADRL-Based Rate Adaptation for 360° Video Streaming With Multiviewpoint Prediction

Enhancing Neural Adaptive Wireless Video Streaming via Lower-Layer Information Exposure and Online Tuning

DeepVR: Deep Reinforcement Learning for Predictive Panoramic Video Streaming.

Soft Actor-Critic Algorithm for 360-Degree Video Streaming with Long-Term Viewport Prediction