Abstract: Tile-based rate adaptation can improve the quality of experience (QoE) of adaptive 360-degree video streaming under constrained network conditions. It remains challenging, however, because it requires both accurate prediction of users’ viewports and optimal bitrate allocation across tiles. In this paper, we propose RAPT360, a strategy for reinforcement learning-based Rate Adaptation with adaptive Prediction and Tiling for 360-degree video streaming, to address these challenges. Specifically, to improve the accuracy of state-of-the-art viewport prediction approaches, we fit a time-varying Laplace distribution-based probability density function to the prediction error for different prediction lengths. On this basis, we develop a viewport identification method that determines a user's viewport area depending on the buffer occupancy, such that the obtained viewport covers the real viewport at any given confidence level. We then propose a viewport-aware adaptive tiling scheme to improve bandwidth efficiency, in which three types of tile granularity are allocated according to the shape and position of the 2-D projection of that viewport. By establishing an adaptive streaming model and a QoE metric specific to 360-degree videos, we finally formulate rate adaptation for tile-based 360-degree video streaming as a non-linear discrete optimization problem that aims to maximize the long-term user QoE under a bandwidth-constrained network. To solve this problem efficiently, we model the rate adaptation logic as a Markov decision process (MDP) and employ a deep reinforcement learning (DRL)-based algorithm to dynamically learn the optimal bitrate allocation of tiles. Extensive experimental results show that RAPT360 achieves a gain of at least 1.47 dB in average chunk QoE, including a video quality improvement of at least 1.33 dB, compared with existing strategies for tile-based adaptive 360-degree video streaming.
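
As a rough illustration of the Laplace-based viewport identification step described in the abstract, the sketch below fits a Laplace distribution to one-dimensional prediction errors and widens the predicted viewport so that it covers the real one at a chosen confidence level. It is not the RAPT360 implementation: the function names, the sign convention for the error (actual minus predicted), the 100-degree field of view, and the use of a single angular dimension without wrap-around are all assumptions made for the example. In the paper's setting, one such fit would presumably be kept per prediction length.

```python
import math
import statistics


def fit_laplace(errors):
    """Maximum-likelihood Laplace fit.

    Location is the sample median; scale is the mean absolute
    deviation from that median.
    """
    mu = statistics.median(errors)
    b = sum(abs(e - mu) for e in errors) / len(errors)
    return mu, b


def coverage_margin(b, confidence):
    """Half-width m with P(|e - mu| <= m) = confidence for Laplace(mu, b).

    Follows from the Laplace CDF: P(|e - mu| <= m) = 1 - exp(-m / b).
    """
    if not 0.0 <= confidence < 1.0:
        raise ValueError("confidence must lie in [0, 1)")
    return -b * math.log(1.0 - confidence)


def viewport_bounds(predicted_center_deg, errors, confidence, fov_deg=100.0):
    """Return (low, high) angular bounds of the enlarged viewport.

    Assumes error = actual - predicted, so the fitted location shifts
    the predicted center. Angular wrap-around is ignored (assumption).
    """
    mu, b = fit_laplace(errors)
    margin = coverage_margin(b, confidence)
    center = predicted_center_deg + mu
    half_width = fov_deg / 2.0 + margin
    return center - half_width, center + half_width


if __name__ == "__main__":
    # Hypothetical past prediction errors in degrees for one prediction length.
    past_errors = [-4.0, -1.0, 0.0, 1.0, 4.0]
    print(viewport_bounds(0.0, past_errors, confidence=0.9))
```

A longer prediction length (that is, a fuller buffer) would typically yield a larger fitted scale and hence a wider margin, which is the intuition behind making the viewport area depend on buffer occupancy.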

360HRL: Hierarchical Reinforcement Learning Based Rate Adaptation for 360-Degree Video Streaming

360SRL: A Sequential Reinforcement Learning Approach for ABR Tile-Based 360 Video Streaming

Reinforcement Learning Based Rate Adaptation for 360-Degree Video Streaming

Deep Reinforcement Learning-based Rate Adaptation for Adaptive 360-Degree Video Streaming

Deep-Reinforcement-Learning-based User-Preference-Aware Rate Adaptation for Video Streaming

Sequential Reinforced 360-Degree Video Adaptive Streaming with Cross-user Attentive Network

DRL360: 360-Degree Video Streaming with Deep Reinforcement Learning

Deep Reinforcement Learning Based Adaptive 360-Degree Video Streaming with Field of View Joint Prediction

RAPT360: Reinforcement Learning-Based Rate Adaptation for 360-Degree Video Streaming with Adaptive Prediction and Tiling

Learning Tailored Adaptive Bitrate Algorithms to Heterogeneous Network Conditions: A Domain-Specific Priors and Meta-Reinforcement Learning Approach

PAAS: a preference-aware deep reinforcement learning approach for 360° video streaming

MADRL-Based Rate Adaptation for 360° Video Streaming with Multi-Viewpoint Prediction

Reinforcement Learning Based Adaptive Bitrate Algorithm For Transmitting Panoramic Videos

Improving Generalization for Neural Adaptive Video Streaming Via Meta Reinforcement Learning

DRL Empowered On-policy and Off-policy ABR for 5G Mobile Ultra-HD Video Delivery

TS360: A Two-Stage Deep Reinforcement Learning System for 360-Degree Video Streaming

Federated Deep Reinforcement Learning-based Bitrate Adaptation for Dynamic Adaptive Streaming over HTTP

PPO-ABR: Proximal Policy Optimization based Deep Reinforcement Learning for Adaptive BitRate streaming

Cross Layer Optimization and Distributed Reinforcement Learning for Wireless 360° Video Streaming

A Hierarchical Buffer Management Approach to Rate Adaptation for 360-Degree Video Streaming