Abstract:Internet adaptive video streaming is a typical form of video delivery that leverages adaptive bitrate (ABR) algorithms to provide video services with high quality of experience (QoE) for various users in diverse and unique network conditions. Such heterogeneous network environments, which can be viewed as exogenous input processes, often lead to the unstable performance of ABR algorithms. Unfortunately, learning-based ABR algorithm which generated by state-of-the-art reinforcement learning (RL) technologies achieves good average performance but fails to perform well in all kinds of network conditions. In this work, considering the video playback process as the Input-driven Markov Decision Process (IMDP), we propose BR (Adaptation of ABR), a novel meta-RL ABR approach. BR is mainly composed of an online stage and an offline stage. It leverages meta-RL to learn an initial meta-policy with various network conditions at the offline stage and makes decisions in personalized network conditions at the online stage. At the same time, we continually optimize the meta-policy to the tailor-made ABR policy for varying the current network environment within few shots. Moreover, in order to improve the learning efficiency, we fully utilize domain knowledge for implementing a virtual player to replay the previously experienced network. Using trace-driven experiments on various scenarios including different vehicles, users, network types, and heterogeneous user-preferences, we show that BR outperforming recent ABR approaches with rapidly adapting to the personalized QoE metrics and specific network conditions. Testbed experimental results also illustrate the superiority of BR in adapting to the unseen environments.

Dancing with Shackles, Meet the Challenge of Industrial Adaptive Streaming Via Offline Reinforcement Learning

Deep-Reinforcement-Learning-based User-Preference-Aware Rate Adaptation for Video Streaming

Imitation Learning for Adaptive Video Streaming with Future Adversarial Information Bottleneck Principle

Edge-Cloud Collaborative Streaming Video Analytics with Multi-agent Deep Reinforcement Learning

Batch Adaptative Streaming for Video Analytics

Frame-Level Video Caching and Transmission Scheduling Via Stochastic Learning

Reinforcement Learning Based Rate Adaptation for 360-Degree Video Streaming

PAAS: a preference-aware deep reinforcement learning approach for 360° video streaming

Sequential Reinforced 360-Degree Video Adaptive Streaming with Cross-user Attentive Network

A Joint Online Transcoding and Delivery Approach for Dynamic Adaptive Streaming

DRL360: 360-Degree Video Streaming with Deep Reinforcement Learning

Adaptive Streaming Continuous Learning System for Video Analytics

Zwei: A Self-Play Reinforcement Learning Framework for Video Transmission Services

Deep Reinforced Bitrate Ladders for Adaptive Video Streaming.

Learning Accurate Network Dynamics for Enhanced Adaptive Video Streaming

Learning Tailored Adaptive Bitrate Algorithms to Heterogeneous Network Conditions: A Domain-Specific Priors and Meta-Reinforcement Learning Approach

Enhancement or Super-Resolution: Learning-based Adaptive Video Streaming with Client-Side Video Processing

Towards Optimal Real-time Volumetric Video Streaming: A Rolling Optimization and Deep Reinforcement Learning Based Approach

ShadowStream

Improving Quality of Experience by Adaptive Video Streaming with Super-Resolution

Self-play Reinforcement Learning for Video Transmission