Abstract: Internet adaptive video streaming is a common form of video delivery that relies on adaptive bitrate (ABR) algorithms to provide high quality of experience (QoE) for users under diverse and individual network conditions. Such heterogeneous network environments, which can be viewed as exogenous input processes, often make the performance of ABR algorithms unstable. In particular, learning-based ABR algorithms generated by state-of-the-art reinforcement learning (RL) techniques achieve good average performance but fail to perform well across all kinds of network conditions. In this work, we model the video playback process as an Input-driven Markov Decision Process (IMDP) and propose A²BR (Adaptation of ABR), a novel meta-RL ABR approach. A²BR consists of an offline stage and an online stage. In the offline stage, it uses meta-RL to learn an initial meta-policy across various network conditions; in the online stage, it makes decisions under each user's personalized network conditions. At the same time, it continually optimizes the meta-policy into an ABR policy tailored to the current network environment within a few shots. Moreover, to improve learning efficiency, we use domain knowledge to implement a virtual player that replays previously experienced network conditions. Using trace-driven experiments on scenarios that include different vehicles, users, network types, and heterogeneous user preferences, we show that A²BR outperforms recent ABR approaches by rapidly adapting to personalized QoE metrics and specific network conditions. Testbed experiments further illustrate the superiority of A²BR in adapting to unseen environments.
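
The offline/online split described in the abstract can be illustrated with a deliberately small sketch. It is not the authors' algorithm: it swaps the RL policy for a one-parameter throughput estimator and uses a Reptile-style first-order meta-update as a stand-in for meta-RL. All names (`sample_throughput`, `adapt`, `meta_train`), the Gaussian throughput model, and the hyperparameters are assumptions made for illustration only.

```python
import random

def sample_throughput(mean_mbps, rng):
    """Draw one noisy throughput sample (Mbps) for a hypothetical network condition."""
    return max(0.1, rng.gauss(mean_mbps, 0.2))

def adapt(theta, mean_mbps, rng, steps=5, lr=0.3):
    """Few-shot inner loop: a handful of SGD steps on the squared error
    between the estimate theta and freshly observed throughput samples."""
    for _ in range(steps):
        sample = sample_throughput(mean_mbps, rng)
        grad = 2.0 * (theta - sample)   # d/dtheta of (theta - sample)^2
        theta -= lr * grad
    return theta

def meta_train(task_means, rng, iterations=500, meta_lr=0.05):
    """Offline stage (toy): Reptile-style update that nudges the shared
    initialization toward the parameter adapted on a randomly chosen task."""
    theta = 0.0
    for _ in range(iterations):
        mean_mbps = rng.choice(task_means)
        adapted = adapt(theta, mean_mbps, rng)
        theta += meta_lr * (adapted - theta)
    return theta

rng = random.Random(0)

# Offline: learn an initialization across several assumed network conditions.
train_means = [1.0, 2.0, 3.0, 4.0, 5.0]
meta_theta = meta_train(train_means, rng)

# Online: adapt to a condition that was not in the training set, using 5 samples.
unseen_mean = 6.0
online_theta = adapt(meta_theta, unseen_mean, rng)

print("meta initialization:", round(meta_theta, 3))
print("error before adaptation:", round(abs(meta_theta - unseen_mean), 3))
print("error after 5-shot adaptation:", round(abs(online_theta - unseen_mean), 3))
```

In this toy setting the meta-learned initialization settles near the middle of the training conditions, and a few online steps move it close to the unseen condition. A real ABR policy would replace the scalar estimator with a neural policy and the squared error with a QoE-based RL objective.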

From Ember to Blaze: Swift Interactive Video Adaptation via Meta-Reinforcement Learning

QoE-Aware Dynamic Video Rate Adaptation

Deep-Reinforcement-Learning-based User-Preference-Aware Rate Adaptation for Video Streaming

MetaLive: Meta-Reinforcement Learning Based Collective Bitrate Adaptation for Multi-Party Live Streaming

Learning Tailored Adaptive Bitrate Algorithms to Heterogeneous Network Conditions: A Domain-Specific Priors and Meta-Reinforcement Learning Approach

MetaABR: A Meta-Learning Approach on Adaptative Bitrate Selection for Video Streaming

Improving Generalization for Neural Adaptive Video Streaming Via Meta Reinforcement Learning

Multiuser Video Streaming Rate Adaptation: A Physical Layer Resource-Aware Deep Reinforcement Learning Approach

An Intelligent Learning Approach to Achieve Near-Second Low-Latency Live Video Streaming under Highly Fluctuating Networks

Federated Deep Reinforcement Learning-based Bitrate Adaptation for Dynamic Adaptive Streaming over HTTP

Learning-Based Online QoE Optimization in Multi-Agent Video Streaming

Personalized 360-Degree Video Streaming: A Meta-Learning Approach

Personalized 360-Degree Video Streaming

Deep Reinforcement Learning-based Rate Adaptation for Adaptive 360-Degree Video Streaming

Preference-Aware Dynamic Bitrate Adaptation for Mobile Short-Form Video Feed Streaming

A Meta-Learning Framework for Learning Multi-User Preferences in QoE Optimization of DASH

Imitation Learning for Adaptive Video Streaming with Future Adversarial Information Bottleneck Principle

Optimizing QoE of Multiple Users over DASH: A Meta-learning Approach

FedABR: A Personalized Federated Reinforcement Learning Approach for Adaptive Video Streaming

Intelligent Video Streaming at Network Edge: An Attention-Based Multiagent Reinforcement Learning Solution