Abstract:Internet adaptive video streaming is a typical form of video delivery that leverages adaptive bitrate (ABR) algorithms to provide video services with high quality of experience (QoE) for various users in diverse and unique network conditions. Such heterogeneous network environments, which can be viewed as exogenous input processes, often lead to the unstable performance of ABR algorithms. Unfortunately, learning-based ABR algorithm which generated by state-of-the-art reinforcement learning (RL) technologies achieves good average performance but fails to perform well in all kinds of network conditions. In this work, considering the video playback process as the Input-driven Markov Decision Process (IMDP), we propose BR (Adaptation of ABR), a novel meta-RL ABR approach. BR is mainly composed of an online stage and an offline stage. It leverages meta-RL to learn an initial meta-policy with various network conditions at the offline stage and makes decisions in personalized network conditions at the online stage. At the same time, we continually optimize the meta-policy to the tailor-made ABR policy for varying the current network environment within few shots. Moreover, in order to improve the learning efficiency, we fully utilize domain knowledge for implementing a virtual player to replay the previously experienced network. Using trace-driven experiments on various scenarios including different vehicles, users, network types, and heterogeneous user-preferences, we show that BR outperforming recent ABR approaches with rapidly adapting to the personalized QoE metrics and specific network conditions. Testbed experimental results also illustrate the superiority of BR in adapting to the unseen environments.

Pioneer: Offline Reinforcement Learning Based Bandwidth Estimation for Real-Time Communication.

Reinforcement learning for bandwidth estimation and congestion control in real-time communications

Offline to Online Learning for Real-Time Bandwidth Estimation

WiEdge: Edge Computing for Audio Sensing Applications with Accurate Wireless Link Prediction.

ACM MMSys 2024 Bandwidth Estimation in Real Time Communications Challenge

Learning Tailored Adaptive Bitrate Algorithms to Heterogeneous Network Conditions: A Domain-Specific Priors and Meta-Reinforcement Learning Approach

Streetwise Agents: Empowering Offline RL Policies to Outsmart Exogenous Stochastic Disturbances in RTC

Offline and Distributional Reinforcement Learning for Radio Resource Management

Balancing Generalization and Specialization: Offline Metalearning for Bandwidth Estimation

Enhancing Low Latency Adaptive Live Streaming Through Precise Bandwidth Prediction

How Often Channel Estimation is Required for Adaptive IRS Beamforming: A Bilevel Deep Reinforcement Learning Approach

A Novel Deep Reinforcement Learning Architecture for Dynamic Power and Bandwidth Allocation in Multibeam Satellites

Offline Reinforcement Learning for Wireless Network Optimization with Mixture Datasets

Data-driven Bandwidth Adaptation for Radio Access Network Slices

BAE-Net: A Low complexity and high fidelity Bandwidth-Adaptive neural network for speech super-resolution

QoE-based Deep Reinforcement Learning for Resource Allocation in Real Time XR Video Transmission

DRL-based Resource Allocation in Remote State Estimation

Enabling Robust DRL-Driven Networking Systems Via Teacher-Student Learning

Communication Scheduling by Deep Reinforcement Learning for Remote Traffic State Estimation with Bayesian Inference

Reinforcement Learning Agent Design and Optimization with Bandwidth Allocation Model

Realtime mobile bandwidth prediction using LSTM neural network and Bayesian fusion