Abstract:The emerging mobile edge computing (MEC) technology has been recently applied to improve the Quality of Experience (QoE) of network services, such as live video streaming. In this paper, we study an energy-aware adaptive live streaming scheme in wireless edge networks. In particular, we aim to design a joint uplink transmission and edge transcoding algorithm maximizing the video followers’ QoE, while minimizing the energy consumption of the video streamer. We formulate the problem as a Markov decision process (MDP), and propose a deep reinforcement learning (DRL) based framework, named SACCT, to determine the streamer’s encoding bitrate, the uploading power as well as the edge transcoding bitrates and frequency. We decompose the MDP problem into inter-frame and intra-frame problems to address the key design challenges that arise from continuous-discrete hybrid action space, time-varying state and action spaces, and unknown network variation. By doing so, SACCT integrates model-based optimization and model-free DRL to determine the intra-frame continuous resource allocation decisions and the inter-frame discrete bitrate adaptation decisions, respectively. To integrate both the numerical features (e.g., channel gain) and the categorical features (e.g., bitrate), we propose a communication Transformer (CT) as a backbone of SACCT by representing network states as communication tokens and running Transformers to model multi-scale dependencies. Extensive simulations manifest that compared with state-of-the-art approaches, SACCT can provide 128.23% (on average) extra reward. As such, by leveraging joint uplink adaption and edge transcoding, the proposed scheme enables an intelligent wireless network edge with QoE-assured and energy-aware live streaming services.

Off-Policy - Soft Actor-Critic-based Adaptive Streaming for 360-degree Video in Heterogeneous Wireless Networks.

DRL360: 360-Degree Video Streaming with Deep Reinforcement Learning

Cross Layer Optimization and Distributed Reinforcement Learning for Wireless 360° Video Streaming

Deep-Reinforcement-Learning-based User-Preference-Aware Rate Adaptation for Video Streaming

PAAS: a preference-aware deep reinforcement learning approach for 360° video streaming

Sequential Reinforced 360-Degree Video Adaptive Streaming with Cross-user Attentive Network

Robust Saliency-Driven Quality Adaptation for Mobile 360-Degree Video Streaming

Reinforcement Learning Based Rate Adaptation for 360-Degree Video Streaming

SMDP-based Resource Allocation for Video Streaming in Cognitive Vehicular Networks.

QoE-based Deep Reinforcement Learning for Resource Allocation in Real Time XR Video Transmission

QoE-Oriented Resource Allocation for 360-degree Video Transmission over Heterogeneous Networks

MADRL-Based Rate Adaptation for 360° Video Streaming With Multiviewpoint Prediction

Edge-Cloud Collaborative Streaming Video Analytics with Multi-agent Deep Reinforcement Learning

Multiuser Video Streaming Rate Adaptation: A Physical Layer Resource-Aware Deep Reinforcement Learning Approach

Toward High-Quality Low-Latency 360° Video Streaming With Edge–Client Collaborative Caching and Super-Resolution

Quality-Driven Adaptive Video Streaming for Cognitive VANETs

Enhancement or Super-Resolution: Learning-based Adaptive Video Streaming with Client-Side Video Processing

Resource Allocation for Video Streaming in Heterogeneous Cognitive Vehicular Networks.

MADRL-Based Rate Adaptation for 360° Video Streaming with Multi-Viewpoint Prediction

Pareto-Optimal Multi-Agent Cooperative Caching Relying on Multi-Policy Reinforcement Learning

Deep Reinforcement Learning With Communication Transformer for Adaptive Live Streaming in Wireless Edge Networks