RAV: Learning-Based Adaptive Streaming to Coordinate the Audio and Video Bitrate Selections

Weihe Li,Jiawei Huang,Wenjun Lyu,Baoshen Guo,Wanchun Jiang,Jianxin Wang
DOI: https://doi.org/10.1109/tmm.2022.3198013
IF: 7.3
2023-01-01
IEEE Transactions on Multimedia
Abstract:Most commercial players adopt adaptive bitrate (ABR) algorithms to dynamically decide each chunk's bitrate based on the perceived network bandwidth and buffer occupancy. However, current ABR algorithms are agnostic of audio bitrate selection since they deem it has negligible influence on video bitrate selection due to small size of audio chunks. Nevertheless, with the development of audio technologies, the bitrate of audio content increases dramatically in recent years. Thus, inappropriate audio selection can significantly affect video selection and deteriorate the viewing experience. To tackle these inefficiencies, we propose a deep R einforcement learning-based ABR algorithm that takes A udio and V ideo quality into account (RAV) to circumvent a series of suboptimal performances, like low playback quality, frequent playback interruptions, poor playback smoothness, and undesirable combinations of video and audio chunks. Furthermore, RAV trains a neural network model that automatically outputs the bitrates for future audio and video chunks without relying on any presumptions about the environment, achieving good robustness to a broad spectrum of conditions. By conducting trace-driven and real-world experiments, we demonstrate that RAV significantly ameliorates the average overall viewing quality by 37.96%-118.20% over the state-of-the-art ABR algorithms. In addition, we also conduct subjective experiments by inviting 32 volunteers, and 27/32 users strongly agree that RAV provides them a better viewing experience than existing ABR solutions.
What problem does this paper attempt to address?