Beimin: Serverless-based Adaptive Real-Time Video Processing

Jiaye Zhang,Zili Meng,Mingwei Xu
DOI: https://doi.org/10.1109/icme57554.2024.10687715
2024-01-01
Abstract:Video-sharing websites need to process the uploaded videos (e.g., face recognition) before distributing them to users. The timely processing of videos is critical for users to always enjoy the latest content. However, videos uploaded by different users are diverse in content, with the volume fluctuating at different times in one day. The static resource allocations will result in frequent overutilization and underutilization when the demands and contents change, while container and virtual machine(VM)-based solutions will incur significant additional overhead. Moreover, it is also challenging to predict the required resources in the future due to the complicated relationship between resources, contents, demands, etc. This paper introduces Beimin, an adaptive video processing framework designed for heterogeneous video processing with flexible demands in real-time. Beimin adopts a serverless framework to efficiently allocate resources, and a deep reinforcement learning (DRL) model to predict the resources to allocate with multi-dimensional inputs (contents, demands, etc.). We conducted tests with Amazon Lambda using a synthetic dataset from Imagenet VID, and the results demonstrate that Beimin reduces cost by 1.61% and processing time by 33.17% compared to existing solutions without harm to accuracy.
What problem does this paper attempt to address?