Poster Abstract: E4: Energy-Efficient Early-Exit DNN Inference Framework for Edge Video Analytics

Ziyang Zhang,Yang Zhao,Jie Liu
DOI: https://doi.org/10.1145/3625687.3628383
2024-01-01
Abstract:Deep neural networks (DNNs) are becoming extremely popular in video analytics applications at the edge. However, compute-intensive DNNs pose new challenges to achieve energy-efficient DNN inference on resource-constrained edge devices. In this paper, we propose E4, an energy-efficient DNN inference framework for edge video analytics. First, E4 analyzes video frame complexity by employing an attention-based cascade module that automatically determines DNN exit points. Second, E4's just-in-time (JIT) profiler leverages coordinate descent search to co-optimize the CPU and GPU clock frequencies for each layer before the DNN exit point. Preliminary experimental results show that E4 outperforms exiting methods in terms of power consumption and inference latency.
What problem does this paper attempt to address?