Collaborative Inference for MEC Services Based on Multimodal Deep Neural Network.

Yu Liu,Chuxuan Wang,Yilin Xiao,Zefang Lv,Liang Xiao,Xiangyang Ji
DOI: https://doi.org/10.1109/iccc57788.2023.10233276
2023-01-01
Abstract:Collaborative inference can effectively ease the computational burden imposed on mobile devices by deep neural networks (DNNs), such as employing multimodal DNNs to achieve a comprehensive understanding of the environment. Although reinforcement learning (RL)-based collaborative inference adapts to dynamic environments and enhances inference performance for DNNs, especially those with chain-structured architectures, the policy selection for multimodal DNNs must account for multiple computation-intensive feature encoders. This paper proposes a hierarchical reinforcement learning (RL)-based collaborative inference scheme for multimodal DNNs that concurrently determines the partition point and identifies the appropriate DNN-based feature encoder for each modality within the mobile edge computing system. The hierarchical structure designed for policy selection has alleviated the exponential growth of the modalities. Moreover, the mobile device maintains a confidence score based on probabilities of predefined classes to assess the risk of inference policies that results in low inference accuracy and thus avoids exploring the risk policy. Simulation results demonstrate that our proposed scheme surpasses the Neurosurgeon benchmark in reducing inference latency and energy consumption, all while maintaining a high level of confidence score without significant degradation.
What problem does this paper attempt to address?