Efficient Transformer Inference for Extremely Weak Edge Devices Using Masked Autoencoders.

Tao Liu,Peng Li,Yu Gu,Peng Liu
DOI: https://doi.org/10.1109/icc45041.2023.10279202
2023-01-01
Abstract:The abundance of data provided by mobile edge devices enables a wide range of mobile edge computing (MEC) applications. Numerous studies have investigated efficient offloading methods for bandwidth savings in MEC. However, they focus on trading the device's computational cost for a reduction in communication, while edge devices can be rather resource-limited and must handle several jobs simultaneously. In this paper, the computation overhead on the device is pushed to its absolute minimum (almost no overhead), and consideration is given to enhancing the accuracy of the image recognition task within the constraints of the transmission volume limitation. We propose a mask-reconstruct system called MOT to mask images on the device side and recover images with the Masked Autoencoders (MAE)-based model on the server side. We further design a feedback-driven scheme to achieve content-aware transmission. Extensive experiments have been conducted to verify the effectiveness of the MOT.
What problem does this paper attempt to address?