DIET: Customized Slimming for Incompatible Networks in Sequential Recommendation

Kairui Fu,Shengyu Zhang,Zheqi Lv,Jingyuan Chen,Jiwei Li
2024-06-15
Abstract:Due to the continuously improving capabilities of mobile edges, recommender systems start to deploy models on edges to alleviate network congestion caused by frequent mobile requests. Several studies have leveraged the proximity of edge-side to real-time data, fine-tuning them to create edge-specific models. Despite their significant progress, these methods require substantial on-edge computational resources and frequent network transfers to keep the model up to date. The former may disrupt other processes on the edge to acquire computational resources, while the latter consumes network bandwidth, leading to a decrease in user satisfaction. In response to these challenges, we propose a customizeD slImming framework for incompatiblE neTworks(DIET). DIET deploys the same generic backbone (potentially incompatible for a specific edge) to all devices. To minimize frequent bandwidth usage and storage consumption in personalization, DIET tailors specific subnets for each edge based on its past interactions, learning to generate slimming subnets(diets) within incompatible networks for efficient transfer. It also takes the inter-layer relationships into account, empirically reducing inference time while obtaining more suitable diets. We further explore the repeated modules within networks and propose a more storage-efficient framework, DIETING, which utilizes a single layer of parameters to represent the entire network, achieving comparably excellent performance. The experiments across four state-of-the-art datasets and two widely used models demonstrate the superior accuracy in recommendation and efficiency in transmission and storage of our framework.
Distributed, Parallel, and Cluster Computing,Artificial Intelligence,Information Retrieval
What problem does this paper attempt to address?
### What problems does this paper attempt to solve? This paper aims to solve several key challenges faced by edge devices in the sequence recommendation system, specifically including: 1. **Transmission efficiency**: - Current methods frequently download models from the cloud to adapt to changes in user interests (as shown in Figure 1(b)), which can lead to severe network congestion and non - negligible transmission delays. Therefore, how to design the representation and transmission mode of the model is a problem worthy of consideration. 2. **Storage consumption**: - On large - scale platforms (such as Amazon and Taobao), each user may access the recommendation system through multiple channels (such as front - page recommendation, post - purchase recommendation, short - video recommendation, etc.). In these cases, as shown in Figure 1(c), previous methods will send a large number of models to edge devices, resulting in significant storage pressure. Intuitively, there may be similarities between models of different channels, and this can be utilized to reduce the storage occupation on edge devices. 3. **Inference cost**: - The inference speed of the edge - side model determines the waiting time after a user's request. A faster inference speed enables the recommendation system to process more requests within the same time. However, as shown in Figure 1(d), reducing the time required for inference on resource - constrained edge devices is an additional challenge. To address these problems, the authors propose a lightweight and efficient edge - cloud collaborative recommendation framework - DIET (Customized Slimming Framework), whose goal is to provide personalized models (including parameters and structures) for different edge devices under strict resource constraints and minimize costs as much as possible to adapt to their local interests. Specifically, DIET can quickly adapt to user interests, achieve lightweight transmission and storage, and improve the inference speed by learning to generate personalized "diets" (i.e., sub - networks), which can be optimized at both the element level and the filter level. In addition, for the repetitive modules (such as CNN and Transformer) existing in widely used recommenders, the authors also propose a more storage - saving framework, DIETING, which uses single - layer parameters to represent the entire network and achieves equally excellent performance. ### Summary The main contributions of the paper are as follows: - **First realization** of simultaneous structure and parameter personalization under strict edge constraints. - Proposing to generate edge - specific "diets" according to users' past interactions, enabling it to quickly adapt to changes in user interests and be lightweight in terms of transmission and storage. - Considering the importance of the filter level to correct the generated "diets", which improves the stability of the framework and empirically reduces the inference cost of edge devices. - Conducting extensive experiments on four real - world datasets, demonstrating the superior performance of DIET under the constraints of limited computing resources and transmission delays.