Towards Integrated Fine-tuning and Inference when Generative AI meets Edge Intelligence

Ning Chen,Zhipeng Cheng,Xuwei Fan,Xiaoyu Xia,Lianfen Huang
2024-01-05
Abstract:The high-performance generative artificial intelligence (GAI) represents the latest evolution of computational intelligence, while the blessing of future 6G networks also makes edge intelligence (EI) full of development potential. The inevitable encounter between GAI and EI can unleash new opportunities, where GAI's pre-training based on massive computing resources and large-scale unlabeled corpora can provide strong foundational knowledge for EI, while EI can harness fragmented computing resources to aggregate personalized knowledge for GAI. However, the natural contradictory features pose significant challenges to direct knowledge sharing. To address this, in this paper, we propose the GAI-oriented synthetical network (GaisNet), a collaborative cloud-edge-end intelligence framework that buffers contradiction leveraging data-free knowledge relay, where the bidirectional knowledge flow enables GAI's virtuous-cycle model fine-tuning and task inference, achieving mutualism between GAI and EI with seamless fusion and collaborative evolution. Experimental results demonstrate the effectiveness of the proposed mechanisms. Finally, we discuss the future challenges and directions in the interplay between GAI and EI.
Distributed, Parallel, and Cluster Computing,Machine Learning
What problem does this paper attempt to address?
### Problems Addressed by the Paper The paper primarily explores the combination and complementary advantages between Generative AI (GAI) and Edge Intelligence (EI). Specifically: 1. **Imbalance of Data and Computational Resources**: - GAI typically requires large-scale parameters and massive amounts of data for pre-training but faces a shortage of high-quality public data, and the computational resources needed for pre-training are very expensive. - EI tends to deploy lightweight models on the user side, utilizing the data and computational resources of terminal devices, but the model size is small and lacks prior knowledge. 2. **Barriers to Knowledge Transfer**: - Due to differences in parameter scale, application domains, network architecture, data size, and resource availability, knowledge transfer between GAI and EI is hindered. - Upstream knowledge flow interruption: Data from terminal devices cannot be uploaded to the cloud, leading to a blockage in the knowledge pipeline from EI to GAI. - Downstream knowledge flow interruption: Large-scale models of GAI are difficult to train or infer on resource-constrained terminal devices, leading to a blockage in the transfer of foundational knowledge from GAI to EI. To address the above issues, the paper proposes a collaborative cloud-edge-end intelligent framework named GAI-oriented synthetical network (GaisNet). This framework achieves bidirectional knowledge flow through data-independent knowledge relays, enabling sustainable model fine-tuning and task inference. Specific contributions include: - Proposing a collaborative cloud-edge-end intelligent framework GaisNet, where domain-specific edge models act as data-independent knowledge relays, unlocking bidirectional knowledge flow between GAI and EI. - Listing and analyzing the main issues that GaisNet may encounter during integrated fine-tuning and inference processes. - Experimental results validating the effectiveness of the proposed mechanism. - Discussing future challenges and development directions for the interaction between GAI and EI.