An Edge-Cloud Collaboration Framework for Generative AI Service Provision with Synergetic Big Cloud Model and Small Edge Models

Yuqing Tian,Zhaoyang Zhang,Yuzhi Yang,Zirui Chen,Zhaohui Yang,Richeng Jin,Tony Q. S. Quek,Kai-Kit Wong
2024-01-03
Abstract:Generative artificial intelligence (GenAI) offers various services to users through content creation, which is believed to be one of the most important components in future networks. However, training and deploying big artificial intelligence models (BAIMs) introduces substantial computational and communication overhead.This poses a critical challenge to centralized approaches, due to the need of high-performance computing infrastructure and the reliability, secrecy and timeliness issues in long-distance access of cloud services. Therefore, there is an urging need to decentralize the services, partly moving them from the cloud to the edge and establishing native GenAI services to enable private, timely, and personalized experiences. In this paper, we propose a brand-new bottom-up BAIM architecture with synergetic big cloud model and small edge models, and design a distributed training framework and a task-oriented deployment scheme for efficient provision of native GenAI services. The proposed framework can facilitate collaborative intelligence, enhance adaptability, gather edge knowledge and alleviate edge-cloud burden. The effectiveness of the proposed framework is demonstrated through an image generation use case. Finally, we outline fundamental research directions to fully exploit the collaborative potential of edge and cloud for native GenAI and BAIM applications.
Networking and Internet Architecture
What problem does this paper attempt to address?
### Problems the Paper Aims to Solve This paper aims to address the issue of excessive consumption of computational and communication resources during the deployment of Generative Artificial Intelligence (GenAI) services. Specifically: 1. **Limitations of Centralized Cloud Services**: The training and deployment of large Artificial Intelligence Models (BAIM) require high-performance computing infrastructure. There are issues related to reliability, privacy, and timeliness when accessing cloud services over long distances. Therefore, it is urgent to transfer some services from the cloud to edge devices to achieve localized GenAI services. 2. **Adaptability and Edge Knowledge Acquisition**: A unified large model needs to meet the needs of all users. In real systems, edge nodes have heterogeneity in terms of communication, computing, and storage capabilities. Thus, a scalable large model architecture is needed to adapt to these changes. 3. **Alleviating Cloud Burden**: Centralized BAIM training requires a large amount of data storage, model parameter caching, and computational costs. Edge networks can provide abundant computing resources, and distributed training can effectively utilize these resources, making the entire process more environmentally friendly and cost-effective. To this end, the paper proposes a novel "bottom-up" BAIM architecture, combining the design concepts of collaborative large cloud models and small edge models. It designs a distributed training framework and task-oriented deployment scheme to achieve efficient localized GenAI services. The effectiveness of the proposed framework is validated through image generation use cases, and future research directions are outlined to fully exploit the potential of edge-cloud collaboration.