User Association and Resource Allocation in Large Language Model Based Mobile Edge Computing System over 6G Wireless Communications

Liangxin Qian, Jun Zhao
2024-03-08
Abstract: In the rapidly evolving landscape of large language models (LLMs) and mobile edge computing for 6G, the need for efficient service delivery to mobile users with constrained computational resources has become paramount. Addressing this, our paper delves into a collaborative framework for model training where user data and model adapters are shared with servers to optimize performance. Within this framework, users first update the first several layers of the adapters on their local datasets while freezing the remaining layers. Once this step is complete, these partially trained parameters are transmitted to servers. The servers, equipped with more robust computational capabilities, then update the subsequent layers. After this training, they send the enhanced parameters back to the users. This collaborative training approach ensures that mobile users with limited computational capacities can still benefit from advanced LLM services without being burdened by exhaustive computations. Central to our methodology is the DASHF algorithm, which encapsulates the Dinkelbach algorithm, alternating optimization, semidefinite relaxation (SDR), the Hungarian method, and a pioneering fractional programming technique from a recent IEEE JSAC paper [1]. The crux of DASHF is its capability to reformulate an optimization problem as Quadratically Constrained Quadratic Programming (QCQP) via meticulously crafted transformations, making it solvable by SDR and the Hungarian algorithm. Through extensive simulations, we demonstrate the effectiveness of the DASHF algorithm, offering significant insights for the advancement of collaborative LLM service deployments.
Information Theory, Signal Processing
What problem does this paper attempt to address?
The main problem this paper attempts to solve is how to provide large language model (LLM) services effectively, through a mobile edge computing (MEC) system in a 6G wireless communication environment, to mobile users with limited computing resources. Specifically, the research focuses on user association and resource allocation, aiming to optimize the collaborative training process between mobile devices and servers so that even mobile users with limited computing power can benefit from advanced LLM services.

### Specific description of the problem

1. **Limited computing resources**: Because of limits on computing power and battery life, mobile devices struggle to run complex large language models efficiently.
2. **Wireless communication challenges**: The unpredictability of the wireless communication environment (such as data rate fluctuations and high latency) further increases the difficulty of deploying LLMs in a mobile setting.
3. **Need for collaborative training**: To overcome these challenges, a collaborative training framework is required in which users first train some of the model's parameters locally, transmit these parameters to a more powerful server for further optimization, and then receive the optimized parameters back.

### Overview of the solution

The paper proposes an alternating optimization algorithm named DASHF, which combines the Dinkelbach algorithm, alternating optimization, semidefinite relaxation (SDR), the Hungarian algorithm, and fractional programming techniques. The core of the DASHF algorithm is to reformulate the optimization problem, through carefully designed transformations, as a quadratically constrained quadratic program (QCQP) that can be solved by SDR and the Hungarian algorithm.
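To make the Dinkelbach step named above more concrete, here is a minimal Python sketch of the generic Dinkelbach iteration for maximizing a ratio `N(x) / D(x)` with `D(x) > 0`. It is not the paper's formulation: the functions `service_score` and `total_cost`, the scalar offloading ratio `x`, and the search grid are all hypothetical stand-ins chosen only for illustration. The inner maximization is done by exhaustive search over the grid, whereas the paper solves its subproblems with SDR and the Hungarian algorithm.

```python
def dinkelbach(candidates, numerator, denominator, tol=1e-9, max_iter=100):
    """Maximize numerator(x) / denominator(x) over a finite candidate set.

    Generic Dinkelbach iteration; assumes denominator(x) > 0 for every x.
    """
    lam = 0.0  # current estimate of the optimal ratio
    best = candidates[0]
    for _ in range(max_iter):
        # Parametric subproblem: maximize N(x) - lam * D(x).
        best = max(candidates, key=lambda x: numerator(x) - lam * denominator(x))
        gap = numerator(best) - lam * denominator(best)
        if gap < tol:
            # The parametric optimum is (numerically) zero, so lam is optimal.
            break
        lam = numerator(best) / denominator(best)
    return best, lam


# Hypothetical toy model (an assumption, not taken from the paper):
# the service score grows with the offloading ratio x, while the combined
# latency and energy cost grows quadratically in x.
def service_score(x):
    return 1.0 + 2.0 * x


def total_cost(x):
    return 1.0 + 3.0 * x * x


grid = [i / 100 for i in range(101)]  # offloading ratios 0.00, 0.01, ..., 1.00
best_x, best_ratio = dinkelbach(grid, service_score, total_cost)
print(best_x, best_ratio)
```

Each iteration replaces the fractional objective with the subtractive form `N(x) - lam * D(x)`, which is typically easier to optimize, and then updates `lam` to the ratio achieved by the new maximizer.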
### Main contributions

- **Joint optimization problem**: A joint optimization problem is proposed to optimize the offloading ratio of user adapter parameter training and user association, in order to provide customized LLM services.
- **Service-Cost Ratio (SCR) for users**: The concept of SCR is introduced to quantify the balance between the user service score and the overall latency and energy consumption.
- **Innovative alternating optimization method**: The DASHF algorithm is proposed, which effectively solves the non-convex fractional programming problem and realizes the joint optimization of user association, parameter offloading, and resource allocation.

Through these methods, the paper demonstrates how to effectively support LLM services for mobile users through the MEC system in the 6G wireless communication environment, thereby improving service quality and resource utilization efficiency.
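As an illustration of the Hungarian method mentioned in connection with user association, the sketch below solves a small minimum-cost assignment problem in pure Python using the standard potential-based O(n^3) variant of the algorithm. The cost matrix and the one-to-one matching of users to servers are assumptions made for the example; the summary does not state the paper's actual association constraints or cost definition, and they may differ.

```python
def hungarian(cost):
    """Minimum-cost assignment of each row to a distinct column.

    cost is an n x m list of lists with n <= m.
    Returns (assignment, total), where assignment[i] is the column for row i.
    """
    n, m = len(cost), len(cost[0])
    INF = float("inf")
    u = [0.0] * (n + 1)   # row potentials (1-indexed)
    v = [0.0] * (m + 1)   # column potentials (1-indexed)
    p = [0] * (m + 1)     # p[j] = row currently matched to column j (0 = free)
    way = [0] * (m + 1)   # previous column on the augmenting path to column j

    for i in range(1, n + 1):
        p[0] = i          # column 0 is a virtual column holding the new row
        j0 = 0
        minv = [INF] * (m + 1)
        used = [False] * (m + 1)
        while True:
            used[j0] = True
            i0 = p[j0]
            delta, j1 = INF, 0
            for j in range(1, m + 1):
                if not used[j]:
                    cur = cost[i0 - 1][j - 1] - u[i0] - v[j]  # reduced cost
                    if cur < minv[j]:
                        minv[j], way[j] = cur, j0
                    if minv[j] < delta:
                        delta, j1 = minv[j], j
            # Shift potentials so at least one new reduced cost becomes zero.
            for j in range(m + 1):
                if used[j]:
                    u[p[j]] += delta
                    v[j] -= delta
                else:
                    minv[j] -= delta
            j0 = j1
            if p[j0] == 0:  # reached a free column: augmenting path found
                break
        # Flip the matching along the augmenting path.
        while j0 != 0:
            j1 = way[j0]
            p[j0] = p[j1]
            j0 = j1

    assignment = [-1] * n
    for j in range(1, m + 1):
        if p[j] != 0:
            assignment[p[j] - 1] = j - 1
    total = sum(cost[i][assignment[i]] for i in range(n))
    return assignment, total


# Hypothetical cost of associating user i (row) with server j (column).
association_cost = [
    [4, 1, 3],
    [2, 0, 5],
    [3, 2, 2],
]
assignment, total = hungarian(association_cost)
print(assignment, total)  # [1, 0, 2] 5
```

Here user 0 is matched to server 1, user 1 to server 0, and user 2 to server 2. In an alternating scheme such as the one described above, a matching step of this kind would presumably be re-run whenever the other variables (for example, offloading ratios or resource shares) change the costs.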