A Cloud-Edge Collaborative Architecture for Multimodal LLMs-Based Advanced Driver Assistance Systems in IoT Networks
Yaqi Hu,Dongdong Ye,Jiawen Kang,Maoqiang Wu,Rong Yu
DOI: https://doi.org/10.1109/jiot.2024.3509628
IF: 10.6
2024-01-01
IEEE Internet of Things Journal
Abstract:Advanced Driver Assistance Systems (ADAS) enhance driving safety and convenience by providing auxiliary functions. However, traditional rule-based or learning-based ADAS lack the capability for commonsense-based environmental understanding and multi-sensor data fusion, which leads to limitations in complex dynamic environments. Multimodal large language models (MLLMs) can effectively integrate data from different modalities and possess strong environmental perception and commonsense reasoning abilities, offering more intelligent driver assistance services within Internet of Things (IoT) networks. In this paper, we propose a cloud-edge collaborative ADAS based on MLLMs, utilizing IoT networks by deploying a smaller model, CogVLM2, at the edge and a larger model, ChatGPT-4o, in the cloud to achieve collaborative driver assistance services. Specifically, we first re-annotate the BDD-X dataset and use it to fine-tune CogVLM2 with LoRA, while applying few-shot learning to ChatGPT-4o to enhance their understanding and decision-making capabilities in traffic scenarios. We then formulate service latency, energy consumption, and quality of service (QoS) models for the cloud-edge collaborative ADAS in IoT networks, optimizing the combination of these models. Finally, we design an improved DDPG-based task offloading algorithm by introducing a multi-step reward mechanism and using a diffusion model to generate noise, aiming to determine the optimal execution location (i.e., cloud, edge, or local) for each task. Experimental results show that both CogVLM2 and ChatGPT-4o can achieve basic ADAS functionality. After fine-tuning and few-shot learning, their task success rates were significantly improved. Moreover, compared to other mainstream DRL-based task offloading algorithms, the improved DDPG task offloading algorithm demonstrates better performance in latency, energy consumption, and QoS within IoT networks.