LMMCoDrive: Cooperative Driving with Large Multimodal Model

Haichao Liu, Ruoyu Yao, Zhenmin Huang, Shaojie Shen, Jun Ma
2024-09-18
Abstract: To address the intricate challenges of decentralized cooperative scheduling and motion planning in Autonomous Mobility-on-Demand (AMoD) systems, this paper introduces LMMCoDrive, a novel cooperative driving framework that leverages a Large Multimodal Model (LMM) to enhance traffic efficiency in dynamic urban environments. The framework integrates scheduling and motion planning to ensure the effective operation of Cooperative Autonomous Vehicles (CAVs). The spatial relationship between CAVs and passenger requests is abstracted into a Bird's-Eye View (BEV) to fully exploit the potential of the LMM. In addition, the trajectory of each CAV is refined while collision avoidance is enforced through safety constraints. A decentralized optimization strategy, facilitated by the Alternating Direction Method of Multipliers (ADMM) within the LMM framework, is proposed to drive the graph evolution of CAVs. Simulation results demonstrate the pivotal role and significant impact of the LMM in optimizing CAV scheduling and enhancing the decentralized cooperative optimization process for each vehicle. This marks a substantial stride towards practical, efficient, and safe AMoD systems that are poised to revolutionize urban transportation. The code is available at https://github.com/henryhcliu/LMMCoDrive.
Robotics
What problem does this paper attempt to address?
### Problems the Paper Attempts to Solve

This paper aims to address the complex challenges of decentralized cooperative scheduling and motion planning in Autonomous Mobility-on-Demand (AMoD) systems. Specifically, it proposes a new cooperative driving framework, **LMMCoDrive**, which leverages a Large Multimodal Model (LMM) to enhance traffic efficiency in dynamic urban environments.

### Main Issues Include:

1. **Integration of Task Scheduling and Cooperative Motion Planning**:
   - Prior research often treats task scheduling and cooperative motion planning separately and ignores their interdependencies, which limits practical deployment.
   - A method is needed that performs task scheduling and motion planning together while ensuring effective vehicle cooperation and collision avoidance.
2. **Complexity of Dynamic Urban Traffic Systems**:
   - Urban traffic systems are highly dynamic, requiring constant path adjustments in response to passenger requests and changing traffic conditions.
   - Developing a general scheduling strategy that adapts to this dynamism is a significant challenge.
3. **Computational Efficiency of Large-Scale Cooperative Motion Planning**:
   - Achieving large-scale cooperative motion planning while ensuring safety and efficiency is a complex task.
   - An efficient optimization algorithm is needed to handle cooperative driving for a large number of Cooperative Autonomous Vehicles (CAVs).

### Solutions

- **LMMCoDrive Framework**:
  - Integrates the task scheduling and motion planning processes, and abstracts the spatial relationship between CAVs and passenger requests into a Bird's-Eye View (BEV) so that the LMM can be fully exploited.
  - Proposes a decentralized optimization algorithm for graph evolution based on the Alternating Direction Method of Multipliers (ADMM), which partitions the CAVs into multiple subgraphs to reduce the computational burden.
- **Experimental Results**:
  - Simulation results show that the LMM plays a key role in optimizing the graph evolution process for CAV scheduling and cooperative driving tasks, improving the practicality and efficiency of AMoD systems.

### Summary

By introducing the LMMCoDrive framework, the paper addresses the integration of task scheduling and cooperative motion planning in AMoD systems, enhancing traffic efficiency in dynamic urban environments. This represents an important step towards practical, efficient, and safe AMoD systems.
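
The "partition into subgraphs, then optimize with ADMM" idea mentioned under Solutions can be illustrated with a toy sketch. This is not the paper's algorithm: the distance-threshold grouping rule, the scalar quadratic objective, and the names `partition_into_subgraphs`, `consensus_admm`, `radius`, and `targets` are all assumptions made for illustration. The sketch groups vehicles into connected components of a proximity graph, then runs textbook consensus ADMM inside one group so that its members agree on a shared scalar (for example, a common cruising speed).

```python
from math import hypot


def partition_into_subgraphs(positions, radius):
    """Group vehicles into connected components of a proximity graph.

    Assumption: two vehicles are linked when they are within `radius`
    of each other. The paper's actual partitioning rule may differ.
    """
    n = len(positions)
    parent = list(range(n))  # union-find forest

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    for i in range(n):
        for j in range(i + 1, n):
            (xi, yi), (xj, yj) = positions[i], positions[j]
            if hypot(xi - xj, yi - yj) <= radius:
                parent[find(i)] = find(j)

    groups = {}
    for i in range(n):
        groups.setdefault(find(i), []).append(i)
    return sorted(groups.values())


def consensus_admm(targets, rho=1.0, iterations=100):
    """Consensus ADMM for min sum_i 0.5 * (x_i - a_i)^2 subject to x_i = z.

    Each agent i keeps a local copy x_i and a scaled dual u_i; z is the
    shared consensus value. For this toy objective z converges to the
    mean of `targets`.
    """
    n = len(targets)
    x = list(targets)
    u = [0.0] * n
    z = 0.0
    for _ in range(iterations):
        # Local step: each agent minimizes its own cost plus the penalty term.
        x = [(a + rho * (z - ui)) / (1.0 + rho) for a, ui in zip(targets, u)]
        # Consensus step: average the local estimates plus duals.
        z = sum(xi + ui for xi, ui in zip(x, u)) / n
        # Dual step: accumulate each agent's disagreement with the consensus.
        u = [ui + xi - z for xi, ui in zip(x, u)]
    return z


# Hypothetical vehicle positions (metres) and preferred speeds (m/s).
positions = [(0, 0), (1, 0), (10, 0), (11, 0), (30, 0)]
preferred_speed = [8.0, 12.0, 9.0, 11.0, 10.0]

groups = partition_into_subgraphs(positions, radius=2.0)
print(groups)  # [[0, 1], [2, 3], [4]]
for group in groups:
    agreed = consensus_admm([preferred_speed[i] for i in group])
    print(group, round(agreed, 3))
```

Solving each subgraph independently keeps every ADMM instance small, which is the computational benefit the summary attributes to partitioning. A real motion planner would replace the scalar objective with per-vehicle trajectory costs and collision-avoidance constraints.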