OPTIMA: Optimized Policy for Intelligent Multi-Agent Systems Enables Coordination-Aware Autonomous Vehicles

Rui Du,Kai Zhao,Jinlong Hou,Qiang Zhang,Peter Zhang

2024-10-09

Abstract:Coordination among connected and autonomous vehicles (CAVs) is advancing due to developments in control and communication technologies. However, much of the current work is based on oversimplified and unrealistic task-specific assumptions, which may introduce vulnerabilities. This is critical because CAVs not only interact with their environment but are also integral parts of it. Insufficient exploration can result in policies that carry latent risks, highlighting the need for methods that explore the environment both extensively and efficiently. This work introduces OPTIMA, a novel distributed reinforcement learning framework for cooperative autonomous vehicle tasks. OPTIMA alternates between thorough data sampling from environmental interactions and multi-agent reinforcement learning algorithms to optimize CAV cooperation, emphasizing both safety and efficiency. Our goal is to improve the generality and performance of CAVs in highly complex and crowded scenarios. Furthermore, the industrial-scale distributed training system easily adapts to different algorithms, reward functions, and strategies.

Multiagent Systems,Machine Learning,Robotics

What problem does this paper attempt to address?

The problem this paper attempts to address is: The current coordination between connected and autonomous vehicles (CAVs) largely relies on overly simplified and unrealistic task-specific assumptions, which may lead to potential safety risks. CAVs not only interact with the environment but are also an integral part of it, meaning their behaviors influence each other, forming complex feedback loops. If CAVs are not exposed to a wide variety of scenarios during training, they may fail to handle the diversity encountered in real-world driving. When faced with unfamiliar situations, CAVs may exhibit unexpected behaviors, triggering a chain reaction that could put other vehicles in jeopardy. To address these issues, the paper proposes OPTIMA, a new distributed reinforcement learning framework for cooperative autonomous vehicle tasks. OPTIMA optimizes CAV cooperation by alternating between deep environmental data sampling and multi-agent reinforcement learning algorithms, emphasizing safety and efficiency. Its goal is to enhance the generality and performance of CAVs in highly complex and congested situations, and the industrial-scale distributed training system can easily adapt to different algorithms, reward functions, and strategies.

OPTIMA: Optimized Policy for Intelligent Multi-Agent Systems Enables Coordination-Aware Autonomous Vehicles

Autonomous Intersection Management with Heterogeneous Vehicles: A Multi-Agent Reinforcement Learning Approach

Optimal Cooperative Maneuver Planning for Multiple Nonholonomic Robots in a Tiny Environment Via Adaptive-Scaling Constrained Optimization

Integrated Operations Strategies for Shared and Privately-Owned Autonomous Vehicles: A Deep Reinforcement Learning Framework

Multi-Agent Reinforcement Learning for Unmanned Aerial Vehicle Coordination by Multi-Critic Policy Gradient Optimization

Efficient Domain Coverage for Vehicles with Second-Order Dynamics via Multi-Agent Reinforcement Learning

Multi-Agent Constrained Policy Optimization for Conflict-Free Management of Connected Autonomous Vehicles at Unsignalized Intersections

Coordination for Connected and Automated Vehicles at Non-Signalized Intersections: A Value Decomposition-Based Multiagent Deep Reinforcement Learning Approach

Towards Socially Responsive Autonomous Vehicles: A Reinforcement Learning Framework with Driving Priors and Coordination Awareness

An Optimization-Based Cooperative Path-Following Framework for Multiple Robotic Vehicles

Altruistic Maneuver Planning for Cooperative Autonomous Vehicles Using Multi-agent Advantage Actor-Critic

Robustness and Adaptability of Reinforcement Learning-Based Cooperative Autonomous Driving in Mixed-Autonomy Traffic

Safety Guaranteed Robust Multi-Agent Reinforcement Learning with Hierarchical Control for Connected and Automated Vehicles

A Multi-Agent Deep Reinforcement Learning Coordination Framework for Connected and Automated Vehicles at Merging Roadways

Combinatorial-hybrid Optimization for Multi-agent Systems under Collaborative Tasks

Optimization for Master-UAV-powered Auxiliary-Aerial-IRS-assisted IoT Networks: An Option-based Multi-agent Hierarchical Deep Reinforcement Learning Approach

Joint Optimization of Sensing, Decision-making and Motion-controlling for Autonomous Vehicles: A Deep Reinforcement Learning Approach

Optimization for Reinforcement Learning: From Single Agent to Cooperative Agents

MAPPO-PIS: A Multi-Agent Proximal Policy Optimization Method with Prior Intent Sharing for CAVs' Cooperative Decision-Making

Communication-Efficient Decentralized Multi-Agent Reinforcement Learning for Cooperative Adaptive Cruise Control

Enhancing Multi-Agent Coordination through Common Operating Picture Integration