Abstract: Lane-changing decisions, which are crucial for autonomous vehicle path planning, face practical challenges due to rule-based constraints and limited data. Deep reinforcement learning has become a major research focus due to its advantages in data acquisition and interpretability. However, current models often overlook collaboration, which not only reduces overall traffic efficiency but also hinders the vehicle's own normal driving in the long run. To address this issue, this paper proposes a method named Mix Q-learning for Lane Changing (MQLC) that integrates a hybrid value Q network, taking into account both collective and individual benefits. At the collective level, our method coordinates the individual Q and global Q networks by utilizing global information. This enables agents to effectively balance their individual interests with the collective benefit. At the individual level, we integrate a deep learning-based intent recognition module into the observation and enhance the decision network. These changes provide agents with richer decision information and more accurate feature extraction for improved lane-changing decisions. This strategy enables the multi-agent system to learn and formulate optimal decision-making strategies effectively. In extensive experiments, our MQLC model outperforms other state-of-the-art multi-agent decision-making methods, achieving significantly safer and faster lane-changing decisions.

TLMIX: Twin Leader Mixing Network for Cooperative Multi-Agent Reinforcement Learning

MO-MIX: Multi-Objective Multi-Agent Cooperative Decision-Making With Deep Reinforcement Learning

Learning Intra-group Cooperation in Multi-agent Systems

Target-Value-Competition-Based Multi-Agent Deep Reinforcement Learning Algorithm for Distributed Nonconvex Economic Dispatch

Decomposing Shared Networks for Separate Cooperation with Multi-Agent Reinforcement Learning

Credit Assignment with Meta-Policy Gradient for Multi-Agent Reinforcement Learning

Learning Multi-Agent Cooperation via Considering Actions of Teammates

MAR2MIX: A Novel Model for Dynamic Problem in Multi-agent Reinforcement Learning

CoMIX: A Multi-agent Reinforcement Learning Training Architecture for Efficient Decentralized Coordination and Independent Decision-Making

QTypeMix: Enhancing Multi-Agent Cooperative Strategies through Heterogeneous and Homogeneous Value Decomposition

AgentMixer: Multi-Agent Correlated Policy Factorization

QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning

Multi-Agent Q-Value Mixing Network with Covariance Matrix Adaptation Strategy for the Voltage Regulation Problem

Mix Q-learning for Lane Changing: A Collaborative Decision-Making Method in Multi-Agent Deep Reinforcement Learning

Regularized Softmax Deep Multi-Agent Q-Learning

Soft-QMIX: Integrating Maximum Entropy For Monotonic Value Function Factorization

Revisiting QMIX: Discriminative Credit Assignment by Gradient Entropy Regularization

Value function factorization with dynamic weighting for deep multi-agent reinforcement learning

POWQMIX: Weighted Value Factorization with Potentially Optimal Joint Actions Recognition for Cooperative Multi-Agent Reinforcement Learning

CMIX: Deep Multi-agent Reinforcement Learning with Peak and Average Constraints

Multi-Agent Collaboration via Reward Attribution Decomposition