Abstract:Resource allocation in dense vehicle-to-everything (V2X) communication networks poses intricate challenges due to scalability issues, dynamic environments, and diverse quality of service requirements. Traditional optimizations like genetic algorithms struggle with computational complexity and dynamic planning. Meanwhile, pure multi-agent reinforcement learning (MARL)—a type of machine learning where multiple agents learn to optimize their decisions through interactions with each other and the environment—faces the curse of dimensionality and communication overhead, which limits the large-scale deployment of dynamic vehicular networks. This paper proposes an innovative approach to joint spectrum allocation and power control within dense V2X networks, conceptualizing it as an MARL-based multi-objective optimization problem. To mitigate computational demands, we adopt a hybrid approach combining centralized-training-with-decentralized-execution paradigm, alongside parameter-sharing techniques, within classic Mean-Field MARL (MF-MARL) framework, and name it Scalable-V2X-MF-MARL (ScalV-MF-MARL), enabling a lightweight training process in dense V2X networks with numerous V2V agents. It also technically encodes observations and mean field of actions, thereby enhancing the model's dynamism and scalability. This advancement permits the application of a single model across varying vehicle densities. Experimental results demonstrate that ScalV-MF-MARL achieves 99.5% of the performance of density-specific MF-MARL models, while reducing GPU memory usage by 79.49% to 94.03% during training as vehicle number increases from 40 to 160. Additionally, it outstrips conventional algorithms. Its generalization capabilities across diverse V2X network densities facilitate training in less dense scenarios, with seamless application to denser networks. In conclusion, ScalV-MF-MARL streamlines the V2X network deployment and effectively handles dynamic changes in vehicle numbers.

Federated Multi-Agent Deep Reinforcement Learning for Resource Allocation of Vehicle-to-Vehicle Communications

Multi-Agent Reinforcement Learning-Based Decentralized Spectrum Access in Vehicular Networks with Emergent Communication

Multi-Agent RL Enables Decentralized Spectrum Access in Vehicular Networks

Deep Reinforcement Learning Based Mode Selection and Resource Allocation for Cellular V2X Communications

Platoon Leader Selection, User Association and Resource Allocation on a C-V2X based highway: A Reinforcement Learning Approach

Deep Reinforcement Learning Based Massive Access Management for Ultra-Reliable Low-Latency Communications

Spectrum-Energy-Efficient Mode Selection and Resource Allocation for Heterogeneous V2X Networks: A Federated Multi-Agent Deep Reinforcement Learning Approach

Deep Reinforcement Learning for Multi-Functional RIS-Aided Over-the-Air Federated Learning in Internet of Robotic Things

A Scalable Mean-Field MARL Framework for Multi-Objective V2X Resource Allocation

Joint Channel Selection using FedDRL in V2X

Transfer Learning in Multi-Agent Reinforcement Learning with Double Q-Networks for Distributed Resource Sharing in V2X Communication

AoI-Aware Resource Allocation for Platoon-Based C-V2X Networks via Multi-Agent Multi-Task Reinforcement Learning

Deep Reinforcement Learning for Resource Allocation in V2V Communications

Meta Federated Reinforcement Learning for Distributed Resource Allocation

Distributed Resource Allocation with Multi-Agent Deep Reinforcement Learning for 5G-V2V Communication

Joint mode selection and resource allocation for cellular V2X communication using distributed deep reinforcement learning under 5G and beyond networks

Collaborative Optimization of Wireless Communication and Computing Resource Allocation based on Multi-Agent Federated Weighting Deep Reinforcement Learning

Multi-Agent Deep Reinforcement Learning for Cooperative Connected Vehicles

Deep Reinforcement Learning Based Vehicle Selection for Asynchronous Federated Learning Enabled Vehicular Edge Computing

Decentralized Multi-Agent DQN-Based Resource Allocation for Heterogeneous Traffic in V2X Communications

Multi-Agent Deep Reinforcement Learning for Urban Traffic Light Control in Vehicular Networks