Optimizing Vehicular Networks with Variational Quantum Circuits-based Reinforcement Learning

Zijiang Yan, Ramsundar Tanikella, Hina Tabassum
2024-05-29
Abstract: In vehicular networks (VNets), ensuring both road safety and dependable network connectivity is of utmost importance. Achieving this necessitates the creation of resilient and efficient decision-making policies that prioritize multiple objectives. In this paper, we develop a Variational Quantum Circuit (VQC)-based multi-objective reinforcement learning (MORL) framework to characterize efficient network selection and autonomous driving policies in a vehicular network (VNet). Numerical results showcase notable enhancements in both convergence rates and rewards when compared to conventional deep-Q networks (DQNs), validating the efficacy of the VQC-MORL solution.
Machine Learning, Artificial Intelligence, Networking and Internet Architecture
What problem does this paper attempt to address?
The paper addresses two key issues in vehicular networks (VNets): road safety and reliable network connectivity. Specifically, it aims to maximize data transmission rates and traffic flow on multi-lane highways by optimizing cell-association and autonomous driving policies while ensuring collision avoidance. To this end, the authors propose a multi-objective reinforcement learning (MORL) framework based on variational quantum circuits (VQCs). The core contributions of the paper are:

1. **Proposing the VQC-MORL framework**: Variational quantum circuits replace traditional neural networks as Q-function approximators, to overcome the challenges that classical reinforcement learning methods face in high-dimensional state spaces.
2. **Improving decision-making efficiency**: Compared to traditional Deep Q-Networks (DQNs), the VQC-MORL method converges faster and attains higher rewards.
3. **Multi-objective optimization**: The problem is modeled as a Multi-Objective Markov Decision Process (MOMDP) and then transformed into quantum eigenstates and eigenactions on quantum circuits.

Experimental results show that the VQC-MORL method improves training efficiency by 31.32% over traditional methods and achieves an average increase of 18.64% in communication and traffic rewards. The paper also analyzes in detail how the total reward varies with the number of autonomous vehicles and the desired speed.
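
To make the idea of a VQC acting as a Q-function approximator more concrete, here is a minimal sketch in plain Python. It is not the authors' circuit: the two-qubit layout, the RY angle encoding, the RY-plus-CNOT layers, the Pauli-Z readout, and the weighted-sum reward are all assumptions chosen for illustration, and every function name (`vqc_q_values`, `greedy_action`, `scalarize`) is hypothetical. Because only RY and CNOT gates are used, the simulated state vector stays real-valued.

```python
import math

def apply_ry(state, qubit, theta):
    """Apply RY(theta) to one qubit of a real state vector (qubit q = bit q of the index)."""
    c, s = math.cos(theta / 2), math.sin(theta / 2)
    new = state[:]
    bit = 1 << qubit
    for i in range(len(state)):
        if i & bit == 0:
            a0, a1 = state[i], state[i | bit]
            new[i] = c * a0 - s * a1
            new[i | bit] = s * a0 + c * a1
    return new

def apply_cnot(state, control, target):
    """Flip the target qubit wherever the control qubit is 1."""
    new = state[:]
    cbit, tbit = 1 << control, 1 << target
    for i in range(len(state)):
        if i & cbit and not i & tbit:
            new[i], new[i | tbit] = state[i | tbit], state[i]
    return new

def expect_z(state, qubit):
    """Expectation value of Pauli-Z on one qubit; always lies in [-1, 1]."""
    bit = 1 << qubit
    return sum((-1 if i & bit else 1) * a * a for i, a in enumerate(state))

def vqc_q_values(features, weights):
    """Assumed design: one Q-value per qubit, read out as a Pauli-Z expectation.

    features: one value in [0, 1] per qubit (angle-encoded as RY(pi * x)).
    weights:  list of layers, each a list with one trainable RY angle per qubit.
    """
    n_qubits = len(features)
    state = [0.0] * (1 << n_qubits)
    state[0] = 1.0  # start in |0...0>
    for q, x in enumerate(features):
        state = apply_ry(state, q, math.pi * x)
    for layer in weights:
        for q, theta in enumerate(layer):
            state = apply_ry(state, q, theta)
        for q in range(n_qubits - 1):  # entangle neighbouring qubits
            state = apply_cnot(state, q, q + 1)
    return [expect_z(state, q) for q in range(n_qubits)]

def greedy_action(q_values):
    """Index of the largest Q-value."""
    return max(range(len(q_values)), key=q_values.__getitem__)

def scalarize(r_comm, r_traffic, w_comm=0.5):
    """Assumed weighted-sum combination of communication and traffic rewards."""
    return w_comm * r_comm + (1.0 - w_comm) * r_traffic

# Hypothetical usage: two normalized state features, two trainable layers.
q = vqc_q_values([0.3, 0.8], [[0.1, -0.4], [0.7, 0.2]])
print("Q-values:", q, "greedy action:", greedy_action(q))
print("scalarized reward:", scalarize(r_comm=1.0, r_traffic=3.0, w_comm=0.25))
```

In a full training loop the angles in `weights` would be updated to reduce a temporal-difference loss on the scalarized reward, in the same role that network weights play in a DQN. A practical implementation would more likely use a quantum-circuit library with automatic differentiation than a hand-written simulator like this one.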