Dependency-Aware CAV Task Scheduling via Diffusion-Based Reinforcement Learning

Xiang Cheng,Zhi Mao,Ying Wang,Wen Wu
2024-11-27
Abstract:In this paper, we propose a novel dependency-aware task scheduling strategy for dynamic unmanned aerial vehicle-assisted connected autonomous vehicles (CAVs). Specifically, different computation tasks of CAVs consisting of multiple dependency subtasks are judiciously assigned to nearby CAVs or the base station for promptly completing tasks. Therefore, we formulate a joint scheduling priority and subtask assignment optimization problem with the objective of minimizing the average task completion time. The problem aims at improving the long-term system performance, which is reformulated as a Markov decision process. To solve the problem, we further propose a diffusion-based reinforcement learning algorithm, named Synthetic DDQN based Subtasks Scheduling, which can make adaptive task scheduling decision in real time. A diffusion model-based synthetic experience replay is integrated into the reinforcement learning framework, which can generate sufficient synthetic data in experience replay buffer, thereby significantly accelerating convergence and improving sample efficiency. Simulation results demonstrate the effectiveness of the proposed algorithm on reducing task completion time, comparing to benchmark schemes.
Artificial Intelligence,Robotics
What problem does this paper attempt to address?
### What problem does this paper attempt to solve? This paper aims to solve the task - scheduling problem of unassisted connected autonomous vehicles (CAVs) in a dynamic environment. Specifically, the author focuses on how to effectively allocate and schedule computational tasks consisting of multiple dependent subtasks to minimize the average task completion time and improve the long - term performance of the system. #### Main problem description: 1. **Task Scheduling and Resource Limitations**: - When performing various computationally - intensive and delay - sensitive tasks, such as perception fusion, real - time navigation based on video or augmented reality (AR), and multimedia entertainment, autonomous vehicles (CAVs) need to be jointly processed to ensure safe driving and meet quality - of - service requirements. - Due to the limited computing resources of CAVs, when multiple tasks need to be processed simultaneously, the task completion time will be extended. 2. **Limitations of Existing Solutions**: - The method of directly offloading the entire task to the base station (BS) for rapid processing may extend the completion time due to additional task transmissions. - Although the partial offloading scheme reduces the transmission delay, it is difficult to ensure efficient task scheduling in a highly dynamic environment. - Using vehicle - edge computing (VEC) to offload tasks to nearby service vehicles (SVs) can accelerate task completion, but geographical limitations make it uneconomical and infeasible to deploy a large number of base stations on highways. 3. **Optimization Objectives**: - The goal of the paper is to improve system performance by optimizing the task - scheduling strategy to minimize the average completion time of all tasks. - Specifically, the author proposes a dependency - aware task - scheduling strategy, which takes into account the dependencies between tasks and realizes real - time adaptive task - scheduling decisions through the synthetic DDQN (SDSS) algorithm driven by deep reinforcement learning (DRL). #### Key Points of the Solution: - **Model Construction**: - A model including task dependencies and a scheduling priority adjustment mechanism is established to determine the scheduling priority of subtasks and select the optimal offloading target (local vehicle, service vehicle, or base station). - **Optimization Problem Modeling**: - The long - term subtask - scheduling problem is modeled as a Markov decision process (MDP) to minimize the average completion time of all tasks. - **Algorithm Design**: - The SDSS algorithm is proposed, which combines the diffusion model and the double - deep Q - network (DDQN) to accelerate convergence and improve sample efficiency by generating synthetic experience data. #### Summary: The paper solves the task - scheduling problem of CAVs in a dynamic environment by proposing a dependency - aware task - scheduling strategy. In particular, in the case of limited computing resources, it shows how to minimize the task completion time through reasonable task offloading and scheduling. By introducing deep reinforcement learning and the diffusion model, this method can achieve efficient real - time scheduling decisions in complex and changeable environments.