Abstract:The heterogeneous fleet and demand vehicle routing problem with time-window constraints (HFDVRPTW) is a crucial optimization problem of significant importance in real-world logistics operations. In this paper, we propose a deep reinforcement learning (DRL)-based method, termed spatial Edge-Feature EnhanCed mulTIgraph fusion encoder With spectral-based embedding and hieRarchical decOder with learnable TEmpoRal positional embedding (EFECTIW-ROTER, pronounced "Effective Router"), to tackle this complex and practical optimization problem. EFECTIW-ROTER utilizes two sparse graphs to represent node connectivity, where nodes correspond to customers and the depot. This sparsity results from the time-window constraints and customers' demand relative to the list of acceptable vehicle attributes specified for service within a heterogeneous fleet, determined by the reachability of the nodes based on these two factors. Leveraging two graph Transformer models, EFECTIW-ROTER's encoding module captures the interactions between the nodes based on these factors. One model encodes customers' heterogeneous demand with spatial edge features based on travel time between the nodes, while the second employs temporal positional embeddings to capture temporal relationships based on time-window ordering. A fusion model is introduced to integrate node interactions based on these graphs. Additionally, a spectral-attention-based pooling ensures effective state representation for the DRL-based method. EFECTIW-ROTER features a hierarchical attention decoder operating in two stages: heterogeneous vehicle selection and node selection. Enhanced with positional embeddings, the decoder is empowered to make effective routing decisions based on time-window constraints' ordering. Experimental results using real-world traffic data from two major Canadian cities confirm EFECTIW-ROTER's better performance over current state-of-the-art DRL-based and heuristic methods. EFECTIW-ROTER reduces travel times while also achieving faster computational times when compared to conventional heuristics. Additional experiments demonstrate its generalizability across larger instances.

Token-based Deep Reinforcement Learning for Heterogeneous VRP with Service Time Constraints

Uniformity of Markov Elements in Deep Reinforcement Learning for Traffic Signal Control

Solving the Vehicle Routing Problem with Stochastic Travel Cost Using Deep Reinforcement Learning

Reinforcement Learning for Solving Multiple Vehicle Routing Problem with Time Window

Deep Reinforcement Learning for Solving the Heterogeneous Capacitated Vehicle Routing Problem

SoC-VRP: A Deep-Reinforcement-Learning-Based Vehicle Route Planning Mechanism for Service-Oriented Cooperative ITS

Coordinated Multi‐agent Hierarchical Deep Reinforcement Learning to Solve Multi‐trip Vehicle Routing Problems with Soft Time Windows

A deep reinforcement learning approach for solving the Traveling Salesman Problem with Drone

EFECTIW-ROTER: Deep Reinforcement Learning Approach for Solving Heterogeneous Fleet and Demand Vehicle Routing Problem with Time-Window Constraints

Deep Reinforcement Learning-Based Multi-Agent Algorithm for Vehicle Routing Problem in Complex Logistics Scenarios

Deep Reinforcement Learning for Solving Vehicle Routing Problems With Backhauls

Transit Signal Priority Strategy with Heterogeneous Graph-Based Deep Reinforcement Learning for Autonomous Public Transit Vehicles

Multi-Type Attention for Solving Multi-Depot Vehicle Routing Problems

A Hybrid of Deep Reinforcement Learning and Local Search for the Vehicle Routing Problems

Multi-Vehicle Routing Problems with Soft Time Windows: A Multi-Agent Reinforcement Learning Approach

Edge-DIRECT: A Deep Reinforcement Learning-based Method for Solving Heterogeneous Electric Vehicle Routing Problem with Time Window Constraints

Heterogeneous Attentions for Solving Pickup and Delivery Problem via Deep Reinforcement Learning

The Vehicle Routing Problem with Time Windows and Time Costs

Deep Reinforcement Learning for the Dynamic and Uncertain Vehicle Routing Problem

Graph attention reinforcement learning with flexible matching policies for multi-depot vehicle routing problems

Computational Resource Sharing in a Vehicular Cloud Network Via Deep Reinforcement Learning