Abstract: Bike-Sharing Systems provide eco-friendly urban mobility, contributing to the alleviation of traffic congestion and to healthier lifestyles. Efficiently operating such systems and maintaining high customer satisfaction is challenging due to the stochastic nature of trip demand, which leads to full or empty stations. Devising effective rebalancing strategies that use vehicles to redistribute bikes among stations is therefore of utmost importance for operators. As a promising alternative to classical mathematical optimization, reinforcement learning is gaining ground for solving sequential decision-making problems. This paper introduces a spatio-temporal reinforcement learning algorithm for the dynamic rebalancing problem with multiple vehicles. We first formulate the problem as a Multi-agent Markov Decision Process in a continuous-time framework. This allows for independent and cooperative vehicle rebalancing, eliminating the impractical restriction of time-discretized models in which vehicle departures are synchronized. A comprehensive simulator under the first-arrive-first-served rule is then developed to facilitate the learning process by computing immediate rewards under diverse demand scenarios. To estimate the value function and learn the rebalancing policy, various Deep Q-Network configurations are tested with the objective of minimizing lost demand. Experiments are carried out on various datasets generated from historical data, affected by both temporal and weather factors. The proposed algorithms outperform benchmarks, including a multi-period Mixed-Integer Programming model, in terms of lost demand. Once trained, they yield immediate decisions, making them suitable for real-time applications. Our work offers practical insights for operators and enriches the integration of reinforcement learning into dynamic rebalancing problems, paving the way for more intelligent and robust urban mobility solutions.
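To make the role of the simulator concrete, the following is a minimal sketch of how lost demand could be counted under a first-arrive-first-served rule. It is not the paper's simulator: the `Station` class, the `simulate_lost_demand` function, the event tuple layout, and the choice to count a failed return at a full station as lost demand are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass
class Station:
    capacity: int  # number of docks (hypothetical field)
    bikes: int     # bikes currently available (hypothetical field)


def simulate_lost_demand(stations, events):
    """Process events in time order and count unserved requests.

    Each event is (time, station_id, kind), with kind "rent" or "return".
    Assumption: a rental at an empty station and a return at a full
    station each count as one unit of lost demand.
    """
    lost = 0
    # sorted() is stable, so events with equal times keep their input order.
    for _, station_id, kind in sorted(events, key=lambda e: e[0]):
        station = stations[station_id]
        if kind == "rent":
            if station.bikes > 0:
                station.bikes -= 1
            else:
                lost += 1  # empty station: rental request is lost
        elif kind == "return":
            if station.bikes < station.capacity:
                station.bikes += 1
            else:
                lost += 1  # full station: return request is lost
        else:
            raise ValueError(f"unknown event kind: {kind!r}")
    return lost


stations = {0: Station(capacity=2, bikes=1)}
events = [
    (0.5, 0, "rent"),
    (1.0, 0, "rent"),
    (2.0, 0, "return"),
    (2.5, 0, "return"),
    (3.0, 0, "return"),
]
lost = simulate_lost_demand(stations, events)
reward = -lost  # one possible immediate reward: negative lost demand
```

In a learning loop, a reward of this kind would be computed over the interval between two consecutive decisions of a vehicle, which is where the continuous-time formulation matters.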
