Abstract:Driver-preferred route planning often evaluates the quality of a planned route based on how closely it is followed by the driver. Despite decades of research in this area, there still exist nonnegligible deviations from planned routes. Recently, with the prevalence of GPS data, Inverse Reinforcement Learning (IRL) has attracted much interest due to its ability to directly learn routing patterns from GPS trajectories. However, existing IRL methods are limited in that: (1) They rely on numerical approximations to calculate the expected state visitation frequencies (SVFs), which are inaccurate and time-consuming; and (2) They ignore the fact that the coverage of GPS trajectories is skewed toward popular road segments, causing difficulties in learning from sparsely covered ones. To overcome these challenges, we propose a recursive logit-based meta-IRL approach, where (1) We use the recursive logit model to capture drivers' route choice behavior so that the expected SVFs can be analytically derived, which substantially reduces the computational efforts; and (2) We introduce meta-parameters and employ meta-learning techniques so that the learning on sparsely covered road segments can benefit from that on popular ones. When training our IRL model, we update the rewards of road segments with the expected SVFs by solving several systems of linear equations and update the meta-parameters through a two-level optimization structure to ensure its fast adaption and versatility. We validate our approach using real GPS data in Chengdu, China. Results show that our planned routes better match actual routes compared with state-of-the-art methods including the recursive logit model, Deep-IRL and Dij-IRL: the F1-Score increases by 4.17% with the introduction of the recursive logit model and further increases to 5.19% after meta-learning is employed. Moreover, we can reduce training time by over 95%.

Pareto Improver: Learning Improvement Heuristics for Multi-Objective Route Planning

Learning for multiple purposes: A Q-learning enhanced hybrid metaheuristic for parallel drone scheduling traveling salesman problem

Recursive logit-based meta-inverse reinforcement learning for driver-preferred route planning

Learning Pareto Set for Multi-Objective Continuous Robot Control

To Optimize Human-in-the-loop Learning in Repeated Routing Games

Population Game-Assisted Multi-Agent Reinforcement Learning Method for Dynamic Multi-Vehicle Route Selection

Meta-Learning-Based Deep Reinforcement Learning for Multiobjective Optimization Problems

Traversing Pareto Optimal Policies: Provably Efficient Multi-Objective Reinforcement Learning

Learn-n-Route: Learning implicit preferences for vehicle routing

Multiobjective Combinatorial Optimization Using a Single Deep Reinforcement Learning Model

Multi-Objective Order Scheduling via Reinforcement Learning

Inverse Optimization for Routing Problems

Integer-Programming-Based Narrow-Passage Multi-Robot Path Planning with Effective Heuristics

Smart City Public Transportation Route Planning Based on Multi-objective Optimization: A Review

C-MORL: Multi-Objective Reinforcement Learning through Efficient Discovery of Pareto Front

A Pareto Front-Based Multiobjective Path Planning Algorithm

Learning-based Preference Prediction for Constrained Multi-Criteria Path-Planning

Integer Programming as a General Solution Methodology for Path-Based Optimization in Robotics: Principles, Best Practices, and Applications

Enhancing Robotic Navigation: An Evaluation of Single and Multi-Objective Reinforcement Learning Strategies

Pareto-Optimal Transit Route Planning with Multi-Objective Monte-Carlo Tree Search

Multi-Objective Deep Reinforcement Learning for Crowd Route Guidance Optimization