Abstract:The interactive decision-making in multi-agent autonomous racing offers insights valuable beyond the domain of self-driving cars. Mapless online path planning is particularly of practical appeal but poses a challenge for safely overtaking opponents due to the limited planning horizon. To address this, we introduce RaceMOP, a novel method for mapless online path planning designed for multi-agent racing of F1TENTH cars. Unlike classical planners that rely on predefined racing lines, RaceMOP operates without a map, utilizing only local observations to execute high-speed overtaking maneuvers. Our approach combines an artificial potential field method as a base policy with residual policy learning to enable long-horizon planning. We advance the field by introducing a novel approach for policy fusion with the residual policy directly in probability space. Extensive experiments on twelve simulated racetracks validate that RaceMOP is capable of long-horizon decision-making with robust collision avoidance during overtaking maneuvers. RaceMOP demonstrates superior handling over existing mapless planners and generalizes to unknown racetracks, affirming its potential for broader applications in robotics. Our code is available at <a class="link-external link-http" href="http://github.com/raphajaner/racemop" rel="external noopener nofollow">this http URL</a>.

What problem does this paper attempt to address?

This paper attempts to solve the problem of map - free online path planning in multi - agent autonomous racing, especially to achieve safe overtaking operations without pre - drawn map data. Specifically, the paper proposes a new method named RaceMOP for map - free online path planning in multi - agent autonomous racing of F1TENTH racing cars. ### Main problems 1. **Path planning in map - free environments**: Traditional path planning methods usually rely on predefined race routes and existing map data. However, in unknown or dynamically changing environments, it is impractical to obtain and use map data. Therefore, how to perform efficient path planning without a map is a key issue. 2. **Safe overtaking**: In multi - agent autonomous racing, safely overtaking opponents is a challenging task. Due to the limited planning horizon, traditional methods are difficult to ensure the safety during the overtaking process, especially in complex track environments. 3. **Long - horizon decision - making**: In order to achieve successful overtaking, vehicles need to have long - horizon decision - making capabilities, that is, they can predict and respond to the opponents' behaviors in multiple future time steps. This requires the algorithm to not only rely on the current local observations but also have the ability to reason about future situations. ### RaceMOP's solutions - **Map - free online path planning**: RaceMOP uses only the vehicle's local sensors (such as 2D - LiDAR) for path planning without any pre - drawn map data. This method makes the system more flexible and can be quickly deployed in unknown environments. - **Residual Policy Learning (RPL)**: RaceMOP combines the Artificial Potential Field (APF) as a base strategy and improves the vehicle's control actions through Residual Policy Learning. RPL uses a Deep Neural Network (DNN) to train a residual policy to correct the output of the base strategy, thereby achieving better long - horizon decision - making capabilities. - **Policy fusion in probability space**: The paper proposes a novel policy fusion method that directly combines the base strategy and the residual strategy in the probability space. This fusion method can effectively limit the action space and ensure the rationality and robustness of actions. ### Experimental verification Through extensive experiments on 12 simulated tracks, including 4 unseen tracks, the superior performance of RaceMOP in long - horizon decision - making and collision avoidance has been verified. The experimental results show that RaceMOP not only performs well on known tracks but also generalizes well to unknown environments. ### Summary The main contributions of the paper are: - Proposing a map - free online path planning method RaceMOP, which is suitable for multi - agent autonomous racing. - Introducing a new policy fusion method that directly combines the base strategy and the residual strategy in the probability space. - Verifying the generalization ability and robustness of RaceMOP through experiments, especially in overtaking operations in complex track environments. These contributions provide new ideas and technical support for autonomous racing and the broader field of robot path planning.

RaceMOP: Mapless Online Path Planning for Multi-Agent Autonomous Racing using Residual Policy Learning

Residual Policy Learning Facilitates Efficient Model-Free Autonomous Racing

CrowdMove: Autonomous Mapless Navigation in Crowded Scenarios

Polynomial-based Online Planning for Autonomous Drone Racing in Dynamic Environments

Mapless-Planner: A Robust and Fast Planning Framework for Aggressive Autonomous Flight Without Map Fusion

Deep High-Level Policy Model Predictive Contour Control for Autonomous Racing.

Learning the Subsystem of Local Planning for Autonomous Racing

A Novel Trajectory Planning Method Based on Trust Region Policy Optimization

Residual Policy Learning for Vehicle Control of Autonomous Racing Cars

A Data-Driven Aggressive Autonomous Racing Framework Utilizing Local Trajectory Planning with Velocity Prediction

High-performance Racing on Unmapped Tracks using Local Maps

Learning from Demonstration with Hierarchical Policy Abstractions Toward High-Performance and Courteous Autonomous Racing

Scaling Lifelong Multi-Agent Path Finding to More Realistic Settings: Research Challenges and Opportunities

Efficient and Balanced Exploration-driven Decision Making for Autonomous Racing Using Local Information

Autonomous Racing with Multiple Vehicles using a Parallelized Optimization with Safety Guarantee using Control Barrier Functions

Safe and Efficient Manoeuvring for Emergency Vehicles in Autonomous Traffic using Multi-Agent Proximal Policy Optimisation

Sampling-Based Motion Planning with Online Racing Line Generation for Autonomous Driving on Three-Dimensional Race Tracks

MAP3F: a decentralized approach to multi-agent pathfinding and collision avoidance with scalable 1D, 2D, and 3D feature fusion

Multi-policy Soft Actor-Critic Reinforcement Learning for Autonomous Racing

Efficient Multi-agent Navigation with Lightweight DRL Policy

High-Speed Robot Navigation using Predicted Occupancy Maps