RaceMOP: Mapless Online Path Planning for Multi-Agent Autonomous Racing using Residual Policy Learning

Raphael Trumpp, Ehsan Javanmardi, Jin Nakazato, Manabu Tsukada, Marco Caccamo
2024-09-16
Abstract: The interactive decision-making in multi-agent autonomous racing offers insights valuable beyond the domain of self-driving cars. Mapless online path planning is particularly of practical appeal but poses a challenge for safely overtaking opponents due to the limited planning horizon. To address this, we introduce RaceMOP, a novel method for mapless online path planning designed for multi-agent racing of F1TENTH cars. Unlike classical planners that rely on predefined racing lines, RaceMOP operates without a map, utilizing only local observations to execute high-speed overtaking maneuvers. Our approach combines an artificial potential field method as a base policy with residual policy learning to enable long-horizon planning. We advance the field by introducing a novel approach for policy fusion with the residual policy directly in probability space. Extensive experiments on twelve simulated racetracks validate that RaceMOP is capable of long-horizon decision-making with robust collision avoidance during overtaking maneuvers. RaceMOP demonstrates superior handling over existing mapless planners and generalizes to unknown racetracks, affirming its potential for broader applications in robotics. Our code is available at http://github.com/raphajaner/racemop.
Robotics
What problem does this paper attempt to address?
This paper addresses map-free online path planning in multi-agent autonomous racing, in particular how to overtake safely without pre-built map data. It proposes RaceMOP, a map-free online path planning method for multi-agent racing of F1TENTH cars.

### Main problems

1. **Path planning in map-free environments**: Traditional path planning methods usually rely on predefined racing lines and existing map data. In unknown or dynamically changing environments, obtaining and using map data is impractical, so planning efficiently without a map is a key issue.
2. **Safe overtaking**: In multi-agent autonomous racing, safely overtaking opponents is challenging. Because of the limited planning horizon, traditional methods struggle to ensure safety during overtaking, especially on complex tracks.
3. **Long-horizon decision-making**: To overtake successfully, a vehicle needs long-horizon decision-making: it must anticipate and respond to opponents' behavior over multiple future time steps. The algorithm therefore cannot rely on current local observations alone; it must also reason about future situations.

### RaceMOP's solutions

- **Map-free online path planning**: RaceMOP plans using only the vehicle's local sensors (such as 2D LiDAR), without any pre-built map data. This makes the system more flexible and quick to deploy in unknown environments.
- **Residual Policy Learning (RPL)**: RaceMOP uses an Artificial Potential Field (APF) method as the base policy and improves the vehicle's control actions through Residual Policy Learning. RPL trains a Deep Neural Network (DNN) as a residual policy that corrects the output of the base policy, which gives better long-horizon decision-making.
- **Policy fusion in probability space**: The paper proposes a novel fusion method that combines the base policy and the residual policy directly in probability space. This fusion limits the action space and keeps the resulting actions reasonable and robust.

### Experimental verification

Extensive experiments on 12 simulated tracks, including 4 unseen tracks, verify RaceMOP's superior performance in long-horizon decision-making and collision avoidance. The results show that RaceMOP performs well on known tracks and also generalizes to unknown environments.

### Summary

The main contributions of the paper are:

- Proposing RaceMOP, a map-free online path planning method suited to multi-agent autonomous racing.
- Introducing a new policy fusion method that combines the base policy and the residual policy directly in probability space.
- Verifying the generalization ability and robustness of RaceMOP through experiments, especially for overtaking on complex tracks.

These contributions provide new ideas and technical support for autonomous racing and the broader field of robot path planning.
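The combination of an APF base policy with a residual correction can be illustrated with a toy example. The sketch below is not the paper's implementation: the text above does not specify the potential field or the fusion rule, so both functions are assumptions made for illustration. `apf_steering` is a hypothetical, simplified potential field over a LiDAR scan. `fuse_truncated_gaussian` shows one possible reading of "fusion in probability space": the base action shifts the mean of a Gaussian whose offset and standard deviation would come from the residual network, and the distribution is truncated to the valid steering range. All names, gains, and bounds are made up.

```python
import math
import random
from statistics import NormalDist

STD_NORMAL = NormalDist()  # standard normal, used for CDF and inverse CDF


def apf_steering(ranges, angles, influence=2.0, gain=0.5, max_steer=0.4):
    """Toy artificial potential field (assumed form): a constant forward pull
    plus repulsion from every LiDAR return closer than `influence` metres."""
    fx, fy = 1.0, 0.0  # attractive force pointing straight ahead
    for r, theta in zip(ranges, angles):
        if 0.0 < r < influence:
            push = gain * (1.0 / r - 1.0 / influence)
            fx -= push * math.cos(theta)  # push away from the obstacle
            fy -= push * math.sin(theta)
    steer = math.atan2(fy, fx)  # heading of the resulting force
    return max(-max_steer, min(max_steer, steer))


def fuse_truncated_gaussian(base_action, residual_mean, residual_std, low, high, rng):
    """Assumed fusion rule: sample from a Gaussian centred at
    base_action + residual_mean, truncated to [low, high] (inverse-CDF sampling)."""
    mu = base_action + residual_mean
    a = STD_NORMAL.cdf((low - mu) / residual_std)
    b = STD_NORMAL.cdf((high - mu) / residual_std)
    p = a + rng.random() * (b - a)
    p = min(max(p, 1e-12), 1.0 - 1e-12)  # keep inv_cdf's argument inside (0, 1)
    sample = mu + residual_std * STD_NORMAL.inv_cdf(p)
    return min(max(sample, low), high)  # guard against rounding at the bounds


rng = random.Random(0)
scan_angles = [-math.pi / 4, 0.0, math.pi / 4]  # right, ahead, left
base = apf_steering([5.0, 5.0, 1.0], scan_angles)  # obstacle close on the left
# residual_mean and residual_std stand in for the outputs of a trained DNN
action = fuse_truncated_gaussian(base, 0.1, 0.2, -0.4, 0.4, rng)
```

In this sketch, truncation is used instead of adding the residual and clipping afterwards because clipping would pile probability mass onto the steering limits, whereas a truncated distribution keeps a proper density over the allowed range. Whether RaceMOP uses this particular distribution is not stated in the summary above.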