From Two-Dimensional to Three-Dimensional Environment with Q-Learning: Modeling Autonomous Navigation with Reinforcement Learning and no Libraries

Ergon Cugler de Moraes Silva
2024-03-27
Abstract: Reinforcement learning (RL) algorithms have become indispensable tools in artificial intelligence, empowering agents to acquire optimal decision-making policies through interactions with their environment and feedback mechanisms. This study explores the performance of RL agents in both two-dimensional (2D) and three-dimensional (3D) environments, aiming to research the dynamics of learning across different spatial dimensions. A key aspect of this investigation is the absence of pre-made libraries for learning, with the algorithm developed exclusively through computational mathematics. The methodological framework centers on RL principles, employing a Q-learning agent class and distinct environment classes tailored to each spatial dimension. The research aims to address the question: How do reinforcement learning agents adapt and perform in environments of varying spatial dimensions, particularly in 2D and 3D settings? Through empirical analysis, the study evaluates agents' learning trajectories and adaptation processes, revealing insights into the efficacy of RL algorithms in navigating complex, multi-dimensional spaces. Reflections on the findings prompt considerations for future research, particularly in understanding the dynamics of learning in higher-dimensional environments.
Machine Learning, Artificial Intelligence, Computation
What problem does this paper attempt to address?
The problem this paper attempts to address is how Reinforcement Learning (RL) agents adapt and perform across spatial dimensions, specifically in 2D and 3D environments. The core issues of the study are:

- **Research Objective**: To explore the learning dynamics and adaptability of RL agents in 2D and 3D environments.
- **Main Question**: How do RL agents adapt and perform in environments of different spatial dimensions, particularly in 2D and 3D settings?

The study pursues this goal through mathematical modeling, without relying on any pre-existing learning libraries, in order to assess the effectiveness of RL algorithms in complex multidimensional spaces. Through empirical analysis, it evaluates the learning trajectories and adaptation processes of agents at different training stages, giving a closer view of how well RL algorithms navigate such spaces.
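To make the setup concrete, the following is a minimal sketch of a tabular Q-learning agent class paired with a grid environment class, written with only the Python standard library. It is not the paper's implementation. The class names (`GridEnvironment`, `QLearningAgent`), the grid sizes, the reward values (+10 at the goal, -1 per step), and the hyperparameters (`alpha`, `gamma`, `epsilon`) are all assumptions chosen for illustration. For brevity, one environment class covers both cases by taking the grid shape as a tuple, whereas the paper describes distinct classes per dimension.

```python
import random


class GridEnvironment:
    """Hypothetical grid world; 2D or 3D depending on the length of `size`."""

    def __init__(self, size, goal):
        self.size = tuple(size)
        self.goal = tuple(goal)
        self.start = tuple(0 for _ in self.size)
        # Two actions per axis: move -1 or +1 along that axis.
        self.actions = [(axis, delta)
                        for axis in range(len(self.size))
                        for delta in (-1, 1)]
        self.state = self.start

    def reset(self):
        self.state = self.start
        return self.state

    def step(self, action_index):
        axis, delta = self.actions[action_index]
        pos = list(self.state)
        # Clamp to the grid so a move into a wall leaves the agent in place.
        pos[axis] = min(max(pos[axis] + delta, 0), self.size[axis] - 1)
        self.state = tuple(pos)
        done = self.state == self.goal
        reward = 10.0 if done else -1.0  # assumed reward scheme
        return self.state, reward, done


class QLearningAgent:
    """Tabular Q-learning with an epsilon-greedy policy."""

    def __init__(self, n_actions, alpha=0.5, gamma=0.9, epsilon=0.1, seed=0):
        self.n_actions = n_actions
        self.alpha, self.gamma, self.epsilon = alpha, gamma, epsilon
        self.q = {}  # maps state -> list of action values
        self.rng = random.Random(seed)

    def values(self, state):
        return self.q.setdefault(state, [0.0] * self.n_actions)

    def choose(self, state, greedy=False):
        if not greedy and self.rng.random() < self.epsilon:
            return self.rng.randrange(self.n_actions)
        vals = self.values(state)
        best = max(vals)
        # Break ties at random among the best-valued actions.
        return self.rng.choice([i for i, v in enumerate(vals) if v == best])

    def update(self, state, action, reward, next_state, done):
        # Q(s,a) <- Q(s,a) + alpha * (r + gamma * max_a' Q(s',a') - Q(s,a))
        future = 0.0 if done else self.gamma * max(self.values(next_state))
        vals = self.values(state)
        vals[action] += self.alpha * (reward + future - vals[action])


def train(env, agent, episodes=2000, max_steps=200):
    for _ in range(episodes):
        state = env.reset()
        for _ in range(max_steps):
            action = agent.choose(state)
            next_state, reward, done = env.step(action)
            agent.update(state, action, reward, next_state, done)
            state = next_state
            if done:
                break


def greedy_path_length(env, agent, max_steps=200):
    """Steps the greedy policy needs to reach the goal, or None if it fails."""
    state = env.reset()
    for steps in range(1, max_steps + 1):
        state, _, done = env.step(agent.choose(state, greedy=True))
        if done:
            return steps
    return None


if __name__ == "__main__":
    for size, goal in [((4, 4), (3, 3)), ((3, 3, 3), (2, 2, 2))]:
        env = GridEnvironment(size, goal)
        agent = QLearningAgent(n_actions=len(env.actions))
        train(env, agent)
        print(len(size), "D greedy path length:", greedy_path_length(env, agent))
```

In this sketch, moving from 2D to 3D changes only the state tuple length and the number of actions (four versus six). The agent class is untouched, which is one way to compare learning behavior across dimensions under otherwise identical conditions. On grids this small the greedy path should approach the shortest route to the goal, though the number of episodes needed depends on the assumed hyperparameters.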