Abstract:Deep reinforcement learning (RL) is achieving significant success in various applications like control, robotics, games, resource management, and scheduling. However, the important problem of emergency evacuation, which clearly could benefit from RL, has been largely unaddressed. Indeed, emergency evacuation is a complex task that is difficult to solve with RL. An emergency situation is highly dynamic, with a lot of changing variables and complex constraints that make it challenging to solve. Also, there is no standard benchmark environment available that can be used to train RL agents for evacuation. A realistic environment can be complex to design. In this article, we propose the first fire evacuation environment to train RL agents for evacuation planning. The environment is modeled as a graph capturing the building structure. It consists of realistic features like fire spread, uncertainty, and bottlenecks. The implementation of our environment is in the OpenAI gym format, to facilitate future research. We also propose a new RL approach that entails pretraining the network weights of a DQN-based agent [DQN/Double-DQN (DDQN)/Dueling-DQN] to incorporate information on the shortest path to the exit. We achieved this by using tabular $Q$ -learning to learn the shortest path on the building model’s graph. This information is transferred to the network by deliberately overfitting it on the $Q$ -matrix. Then, the pretrained DQN model is trained on the fire evacuation environment to generate the optimal evacuation path under time varying conditions due to fire spread, bottlenecks, and uncertainty. We perform comparisons of the proposed approach with state-of-the-art RL algorithms like DQN, DDQN, Dueling-DQN, PPO, VPG, state-action-reward-state-action (SARSA), actor–critic method, and ACKTR. The results show that our method is able to outperform state-of-the-art models by a huge margin including the original DQN-based models. Finally, our model is tested on a large and complex real building consisting of 91 rooms, with the possibility to move to any other room, hence giving 8281 actions. In order to reduce the action space, we propose a strategy that involves one step simulation. That is, an action importance vector is added to the final output of the pretrained DQN and acts like an attention mechanism. Using this strategy, the action space is reduced by 90.1%. In this manner, the model is able to deal with large action spaces. Hence, our model achieves near optimal performance on the real world emergency environment.

Reinforcement Learning Based Escape Route Generation in Low Visibility Environments

EvacuAI: An Analysis of Escape Routes in Indoor Environments with the Aid of Reinforcement Learning

Deep Q-Learning With Q-Matrix Transfer Learning for Novel Fire Evacuation Environment

Robot-Assisted Pedestrian Evacuation in Fire Scenarios Based on Deep Reinforcement Learning

Optimal path planning in real time for dynamic building fire rescue operations using wireless sensors and visual guidance

Dual deep Q-learning network guiding a multiagent path planning approach for virtual fire emergency scenarios

A Metaverse-Based Teaching Building Evacuation Training System With Deep Reinforcement Learning

Rescue path planning for urban flood: A deep reinforcement learning-based approach

Reinforcement learning for safe evacuation time of fire in Hong Kong-Zhuhai-Macau immersed tube tunnel

Deep reinforcement learning with a particle dynamics environment applied to emergency evacuation of a room with obstacles

Vision-based navigation and obstacle avoidance via deep reinforcement learning

Indoor AR Navigation and Emergency Evacuation System Based on Machine Learning and IoT Technologies

Learning to Assess Danger from Movies for Cooperative Escape Planning in Hazardous Environments

BIM and Computer Vision-Based Framework for Fire Emergency Evacuation Considering Local Safety Performance

Deep adaptive learning for safe and efficient navigation of pedestrian dynamics

Smart Fire Evacuation Service Based on Internet of Things Computing for Web3D

Deep reinforcement learning and 3D physical environments applied to crowd evacuation in congested scenarios

Design of Intelligent Firefighting and Smart Escape Route Planning System Based on Improved Ant Colony Algorithm

Researches advanced in path planning to indoor fire escape and rescue based on SLAM

A path planning method based on deep reinforcement learning for crowd evacuation

Optimal Wildfire Escape Route Planning for Drones under Dynamic Fire and Smoke