AI2-THOR: An Interactive 3D Environment for Visual AI

Eric Kolve, Roozbeh Mottaghi, Winson Han, Eli VanderBilt, Luca Weihs, Alvaro Herrasti, Matt Deitke, Kiana Ehsani, Daniel Gordon, Yuke Zhu, Aniruddha Kembhavi, Abhinav Gupta, Ali Farhadi
DOI: https://doi.org/10.48550/arXiv.1712.05474
2022-08-27
Abstract: We introduce The House Of inteRactions (THOR), a framework for visual AI research, available at http://ai2thor.allenai.org. AI2-THOR consists of near photo-realistic 3D indoor scenes, where AI agents can navigate in the scenes and interact with objects to perform tasks. AI2-THOR enables research in many different domains including but not limited to deep reinforcement learning, imitation learning, learning by interaction, planning, visual question answering, unsupervised representation learning, object detection and segmentation, and learning models of cognition. The goal of AI2-THOR is to facilitate building visually intelligent models and push the research forward in this domain.
Computer Vision and Pattern Recognition, Artificial Intelligence, Machine Learning
What problem does this paper attempt to address?
This paper addresses how to build an interactive 3D environment framework that supports visual AI research. Specifically, AI2-THOR provides a collection of near photo-realistic 3D indoor scenes in which AI agents can navigate and interact with objects to perform tasks. The framework supports a wide range of research areas, including but not limited to deep reinforcement learning, imitation learning, learning by interaction, planning, visual question answering, unsupervised representation learning, object detection and segmentation, and learning models of cognition. Its key contributions are rich interaction capabilities, a large number of interactive objects and scenes, high-quality image rendering, and a Python API that communicates with the Unity 3D game engine. These features make AI2-THOR well suited to developing visually intelligent models and advancing research in this field. Because the simulation approximates the real world, researchers can test and verify new algorithms without incurring the costs and risks of real-world experiments.
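To make the navigate-and-interact loop concrete, here is a minimal sketch of an agent sending a sequence of actions to a simulator and checking whether each one succeeded. It assumes a controller object whose `step(action=...)` method returns an event carrying a `metadata` dictionary with a `lastActionSuccess` flag, which is how the `ai2thor` Python package is understood to behave; the summary above does not spell this out. The `run_actions` helper and the `StubController` class are hypothetical names introduced only for illustration, and the stub replaces the real Unity-backed simulator so the example runs without installing anything.

```python
from types import SimpleNamespace


def run_actions(controller, actions):
    """Send each action to the controller and record whether it succeeded.

    `controller` is assumed to expose step(action=...) returning an event
    with a `metadata` dict containing "lastActionSuccess".
    """
    results = []
    for action in actions:
        event = controller.step(action=action)
        results.append((action, bool(event.metadata["lastActionSuccess"])))
    return results


class StubController:
    """Hypothetical stand-in for a simulator controller (not part of AI2-THOR)."""

    def __init__(self, blocked=()):
        # Actions listed here are reported as failed, e.g. walking into a wall.
        self.blocked = set(blocked)

    def step(self, action):
        success = action not in self.blocked
        return SimpleNamespace(metadata={"lastActionSuccess": success})


if __name__ == "__main__":
    stub = StubController(blocked={"MoveAhead"})
    for action, ok in run_actions(stub, ["RotateRight", "MoveAhead"]):
        print(action, "succeeded" if ok else "failed")
```

With the real package installed, the same helper would presumably be called with an `ai2thor.controller.Controller` instance in place of the stub; that substitution is an assumption and has not been verified here.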