BIMCaP: BIM-based AI-supported LiDAR-Camera Pose Refinement

Miguel Arturo Vega Torres, Anna Ribic, Borja García de Soto, André Borrmann
2024-12-05
Abstract: This paper introduces BIMCaP, a novel method to integrate mobile 3D sparse LiDAR data and camera measurements with pre-existing building information models (BIMs), enhancing fast and accurate indoor mapping with affordable sensors. BIMCaP refines sensor poses by leveraging a 3D BIM and employing a bundle adjustment technique to align real-world measurements with the model. Experiments using real-world open-access data show that BIMCaP achieves superior accuracy, reducing translational error by over 4 cm compared to current state-of-the-art methods. This advancement enhances the accuracy and cost-effectiveness of 3D mapping methodologies like SLAM. BIMCaP's improvements benefit various fields, including construction site management and emergency response, by providing up-to-date, aligned digital maps for better decision-making and productivity. Link to the repository: https://github.com/MigVega/BIMCaP
Robotics, Artificial Intelligence
What problem does this paper attempt to address?
The paper aims to improve the accuracy and cost-effectiveness of 3D reconstruction in complex, dynamic environments such as construction sites. The authors propose BIMCaP, a method that integrates mobile sparse 3D LiDAR data and camera measurements with a pre-existing building information model (BIM), enabling fast and accurate indoor mapping with low-cost sensors.

### Main problems:

1. **Limitations of existing 3D data acquisition methods**:
   - Traditional 3D data acquisition methods (such as terrestrial laser scanning) are costly and time-consuming, especially in scenarios such as construction site monitoring or disaster rescue.
   - Even state-of-the-art Simultaneous Localization and Mapping (SLAM) algorithms struggle in complex, dynamic environments (such as construction sites) and cannot achieve high-precision 3D reconstruction there.
2. **Accuracy of sensor pose correction**:
   - SLAM algorithms accumulate drift over long runs, which makes the estimated sensor poses inaccurate.
   - An effective method is needed that uses the geometric and semantic information in the BIM to refine the sensor poses, reducing drift and improving the accuracy of the 3D reconstruction.

### Solutions:

BIMCaP addresses these problems in the following ways:

- **Fusion of LiDAR and camera data**: Generate accurate depth maps and project pixel information into 3D space.
- **Semantic segmentation**: Detect permanent elements in the image (such as walls, pillars, and floors) and align them with the information in the BIM.
- **Bundle adjustment optimization**: Use a custom cost function to iteratively optimize the sensor poses so that the generated map is optimally aligned with the semantic vectorized floor plan derived from the BIM.
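The pose-refinement idea can be illustrated with a much-simplified sketch. It is not the paper's bundle adjustment or its custom cost function: it assumes a single 2D pose `(tx, ty, yaw)`, wall points that are already matched to floor-plan wall lines, and a plain Gauss-Newton solver. The names `refine_pose_2d` and `solve3` and the wall representation `(nx, ny, d)` are hypothetical.

```python
import math

def solve3(H, b):
    """Solve a 3x3 linear system by Gaussian elimination with partial pivoting."""
    M = [row[:] + [rhs] for row, rhs in zip(H, b)]
    for col in range(3):
        piv = max(range(col, 3), key=lambda r: abs(M[r][col]))
        M[col], M[piv] = M[piv], M[col]
        for r in range(col + 1, 3):
            f = M[r][col] / M[col][col]
            for k in range(col, 4):
                M[r][k] -= f * M[col][k]
    x = [0.0, 0.0, 0.0]
    for r in (2, 1, 0):
        acc = sum(M[r][k] * x[k] for k in range(r + 1, 3))
        x[r] = (M[r][3] - acc) / M[r][r]
    return x

def refine_pose_2d(points, walls, pose=(0.0, 0.0, 0.0), iters=20):
    """Refine (tx, ty, yaw) so sensor-frame points fall on their matched walls.

    points: (x, y) wall observations in the sensor frame.
    walls:  one (nx, ny, d) per point, describing the floor-plan line
            nx*x + ny*y = d with a unit normal (assumed representation).
    """
    tx, ty, yaw = pose
    for _ in range(iters):
        c, s = math.cos(yaw), math.sin(yaw)
        H = [[0.0] * 3 for _ in range(3)]  # J^T J
        g = [0.0] * 3                      # J^T r
        for (px, py), (nx, ny, d) in zip(points, walls):
            # Transform the point into the floor-plan frame.
            wx = c * px - s * py + tx
            wy = s * px + c * py + ty
            r = nx * wx + ny * wy - d      # signed point-to-wall distance
            J = (nx, ny, nx * (-s * px - c * py) + ny * (c * px - s * py))
            for a in range(3):
                g[a] += J[a] * r
                for b in range(3):
                    H[a][b] += J[a] * J[b]
        for a in range(3):
            H[a][a] += 1e-9                # tiny damping for numerical safety
        dx = solve3(H, [-v for v in g])
        tx, ty, yaw = tx + dx[0], ty + dx[1], yaw + dx[2]
    return tx, ty, yaw
```

Point-to-line residuals are used here because walls constrain a pose only along their normal direction; a full pipeline would also have to establish the point-to-wall matches, which this sketch takes as given.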
### Experimental results:

Experiments on real-world open-access data show that BIMCaP is more accurate than current state-of-the-art methods, reducing the translational error by more than 4 cm. BIMCaP also reduces the rotational error, most notably when the input poses are perturbed by rotational offsets only.

### Application prospects:

BIMCaP improves the accuracy and cost-effectiveness of 3D mapping and provides up-to-date digital maps aligned with the BIM for fields such as construction site management and emergency response, supporting better decision-making and higher productivity.

### Formula representation:

The formulas in the paper include:

- Modeling of the translation offset: \(\Delta T_{i + 1}\sim N(\Delta t_i,\sigma_t^2)\), where \(\Delta T_1\sim N(0,\sigma_t^2)\).
- Modeling of the rotation offsets: \(\Delta\phi\sim N(0,\sigma_p^2)\), \(\Delta\theta\sim N(0,\sigma_{th}^2)\).

These formulas are used to generate synthetic trajectories that simulate the output of a SLAM framework, against which the effectiveness of BIMCaP is evaluated.
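The offset model above can be sketched in a few lines of standard-library Python. This is an illustration under assumptions not stated in the summary: the translation offset is treated as an independent random walk per axis, the rotation offsets are redrawn for every pose, and the name `simulate_offsets` and its parameters are hypothetical.

```python
import random

def simulate_offsets(n_poses, sigma_t, sigma_phi, sigma_theta, seed=0):
    """Draw synthetic pose offsets that mimic SLAM drift.

    Translation follows a random walk: each new offset is sampled from a
    normal distribution centred on the previous offset (zero for the first
    pose). Rotation offsets are zero-mean and independent per pose
    (an assumption of this sketch).
    """
    rng = random.Random(seed)
    offsets = []
    dt = [0.0, 0.0, 0.0]  # mean of the first translation offset is zero
    for _ in range(n_poses):
        # Delta T_{i+1} ~ N(Delta t_i, sigma_t^2), applied per axis
        dt = [rng.gauss(mean, sigma_t) for mean in dt]
        dphi = rng.gauss(0.0, sigma_phi)      # Delta phi ~ N(0, sigma_p^2)
        dtheta = rng.gauss(0.0, sigma_theta)  # Delta theta ~ N(0, sigma_th^2)
        offsets.append((tuple(dt), dphi, dtheta))
    return offsets

# Example: 100 poses with 5 cm translational and 0.01 rad rotational spread.
drift = simulate_offsets(100, sigma_t=0.05, sigma_phi=0.01, sigma_theta=0.01)
```

Because each translation offset is centred on the previous one, the error accumulates along the trajectory, which is the drift behaviour the refinement is meant to correct.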