Scan-to-BIM for As-built Roads: Automatic Road Digital Twinning from Semantically Labeled Point Cloud Data

Yuexiong Ding,Mengtian Yin,Ran Wei,Ioannis Brilakis,Muyang Liu,Xiaowei Luo
2024-06-18
Abstract:Creating geometric digital twins (gDT) for as-built roads still faces many challenges, such as low automation level and accuracy, limited asset types and shapes, and reliance on engineering experience. A novel scan-to-building information modeling (scan-to-BIM) framework is proposed for automatic road gDT creation based on semantically labeled point cloud data (PCD), which considers six asset types: Road Surface, Road Side (Slope), Road Lane (Marking), Road Sign, Road Light, and Guardrail. The framework first segments the semantic PCD into spatially independent instances or parts, then extracts the sectional polygon contours as their representative geometric information, stored in JavaScript Object Notation (JSON) files using a new data structure. Primitive gDTs are finally created from JSON files using corresponding conversion algorithms. The proposed method achieves an average distance error of 1.46 centimeters and a processing speed of 6.29 meters per second on six real-world road segments with a total length of 1,200 meters.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
### Problems the Paper Aims to Solve The paper aims to address multiple challenges encountered in the creation of existing road geometric digital twins (gDT), including low automation, insufficient accuracy, limited asset types and shapes, and reliance on engineering experience. Specifically, the paper proposes an automatic scan-to-BIM framework based on semantically annotated point cloud data (PCD) for creating geometric digital twins of roads. The framework mainly addresses the following issues: 1. **Low automation**: Existing methods require a significant amount of manual intervention when extracting geometric information of road assets, leading to inefficiency. 2. **Insufficient accuracy**: Existing methods have large errors when creating geometric digital twins, failing to meet high-precision requirements. 3. **Limited asset types and shapes**: Existing methods mainly focus on extracting road assets with regular shapes (such as pavements and lane markings), while providing insufficient support for assets with arbitrary shapes (such as road signs, street lights, and guardrails). 4. **Reliance on engineering experience**: Existing methods rely on specific engineering experience and standards when processing input data due to limited or low-quality information, which restricts their flexibility and accuracy in practical applications. To solve these problems, the paper proposes a new framework that achieves the following steps: 1. **Instance segmentation**: Using density clustering algorithms (such as DBSCAN) to segment point cloud data with the same semantic labels into multiple spatially independent instances. 2. **Geometric information extraction**: Developing specific algorithms for different types of assets (planar, pole-like, and guardrails) to extract their geometric information. For example, for pole-like assets, using a deep learning part segmentation model to further segment them into specific parts. 3. **Geometric information storage**: Storing the extracted geometric information in a structured and compressed manner in JSON files, rather than directly converting it into specific 3D/BIM file formats. 4. **Geometric digital twin creation**: Developing corresponding conversion algorithms to construct the original geometric digital twin representation from the extracted and stored geometric information. The framework was validated on six actual road segments with a total length of 1200 meters, achieving an average distance error of 1.46 centimeters and an average processing speed of 6.29 meters per second.