Improved deep learning segmentation of outdoor point clouds with different sampling strategies and using intensities

Harintaka Harintaka, Calvin Wijaya
DOI: https://doi.org/10.1515/geo-2022-0611
2024-01-01
Open Geosciences
Abstract: The rapid growth of outdoor digital twin data sets and advancements in 3D data acquisition technology have sparked interest in improving segmentation performance using deep learning. This research aims to analyze and evaluate different sampling strategies and optimization techniques while exploring the intensity information of outdoor point cloud data. Two sampling strategies, random and stratified sampling, are employed to divide a limited data set. Additionally, the data set is divided into point cloud data with and without intensity. The PointNet++ model is used to segment the point cloud data into two classes, vegetation and structure. The results indicate that stratified sampling outperforms random sampling, yielding a considerable improvement in mean intersection over union scores of up to 10%. Interestingly, the inclusion of intensity information in the data set does not universally enhance performance. Although the use of intensity improves the performance of random sampling, it does not benefit stratified sampling. This research provides insights into the effectiveness of different sampling strategies for outdoor point cloud data segmentation. The findings can contribute to the development of optimized approaches to improving segmentation accuracy in outdoor digital twin applications using deep learning techniques.
geosciences, multidisciplinary
What problem does this paper attempt to address?
The main problem this paper addresses is how to improve the segmentation of outdoor point cloud data. Specifically, the research analyzes and evaluates different sampling strategies and optimization techniques, and explores how intensity information in outdoor point cloud data affects segmentation performance. It compares two sampling strategies, random sampling and stratified sampling, on data sets with and without intensity information, with the goal of improving the accuracy of a deep learning model in distinguishing two classes: vegetation and structure.

### Research Background and Objectives

With the rapid growth of outdoor digital twin data sets and progress in 3D data acquisition technology, there is increasing interest in using deep learning to improve segmentation performance. The paper pursues this goal in the following ways:

1. **Analyze and evaluate different sampling strategies**: The research compares random sampling and stratified sampling as ways of dividing a limited data set, and examines their impact on segmentation performance.
2. **Explore the role of intensity information**: The research examines how including or omitting intensity information in the point cloud data affects model performance.
3. **Use the PointNet++ model**: PointNet++ is selected for segmentation because it captures both local and global features of point cloud data well.

### Main Findings

- **Stratified sampling is superior to random sampling**: Stratified sampling outperforms random sampling considerably, improving the mean intersection over union (mIoU) score by up to 10%.
- **The impact of intensity information**: Intensity information improves performance under random sampling, but it does not benefit stratified sampling.
- **Prediction of small objects**: Under either sampling strategy, the model predicts small objects well, such as small potted vegetation, pavilions, and park lights.
- **Segmentation of structure-type objects**: Stratified sampling performs better when segmenting structure-type objects (such as dome buildings), while random sampling may misclassify some structures as vegetation.

### Conclusions

By comparing sampling strategies and the use of intensity information, this research provides useful insights for outdoor point cloud segmentation. The results show that stratified sampling has a clear advantage in segmentation accuracy, while intensity information improves model performance only in some cases. These findings can help in developing better-optimized methods and improving segmentation accuracy in outdoor digital twin applications.
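
The two sampling strategies and the mIoU metric discussed above can be illustrated with a minimal Python sketch. It is not the authors' implementation: the summary does not say how strata were defined, so the `stratum_of` key (here, a hypothetical "dominant class" label per tile), the 80/20 split, and all function names are assumptions made for illustration only.

```python
import random
from collections import defaultdict


def random_split(items, train_fraction=0.8, seed=0):
    """Plain random sampling: shuffle everything together and cut once."""
    rng = random.Random(seed)
    shuffled = list(items)
    rng.shuffle(shuffled)
    cut = round(len(shuffled) * train_fraction)
    return shuffled[:cut], shuffled[cut:]


def stratified_split(items, stratum_of, train_fraction=0.8, seed=0):
    """Stratified sampling: split each stratum separately so that the
    training and test subsets keep similar stratum proportions."""
    rng = random.Random(seed)
    groups = defaultdict(list)
    for item in items:
        groups[stratum_of(item)].append(item)
    train, test = [], []
    for key in sorted(groups):  # fixed order keeps the split reproducible
        members = groups[key]
        rng.shuffle(members)
        cut = round(len(members) * train_fraction)
        train.extend(members[:cut])
        test.extend(members[cut:])
    return train, test


def mean_iou(y_true, y_pred, classes=(0, 1)):
    """Mean of per-class IoU = |true AND pred| / |true OR pred|.
    Classes that appear in neither label list are skipped."""
    ious = []
    for c in classes:
        inter = sum(1 for t, p in zip(y_true, y_pred) if t == c and p == c)
        union = sum(1 for t, p in zip(y_true, y_pred) if t == c or p == c)
        if union:
            ious.append(inter / union)
    return sum(ious) / len(ious) if ious else float("nan")


# Hypothetical tiles: (tile name, assumed dominant class of that tile).
tiles = [(f"tile_{i:02d}", "vegetation" if i < 10 else "structure")
         for i in range(20)]
train, test = stratified_split(tiles, stratum_of=lambda tile: tile[1])
print(len(train), len(test))

# Per-point labels: 0 = vegetation, 1 = structure (assumed encoding).
print(mean_iou([0, 0, 1, 1], [0, 1, 1, 1]))
```

With a small data set, a single random cut can leave one class under-represented in the test subset, whereas the stratified version fixes the proportions by construction; this is one plausible reading of why the choice of split matters for a limited data set.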