DFSNet: A 3D Point Cloud Segmentation Network toward Trees Detection in an Orchard Scene

Xinrong Bu, Chao Liu, Hui Liu, Guanxue Yang, Yue Shen, Jie Xu
DOI: https://doi.org/10.3390/s24072244
IF: 3.9
2024-04-01
Sensors
Abstract: In order to guide orchard management robots in orchard production tasks such as autonomous navigation and precision spraying, this research proposes a deep-learning network called the dynamic fusion segmentation network (DFSNet). The network contains a local feature aggregation (LFA) layer and a dynamic fusion segmentation architecture. The LFA layer uses positional encoders for the initial embedding transformation and progressively aggregates local patterns through a multi-stage hierarchy. The fusion segmentation module (Fus-Seg) can form point tags by learning a multi-embedding space, and the generated tags can further mine the point cloud features. In the experiments, DFSNet was evaluated on a dataset of orchard fields, achieving an accuracy of 89.43% and an mIoU of 74.05%. DFSNet outperforms other semantic segmentation networks, such as PointNet, PointNet++, D-PointNet++, DGCNN, and Point-NN, with improved accuracies over them by 11.73%, 3.76%, 2.36%, and 2.74%, respectively, and improved mIoUs over these networks by 28.19%, 9.89%, 6.33%, 9.89%, and 24.69%, respectively, on the all-scale dataset (simple-scale dataset + complex-scale dataset). The proposed DFSNet can capture more information from orchard scene point clouds and provide more accurate point cloud segmentation results, which are beneficial to the management of orchards.
Engineering, Electrical & Electronic; Chemistry, Analytical; Instruments & Instrumentation
What problem does this paper attempt to address?
This paper addresses 3D point cloud segmentation for tree detection in orchard scenes. Specifically, the authors propose a deep-learning network named Dynamic Fusion Segmentation Network (DFSNet), aiming to improve the environmental perception of orchard management robots in tasks such as autonomous navigation and precision spraying. By introducing the Local Feature Aggregation (LFA) module and the Dynamic Fusion Segmentation (Fus-Seg) architecture, DFSNet captures information from orchard point cloud data more effectively and provides more accurate point cloud segmentation results.

### Background and Problems of the Paper

1. **Requirements of Modern Agriculture**:
   - The development of modern agriculture and smart agriculture has enabled the orchard industry to create huge economic value.
   - Orchard management machines still rely on manual assessment of tree growth, which is both time-consuming and labor-intensive.
2. **Requirements for High-Precision Semantic Segmentation Technology**:
   - High-precision semantic segmentation is the key technology that lets orchard management robots perceive their environment.
   - Point cloud technology is widely used in the visual perception tasks of orchard management robots because it is insensitive to factors such as illumination and shadow.
3. **Limitations of Existing Methods**:
   - Existing point cloud segmentation methods struggle with the non-uniform, sparse, and unordered point clouds found in orchard scenes.
   - Improved methods are needed to strengthen point cloud segmentation, especially in terms of feature fusion between different levels.

### Solutions Proposed in the Paper

1. **DFSNet Network Structure**:
   - **Local Feature Aggregation (LFA) Module**: Uses a positional encoder for the initial embedding and gradually aggregates local patterns through a multi-stage hierarchy.
   - **Dynamic Fusion Segmentation (Fus-Seg) Architecture**: Generates point labels by learning a multi-embedding space; the generated labels are then used to further mine point cloud features.
2. **Experimental Verification**:
   - Experiments were carried out on the orchard scene dataset, where DFSNet reached an accuracy of 89.43% and an mIoU (mean Intersection over Union) of 74.05%.
   - DFSNet achieves higher accuracy and mIoU than other semantic segmentation networks such as PointNet, PointNet++, D-PointNet++, DGCNN, and Point-NN.

### Main Contributions

1. **Proposed a deep-learning network for 3D point cloud semantic segmentation applicable to agricultural scenes**.
2. **Designed a simple but efficient network architecture that can fuse features at different levels in the network**.
3. **Discussed the influence of different sampling strategies in the local feature aggregation module**.
4. **Provided an end-to-end implementation from data annotation to network training and prediction, which is suitable for semantic segmentation in natural orchard scenes**.

Through these innovations, DFSNet can provide more accurate environmental perception in orchard management, thereby improving the efficiency and quality of orchard management.
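
For readers unfamiliar with the two reported metrics, the following is a minimal sketch of how overall accuracy and mIoU are commonly computed from per-point labels using a confusion matrix. It is not taken from the paper: the function names, the two-class labeling (0 = background, 1 = tree), and the choice to skip classes that appear in neither the labels nor the predictions are all assumptions made for illustration, and the authors' evaluation protocol may differ in such details.

```python
from typing import List, Sequence


def confusion_matrix(y_true: Sequence[int], y_pred: Sequence[int],
                     num_classes: int) -> List[List[int]]:
    """Rows index the ground-truth class, columns the predicted class."""
    cm = [[0] * num_classes for _ in range(num_classes)]
    for t, p in zip(y_true, y_pred):
        cm[t][p] += 1
    return cm


def overall_accuracy(cm: List[List[int]]) -> float:
    """Fraction of points whose predicted class equals the ground truth."""
    total = sum(sum(row) for row in cm)
    correct = sum(cm[c][c] for c in range(len(cm)))
    return correct / total if total else 0.0


def mean_iou(cm: List[List[int]]) -> float:
    """Mean over classes of TP / (TP + FP + FN)."""
    ious = []
    for c in range(len(cm)):
        tp = cm[c][c]
        fn = sum(cm[c]) - tp                 # true class c, predicted otherwise
        fp = sum(row[c] for row in cm) - tp  # predicted c, true class differs
        denom = tp + fp + fn
        if denom:  # assumption: skip classes absent from labels and predictions
            ious.append(tp / denom)
    return sum(ious) / len(ious) if ious else 0.0


# Toy per-point labels (hypothetical): 0 = background, 1 = tree
y_true = [0, 0, 1, 1, 1, 0]
y_pred = [0, 1, 1, 1, 0, 0]
cm = confusion_matrix(y_true, y_pred, num_classes=2)
print(overall_accuracy(cm), mean_iou(cm))
```

Because mIoU averages the per-class IoU, a class with few points counts as much as a dominant one, which is why it is usually reported alongside overall accuracy for segmentation tasks.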