FinnWoodlands Dataset

Juan Lagos, Urho Lempiö, Esa Rahtu
DOI: https://doi.org/10.1007/978-3-031-31435-3_7
2023-04-03
Abstract: While the availability of large and diverse datasets has contributed to significant breakthroughs in autonomous driving and indoor applications, forestry applications are still lagging behind and new forest datasets would most certainly contribute to achieving significant progress in the development of data-driven methods for forest-like scenarios. This paper introduces a forest dataset called FinnWoodlands, which consists of RGB stereo images, point clouds, and sparse depth maps, as well as ground truth manual annotations for semantic, instance, and panoptic segmentation. FinnWoodlands comprises a total of 4226 manually annotated objects, out of which 2562 objects (60.6%) correspond to tree trunks classified into three different instance categories, namely "Spruce Tree", "Birch Tree", and "Pine Tree". Besides tree trunks, we also annotated "Obstacles" objects as instances as well as the semantic stuff classes "Lake", "Ground", and "Track". Our dataset can be used in forestry applications where a holistic representation of the environment is relevant. We provide an initial benchmark using three models for instance segmentation, panoptic segmentation, and depth completion, and illustrate the challenges that such unstructured scenarios introduce.
Computer Vision and Pattern Recognition, Artificial Intelligence
What problem does this paper attempt to address?
The paper mainly addresses the following issues:

1. **Dataset Scarcity**: Most existing public datasets are derived from urban environments, which aids the development of applications such as autonomous driving. Datasets for unstructured environments like forests are relatively scarce, which hinders research and development in fields such as forest management and agricultural applications.
2. **Lack of Comprehensive Scene Understanding Datasets**: Forest datasets that provide panoptic segmentation ground truth annotations are extremely rare. Such datasets are crucial for comprehensive scene understanding tasks such as object detection, semantic segmentation, and instance segmentation in forest environments.
3. **Data Collection Challenges**: Collecting high-quality data in forests is challenging because the environment is complex, variable, and difficult to control. Factors such as seasonal changes and vegetation diversity add to the complexity.

To address these issues, the paper contributes a dataset named FinnWoodlands. It includes stereo RGB images, LiDAR point clouds, sparse depth maps, and manual annotations for semantic, instance, and panoptic segmentation, all collected in Finnish forests. The paper also provides benchmark results for three models (Mask R-CNN for instance segmentation, EfficientPS for panoptic segmentation, and FuseNet for depth map completion) to demonstrate the challenges of forest scene analysis and to offer a starting point for future research.
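
The annotation scheme described in the abstract separates countable "thing" classes (annotated as instances) from amorphous "stuff" classes (annotated only semantically), which is the distinction panoptic segmentation relies on. The following is a minimal sketch of that taxonomy in Python. The class names and object counts come from the abstract; the `LabelClass` type, the helper functions, and the ordering of the classes are illustrative assumptions and do not reflect the dataset's actual label IDs or file format.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LabelClass:
    """Hypothetical record for one annotation class (not the dataset's own schema)."""
    name: str
    is_thing: bool  # True: annotated per instance; False: semantic "stuff" region


# Class names as listed in the abstract; the ordering here is an assumption.
CLASSES = (
    LabelClass("Spruce Tree", True),
    LabelClass("Birch Tree", True),
    LabelClass("Pine Tree", True),
    LabelClass("Obstacles", True),
    LabelClass("Lake", False),
    LabelClass("Ground", False),
    LabelClass("Track", False),
)

# Object counts reported in the abstract.
TOTAL_OBJECTS = 4226
TRUNK_OBJECTS = 2562


def thing_names(classes):
    """Names of classes annotated as individual instances."""
    return [c.name for c in classes if c.is_thing]


def stuff_names(classes):
    """Names of classes annotated only as semantic regions."""
    return [c.name for c in classes if not c.is_thing]


def trunk_share():
    """Fraction of all annotated objects that are tree trunks."""
    return TRUNK_OBJECTS / TOTAL_OBJECTS


if __name__ == "__main__":
    print("things:", thing_names(CLASSES))
    print("stuff:", stuff_names(CLASSES))
    print(f"tree trunks: {trunk_share():.1%} of annotated objects")
```

Under this reading, a panoptic model such as the benchmarked EfficientPS would predict one instance mask per object for the four thing classes and a single region label for each of the three stuff classes.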