Abstract: Light field images carry more structural information than traditional 2D monocular images and are therefore better suited to computer vision tasks such as motion estimation, depth estimation, and object detection. However, because the acquisition instruments are costly and difficult to calibrate, light field images of real-world scenes are hard to obtain. Most currently available static light field datasets are modest in size and cannot support methods such as transformers, which need large amounts of data to fully leverage local and global correlations. In addition, studies of dynamic scenarios based on 4D light field images, such as object tracking and motion estimation, have been rare, although we anticipate superior performance from light fields in these settings. In this paper, we first propose a new static light field dataset that contains up to 50 scenes, with 8 to 10 perspectives for each scene and ground truth that includes disparities, depths, surface normals, segmentations, and object poses. This dataset is larger in scale than current mainstream datasets for depth estimation refinement, and it focuses on indoor scenarios along with some outdoor ones. Second, to support more precise pixel-level motion estimation, we release a light field scene flow dataset that supplements the ground truth obtained in static scenes with optical flow ground truth indicating the 3D motion of objects; it provides dense per-pixel 3D motion ground truth, and each scene has 150 frames. Third, to evaluate our light field dataset, we perform disparity estimation and angular super-resolution with DistgDisp and DistgASR, which decouple the angular and spatial domains of the light field. Experimental results demonstrate the performance and potential of our dataset for both tasks.
