A Lightweight Depth Estimation Network for Wide-Baseline Light Fields

Yan Li,Qiong Wang,Lu Zhang,Gauthier Lafruit
DOI: https://doi.org/10.1109/tip.2021.3051761
IF: 10.6
2021-01-01
IEEE Transactions on Image Processing
Abstract:Existing traditional and ConvNet-based methods for light field depth estimation mainly work on the narrow-baseline scenario. This paper explores the feasibility and capability of ConvNets to estimate depth in another promising scenario: wide-baseline light fields. Due to the deficiency of training samples, a large-scale and diverse synthetic wide-baseline dataset with labelled data is introduced for depth prediction tasks. Considering the practical goal for real-world applications, we design an end-to-end trained lightweight convolutional network to infer depths from light fields, called LLF-Net. The proposed LLF-Net is built by incorporating a cost volume which allows variable angular light field inputs and an attention module that enables to recover details at occlusion areas. Evaluations are made on the synthetic and real-world wide-baseline light fields, and experimental results show that the proposed network achieves the best performance when compared to recent state-of-the-art methods. We also evaluate our LLF-Net on narrow-baseline datasets, and it consequently improves the performance of previous methods.
computer science, artificial intelligence,engineering, electrical & electronic
What problem does this paper attempt to address?
The problem that this paper attempts to solve is to perform depth estimation in wide - baseline light fields. Existing traditional methods and methods based on convolutional neural networks (ConvNet) mainly focus on light - field depth estimation in narrow - baseline scenarios. While wide - baseline light fields can provide higher depth accuracy due to their large baselines and high spatial resolutions, there are currently few studies on them. Specifically, the paper addresses the following key issues: 1. **Lack of datasets**: To train a convolutional neural network, a large amount of labeled data is required. However, at that time, there was no large - scale publicly available wide - baseline light - field dataset. Therefore, the author created a large - scale and diverse synthetic wide - baseline light - field dataset (WLF) to support the training and evaluation of the model. 2. **Model design**: The existing convolutional neural network models for wide - baseline light - field depth estimation have a very large number of parameters and are difficult to be applied in practical scenarios, especially in resource - constrained environments such as mobile devices. For this reason, the author designed a lightweight end - to - end trainable convolutional network (LLF - Net), which can reduce the number of parameters while ensuring performance. 3. **Handling occlusion problems**: In wide - baseline light fields, the occlusion problem is a challenge for depth estimation. The author introduced an attention mechanism to handle the detail recovery of occluded areas and maintain depth discontinuities. ### Main contributions of the paper 1. **Dataset construction**: For the first time, a large - scale and diverse synthetic wide - baseline light - field dataset (WLF) was introduced, which contains about 381 light fields, and each light field provides 9×9 angular (RGB) images and a real - depth map. This provides a valuable resource for future research. 2. **Lightweight network design**: A new lightweight end - to - end trainable network (LLF - Net) was proposed. This network has only 1.8 million parameters but performs excellently in the wide - baseline light - field depth - estimation task and outperforms existing methods. 3. **Innovative module**: A new cost - volume module was introduced, which allows flexible light - field inputs and combines an attention mechanism to better handle occlusion problems and maintain depth discontinuities. 4. **Superior performance**: The experimental results show that LLF - Net not only outperforms existing methods in wide - baseline scenarios but also performs excellently in narrow - baseline scenarios. Through these innovations, the paper provides an effective solution for wide - baseline light - field depth estimation and promotes the further development of this field.