DAVIDE: Depth-Aware Video Deblurring

German F. Torres, Jussi Kalliola, Soumya Tripathy, Erman Acar, Joni-Kristian Kämäräinen
2024-09-02
Abstract: Video deblurring aims at recovering sharp details from a sequence of blurry frames. Despite the proliferation of depth sensors in mobile phones and the potential of depth information to guide deblurring, depth-aware deblurring has received only limited attention. In this work, we introduce the 'Depth-Aware VIdeo DEblurring' (DAVIDE) dataset to study the impact of depth information in video deblurring. The dataset comprises synchronized blurred, sharp, and depth videos. We investigate how the depth information should be injected into the existing deep RGB video deblurring models, and propose a strong baseline for depth-aware video deblurring. Our findings reveal the significance of depth information in video deblurring and provide insights into the use cases where depth cues are beneficial. In addition, our results demonstrate that while the depth improves deblurring performance, this effect diminishes when models are provided with a longer temporal context. Project page: https://germanftv.github.io/DAVIDE.github.io/
Computer Vision and Pattern Recognition
### What problem does this paper attempt to solve?

The paper investigates the role of depth information in video deblurring and introduces the Depth-Aware VIdeo DEblurring (DAVIDE) dataset. Specifically:

1. **Dataset Contribution**:
   - Introduces the first large-scale video deblurring dataset (DAVIDE) that includes real depth information, comprising synchronized blurred, sharp, and depth videos.
   - The dataset was captured with an iPhone 13 Pro, using its LiDAR sensor to obtain depth.
2. **Method Innovation**:
   - Proposes a depth-aware video deblurring network based on the Shift-Net architecture, using Grouped Spatial Shift (GSS) blocks and Depth-aware Transformer (DaT) blocks to integrate depth information.
   - Investigates how depth information should be injected into existing deep RGB video deblurring models.
3. **Experimental Analysis**:
   - Runs multiple experiments to measure the impact of depth information on video deblurring performance.
   - Finds that depth significantly improves deblurring when the input temporal window is short (e.g., T=1 or T=3 frames); as the window grows to 5 frames or more, the improvement gradually diminishes.
   - Also finds that depth helps most in indoor scenes and for close-range objects.

Taken together, the paper aims to clarify when depth information is useful for video deblurring and to provide a benchmark dataset for subsequent research.
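To make the notions of a temporal window of T frames and of "injecting" depth concrete, here is a minimal pure-Python sketch. It is not the paper's method: it shows only the simplest form of injection, early fusion, in which depth is appended as a fourth channel to each RGB pixel, whereas the paper integrates depth through dedicated network blocks. The function names (`temporal_window`, `fuse_rgbd`, `build_input`), the nested-list frame layout, and the clamping of indices at the video boundaries are all assumptions made for illustration.

```python
from typing import List


def temporal_window(center: int, T: int, num_frames: int) -> List[int]:
    """Indices of the T frames centred on `center`.

    Indices that fall outside the video are clamped to the first or last
    frame (an assumed boundary policy, chosen for simplicity).
    """
    if T < 1 or T % 2 == 0:
        raise ValueError("T must be a positive odd integer")
    if not 0 <= center < num_frames:
        raise IndexError("center out of range")
    half = T // 2
    return [min(max(i, 0), num_frames - 1)
            for i in range(center - half, center + half + 1)]


def fuse_rgbd(rgb, depth):
    """Early fusion: append the depth value as a 4th channel of each pixel.

    rgb is a nested list [H][W][3]; depth is a nested list [H][W].
    """
    if len(rgb) != len(depth) or any(len(r) != len(d) for r, d in zip(rgb, depth)):
        raise ValueError("RGB and depth frames must share height and width")
    return [[list(px) + [d] for px, d in zip(rgb_row, depth_row)]
            for rgb_row, depth_row in zip(rgb, depth)]


def build_input(rgb_video, depth_video, center: int, T: int):
    """Stack T depth-augmented frames around `center` into one model input."""
    if len(rgb_video) != len(depth_video):
        raise ValueError("RGB and depth videos must have the same length")
    indices = temporal_window(center, T, len(rgb_video))
    return [fuse_rgbd(rgb_video[i], depth_video[i]) for i in indices]


print(temporal_window(0, 3, 10))  # [0, 0, 1]
print(temporal_window(5, 5, 10))  # [3, 4, 5, 6, 7]
```

With T=1 the model sees only the current frame and its depth map, so depth is the only extra cue available. With larger T, neighbouring frames supply additional information, which is consistent with the reported finding that the benefit of depth diminishes as the temporal context grows.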