DAVIDE: Depth-Aware Video Deblurring

German F. Torres, Jussi Kalliola, Soumya Tripathy, Erman Acar, Joni-Kristian Kämäräinen

2024-09-02

Abstract: Video deblurring aims at recovering sharp details from a sequence of blurry frames. Despite the proliferation of depth sensors in mobile phones and the potential of depth information to guide deblurring, depth-aware deblurring has received only limited attention. In this work, we introduce the 'Depth-Aware VIdeo DEblurring' (DAVIDE) dataset to study the impact of depth information in video deblurring. The dataset comprises synchronized blurred, sharp, and depth videos. We investigate how the depth information should be injected into the existing deep RGB video deblurring models, and propose a strong baseline for depth-aware video deblurring. Our findings reveal the significance of depth information in video deblurring and provide insights into the use cases where depth cues are beneficial. In addition, our results demonstrate that while the depth improves deblurring performance, this effect diminishes when models are provided with a longer temporal context. Project page: https://germanftv.github.io/DAVIDE.github.io/

Computer Vision and Pattern Recognition

What problem does this paper attempt to address?

The main goal of this paper is to investigate the role of depth information in video deblurring and to introduce the Depth-Aware VIdeo DEblurring (DAVIDE) dataset. Specifically:

1. **Dataset Contribution**:
   - Introduced the first large-scale video deblurring dataset (DAVIDE) that includes real depth information, comprising synchronized blurry, sharp, and depth videos.
   - The dataset was captured with an iPhone 13 Pro, using its LiDAR sensor to obtain depth information.
2. **Method Innovation**:
   - Proposed a depth-aware video deblurring network based on the Shift-Net architecture, using Grouped Spatial Shift (GSS) blocks and Depth-aware Transformer (DaT) blocks to better integrate depth information.
   - Investigated how depth information should be injected into existing deep RGB video deblurring models.
3. **Experimental Analysis**:
   - Conducted multiple experiments to verify the impact of depth information on video deblurring performance.
   - Found that depth information significantly improves deblurring performance when the input temporal window is short (e.g., T=1 or T=3); as the window grows to 5 frames or more, the improvement gradually diminishes.
   - Found that indoor scenes and close-range objects benefit most from depth information.

Through these efforts, the paper explores the practical value of depth information in video deblurring and provides a benchmark dataset for subsequent research.
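The temporal window T mentioned above can be made concrete with a small sketch. It shows one way to group synchronized blurry, depth, and sharp frames into T-frame samples centered on the frame to restore. The names `Sample`, `window_indices`, and `make_samples` are hypothetical, the frame identifiers are placeholders, and clamping indices at clip boundaries is an assumption made for illustration; none of this is taken from the DAVIDE code or the paper's data loader.

```python
from typing import List, NamedTuple


class Sample(NamedTuple):
    """One sample: T blurry/depth inputs and the sharp target (hypothetical layout)."""
    blur: List[str]   # T blurry frame identifiers
    depth: List[str]  # T depth map identifiers, aligned with `blur`
    sharp: str        # sharp ground truth for the center frame


def window_indices(center: int, num_frames: int, T: int) -> List[int]:
    """Return the indices of a T-frame window centered on `center`.

    Assumption: indices that fall outside the clip are clamped to the
    nearest valid frame, so border frames are repeated.
    """
    if T < 1 or T % 2 == 0:
        raise ValueError("T must be a positive odd integer")
    half = T // 2
    return [min(max(i, 0), num_frames - 1)
            for i in range(center - half, center + half + 1)]


def make_samples(blur: List[str], depth: List[str],
                 sharp: List[str], T: int) -> List[Sample]:
    """Build one sample per frame from three synchronized sequences."""
    if not (len(blur) == len(depth) == len(sharp)):
        raise ValueError("blur, depth, and sharp must have the same length")
    samples = []
    for center in range(len(blur)):
        idx = window_indices(center, len(blur), T)
        samples.append(Sample(blur=[blur[i] for i in idx],
                              depth=[depth[i] for i in idx],
                              sharp=sharp[center]))
    return samples
```

With T=1 each sample holds a single blurry frame and its depth map, which is the setting where the summary reports the largest benefit from depth. Larger T gives the model more neighboring frames to draw on.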

DaBiT: Depth and Blur informed Transformer for Joint Refocusing and Super-Resolution

Depth-Aware Unpaired Video Dehazing

Domain-adaptive Video Deblurring via Test-time Blurring

Efficiently Exploiting Spatially Variant Knowledge for Video Deblurring

Learning an Occlusion-Aware Network for Video Deblurring

Fast Ultra High-Definition Video Deblurring via Multi-scale Separable Network

Depth and DOF Cues Make A Better Defocus Blur Detector

DAVANet: Stereo Deblurring with View Aggregation

VDPI: Video Deblurring with Pseudo-inverse Modeling

Camera-Independent Single Image Depth Estimation from Defocus Blur

Defocus Deblurring Using Dual-Pixel Data

Towards Real-World Video Deblurring by Exploring Blur Formation Process

Depth Any Video with Scalable Synthetic Data

Learning Blind Motion Deblurring

CMTA: Cross-Modal Temporal Alignment for Event-guided Video Deblurring

Deep Lidar-guided Image Deblurring

Human-Aware Motion Deblurring

Depth Error Elimination for RGB-D Cameras

Rethinking Video Deblurring with Wavelet-Aware Dynamic Transformer and Diffusion Model

Learning Event-Based Motion Deblurring