Recurrent Cross-Modality Fusion for Time-of-Flight Depth Denoising

Guanting Dong,Yueyi Zhang,Xiaoyan Sun,Zhiwei Xiong
DOI: https://doi.org/10.1109/tci.2024.3496312
IF: 5.4
2024-11-29
IEEE Transactions on Computational Imaging
Abstract:The widespread use of Time-of-Flight (ToF) depth cameras in academia and industry is limited by noise, such as Multi-Path-Interference (MPI) and shot noise, which hampers their ability to produce high-quality depth images. Learning-based ToF denoising methods currently in existence often face challenges in delivering satisfactory performance in complex scenes. This is primarily attributed to the impact of multiple reflected signals on the formation of MPI, rendering it challenging to predict MPI directly through spatially-varying convolutions. To address this limitation, we adopt a recurrent architecture that exploits the prior that MPI is decomposable into an additive combination of the geometric information for the neighboring pixels. Our approach employs a Gated Recurrent Unit (GRU) based network to estimate a long-distance aggregation process, simplifying the MPI removal and updating depth correction over multiple steps. Additionally, we introduce a global restoration module and a local update module to fuse depth and amplitude features, which improves denoising performance and prevents structural distortions. Experimental results on both synthetic and real-world datasets demonstrate the superiority of our approach over state-of-the-art methods.
engineering, electrical & electronic,imaging science & photographic technology
What problem does this paper attempt to address?