SIERRA: A robust bilateral feature upsampler for dense prediction

Hongtao Fu, Wenze Liu, Yuliang Liu, Zhiguo Cao, Hao Lu
DOI: https://doi.org/10.1016/j.cviu.2023.103762
IF: 4.886
2023-06-29
Computer Vision and Image Understanding
Abstract: Feature upsampling is a fundamental operation in modern deep network architectures. Existing upsamplers, however, are prone to negative upsampling, where an upsampler performs worse than naive interpolation. For instance, the recent dynamic upsampler CARAFE is the best-performing operator in semantic segmentation but turns out to be the worst in image matting. In this work, we present robuSt bIlatERal featuRe upsAmpler (SIERRA), a simple, task-robust, plug-and-play, and ultra-lightweight upsampler. Its key idea is to use an efficient gradient-prior kernel to modulate a (shifted) distance-prior kernel, thereby controlling which feature points participate in interpolation; this is similar in spirit to joint bilateral filtering (JBF). Yet, in contrast to JBF, which requires high-res guidance, SIERRA generates kernels from the low-res decoder feature alone. Extensive experiments demonstrate the superiority and robustness of SIERRA on five dense prediction tasks. Code will be available online.
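For readers unfamiliar with the JBF idea the abstract refers to, the following is a minimal sketch of classic joint bilateral upsampling on a 2D scalar map. It is not SIERRA and not the authors' code: it uses the high-res guidance that SIERRA is said to avoid, and the function name `joint_bilateral_upsample` and the parameters `radius`, `sigma_s`, and `sigma_r` are illustrative assumptions. It only shows how a distance-based weight can be modulated by a second, content-based weight to decide which low-res points contribute to each output value.

```python
import math

def joint_bilateral_upsample(low, guide, scale=2, radius=1,
                             sigma_s=1.0, sigma_r=0.1):
    """Upsample the 2D map `low` by `scale`, steered by the high-res map `guide`.

    Illustrative only: classic JBF-style upsampling, not the SIERRA operator.
    `low` is h x w, `guide` is (h*scale) x (w*scale), both lists of lists.
    """
    h, w = len(low), len(low[0])
    H, W = h * scale, w * scale
    out = [[0.0] * W for _ in range(H)]
    for y in range(H):
        for x in range(W):
            # Location of the high-res pixel in low-res coordinates.
            cy = (y + 0.5) / scale - 0.5
            cx = (x + 0.5) / scale - 0.5
            iy, ix = round(cy), round(cx)
            num = den = 0.0
            for qy in range(iy - radius, iy + radius + 1):
                for qx in range(ix - radius, ix + radius + 1):
                    if not (0 <= qy < h and 0 <= qx < w):
                        continue  # skip neighbors outside the low-res map
                    # Distance prior: Gaussian on the spatial offset.
                    d2 = (qy - cy) ** 2 + (qx - cx) ** 2
                    w_s = math.exp(-d2 / (2 * sigma_s ** 2))
                    # Range term: guidance at the output pixel vs. guidance
                    # sampled near the center of low-res neighbor q.
                    gy = qy * scale + scale // 2
                    gx = qx * scale + scale // 2
                    diff = guide[y][x] - guide[gy][gx]
                    w_r = math.exp(-diff ** 2 / (2 * sigma_r ** 2))
                    num += w_s * w_r * low[qy][qx]
                    den += w_s * w_r
            out[y][x] = num / den
    return out

# Usage: a 1x2 map with an edge, guided by a 2x4 step-edge image.
low = [[0.0, 1.0]]
guide = [[0.0, 0.0, 1.0, 1.0],
         [0.0, 0.0, 1.0, 1.0]]
up = joint_bilateral_upsample(low, guide, scale=2, sigma_r=0.05)
```

With a sharp edge in the guidance, the range term suppresses neighbors on the far side of the edge, so the upsampled values stay close to 0 on the left and 1 on the right instead of blending across the boundary as plain distance-weighted interpolation would.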
computer science, artificial intelligence; engineering, electrical & electronic