CEDNet: A cascade encoder–decoder network for dense prediction

Gang Zhang, Ziyi Li, Chufeng Tang, Jianmin Li, Xiaolin Hu
DOI: https://doi.org/10.1016/j.patcog.2024.111072
IF: 8
2024-11-01
Pattern Recognition
Abstract: The prevailing methods for dense prediction tasks typically utilize a heavy classification backbone to extract multi-scale features and then fuse these features using a lightweight module. However, these methods allocate most computational resources to the classification backbone, which delays the multi-scale feature fusion and potentially leads to inadequate feature fusion. Although some methods perform feature fusion from early stages, they either fail to fully leverage high-level features to guide low-level feature learning or have complex structures, resulting in sub-optimal performance. We propose a streamlined cascade encoder–decoder network, named CEDNet, tailored for dense prediction tasks. All stages in CEDNet share the same encoder–decoder structure and perform multi-scale feature fusion within each decoder, thereby enhancing the effectiveness of multi-scale feature fusion. We explored three well-known encoder–decoder structures: Hourglass, UNet, and FPN, all of which yielded promising results. Experiments on various dense prediction tasks demonstrated the effectiveness of our method.
computer science, artificial intelligence; engineering, electrical & electronic
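
Illustrative sketch (not from the paper): the abstract describes a cascade of stages that share one encoder–decoder structure, with multi-scale fusion performed inside each decoder. The toy Python below shows that control flow only, using an FPN-style top-down decoder on 1-D feature lists. Every name here (`downsample`, `upsample`, `encoder`, `decoder`, `cascade`) is hypothetical, and fixed averaging and addition stand in for the learned blocks a real network would use; it is not the authors' implementation.

```python
from typing import List

Feature = List[float]   # one 1-D feature map (toy stand-in for a 2-D tensor)
Pyramid = List[Feature]  # feature maps ordered fine to coarse


def downsample(x: Feature) -> Feature:
    """Halve resolution by averaging adjacent pairs (assumed stand-in for a strided conv)."""
    return [(x[i] + x[i + 1]) / 2.0 for i in range(0, len(x) - 1, 2)]


def upsample(x: Feature) -> Feature:
    """Double resolution by nearest-neighbour repetition."""
    return [v for v in x for _ in range(2)]


def add(a: Feature, b: Feature) -> Feature:
    """Element-wise sum, used here as the fusion operation."""
    return [u + v for u, v in zip(a, b)]


def encoder(pyramid: Pyramid) -> Pyramid:
    """Bottom-up pass: each coarser level receives the downsampled finer level."""
    out = [pyramid[0]]
    for level in pyramid[1:]:
        out.append(add(level, downsample(out[-1])))
    return out


def decoder(pyramid: Pyramid) -> Pyramid:
    """Top-down pass (FPN-style): each finer level receives the upsampled coarser level."""
    out = [pyramid[-1]]
    for level in reversed(pyramid[:-1]):
        out.insert(0, add(level, upsample(out[0])))
    return out


def cascade(x: Feature, num_levels: int = 3, num_stages: int = 2) -> Pyramid:
    """Build an initial pyramid from x, then apply identical encoder-decoder stages in sequence."""
    pyramid = [x]
    for _ in range(num_levels - 1):
        pyramid.append(downsample(pyramid[-1]))
    for _ in range(num_stages):
        # Multi-scale fusion happens in every stage, not once at the end.
        pyramid = decoder(encoder(pyramid))
    return pyramid
```

Calling `cascade([1.0] * 8)` returns three feature maps of lengths 8, 4, and 2. Because each stage ends with a decoder, high-level (coarse) information reaches the fine levels before the next stage's encoder runs, which is the early, repeated fusion the abstract contrasts with a single fusion module placed after a heavy backbone.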