SCTNet: Single-Branch CNN with Transformer Semantic Information for Real-Time Segmentation

Zhengze Xu,Dongyue Wu,Changqian Yu,Xiangxiang Chu,Nong Sang,Changxin Gao
DOI: https://doi.org/10.1609/aaai.v38i6.28457
2024-01-01
Proceedings of the AAAI Conference on Artificial Intelligence
Abstract:Recent real-time semantic segmentation methods usually adopt an additionalsemantic branch to pursue rich long-range context. However, the additionalbranch incurs undesirable computational overhead and slows inference speed. Toeliminate this dilemma, we propose SCTNet, a single branch CNN with transformersemantic information for real-time segmentation. SCTNet enjoys the richsemantic representations of an inference-free semantic branch while retainingthe high efficiency of lightweight single branch CNN. SCTNet utilizes atransformer as the training-only semantic branch considering its superb abilityto extract long-range context. With the help of the proposed transformer-likeCNN block CFBlock and the semantic information alignment module, SCTNet couldcapture the rich semantic information from the transformer branch in training.During the inference, only the single branch CNN needs to be deployed. Weconduct extensive experiments on Cityscapes, ADE20K, and COCO-Stuff-10K, andthe results show that our method achieves the new state-of-the-art performance.The code and model is available at https://github.com/xzz777/SCTNet
What problem does this paper attempt to address?