Abstract: Semantic segmentation is a fundamental problem in multimedia that requires delicate per-pixel predictions of object categories. Recently, many researchers have sought to refine pixel-wise features with spatial-contextual information. However, many of them still neglect the invisible hand of cross-channel information, which provides inherent semantics that can improve segmentation performance. On the one hand, in the feature extraction stage, enhancing informative channels and suppressing trivial ones contributes to the acquisition of valuable semantic features and thus improves segmentation accuracy. On the other hand, in the prediction stage, complete objects can be predicted more clearly by finding the connections and complements between different channels, which also contributes to pixel prediction. Based on this idea, we propose a novel Channel-Adaptive Network for semantic segmentation, which is capable of enhancing features from the perspective of channels in both the feature extraction stage and the prediction stage. Specifically, we propose two modules: (i) the Comprehensive Information Channel Attention (CiCA) module, which addresses the shortcomings of existing channel attention by learning both low- and high-frequency components within each channel to emphasize the informative channels; (ii) the Inter-Channel Relationship Reasoning (iCRR) module, which is applied on top of the feature extractor to adaptively enhance interdependent channels by mining the complementary associations between them. In addition, our Channel-Adaptive Network is highly flexible, with a plug-and-play design. Extensive experiments demonstrate that our method achieves state-of-the-art segmentation performance on three challenging datasets: Cityscapes (82.1%), ADE20K (46.51%) and PASCAL Context (55.0%).
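
To make the idea of frequency-aware channel attention more concrete, the following is a minimal sketch and not the CiCA module itself, whose design is not given in the abstract. It assumes that each channel is summarized by projecting it onto a few 2D DCT basis functions, where frequency (0, 0) reduces to plain global average pooling, and that a sigmoid over a weighted sum of the descriptor magnitudes produces the channel gate. The function names, the chosen frequencies and the fixed weights are all hypothetical; in a real network the weights would be learned.

```python
import math

def dct_basis(h, w, u, v):
    """2D DCT-II basis function of frequency (u, v) on an h x w grid."""
    return [[math.cos(math.pi * u * (i + 0.5) / h) *
             math.cos(math.pi * v * (j + 0.5) / w)
             for j in range(w)] for i in range(h)]

def channel_descriptors(feat, freqs):
    """Project every channel of feat ([C][H][W] nested lists) onto each basis.

    Frequency (0, 0) gives an all-ones basis, so that descriptor equals
    the channel mean (global average pooling).
    """
    h, w = len(feat[0]), len(feat[0][0])
    bases = [dct_basis(h, w, u, v) for (u, v) in freqs]
    return [[sum(ch[i][j] * b[i][j] for i in range(h) for j in range(w)) / (h * w)
             for b in bases]
            for ch in feat]

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def channel_attention(feat, freqs=((0, 0), (0, 1), (1, 0)), weights=None, bias=0.0):
    """Return (gates, reweighted features).

    Assumption: the gate is a sigmoid over a weighted sum of descriptor
    magnitudes; `weights` stands in for parameters that would be learned.
    """
    if weights is None:
        weights = [1.0] * len(freqs)
    desc = channel_descriptors(feat, freqs)
    gates = [sigmoid(sum(wk * abs(dk) for wk, dk in zip(weights, d)) + bias)
             for d in desc]
    out = [[[g * x for x in row] for row in ch] for g, ch in zip(gates, feat)]
    return gates, out

# Three 2x2 channels: constant, zero-mean horizontal edge, all zeros.
feat = [
    [[1.0, 1.0], [1.0, 1.0]],
    [[-1.0, 1.0], [-1.0, 1.0]],
    [[0.0, 0.0], [0.0, 0.0]],
]
gates_gap, _ = channel_attention(feat, freqs=((0, 0),))  # average pooling only
gates_full, reweighted = channel_attention(feat)         # low + higher frequencies
```

With average pooling alone, the edge channel and the empty channel have the same mean and therefore receive the same gate. Once the higher-frequency descriptors are included, the edge channel is gated more strongly than the empty one, which is the kind of information a pooling-only descriptor discards.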

Unsupervised Representation for Semantic Segmentation by Implicit Cycle-Attention Contrastive Learning.

Point Contrastive Prediction with Semantic Clustering for Self-Supervised Learning on Point Cloud Videos

Region-aware Contrastive Learning for Semantic Segmentation

Saliency Guided Contrastive Learning on Scene Images

Pixel Contrastive-Consistent Semi-Supervised Semantic Segmentation

Exploring Cross-Image Pixel Contrast for Semantic Segmentation

Spatial Structure Constraints for Weakly Supervised Semantic Segmentation

CFCG: Semi-Supervised Semantic Segmentation Via Cross-Fusion and Contour Guidance Supervision

Cross-Image Pixel Contrasting for Semantic Segmentation

Adversarial Dense Contrastive Learning for Semi-Supervised Semantic Segmentation

Self-Supervised Equivariant Attention Mechanism for Weakly Supervised Semantic Segmentation

GroupContrast: Semantic-aware Self-supervised Representation Learning for 3D Understanding

Progressive Learning With Cross-Window Consistency for Semi-Supervised Semantic Segmentation

Pairwise-Pixel Self-Supervised and Superpixel-Guided Prototype Contrastive Loss for Weakly Supervised Semantic Segmentation

SMC-NCA: Semantic-guided Multi-level Contrast for Semi-supervised Temporal Action Segmentation

Contrast, Stylize and Adapt: Unsupervised Contrastive Learning Framework for Domain Adaptive Semantic Segmentation

Spatial and Semantic Consistency Contrastive Learning for Self-Supervised Semantic Segmentation of Remote Sensing Images

Bottom-Up Top-Down Cues for Weakly-Supervised Semantic Segmentation

Region-level Contrastive and Consistency Learning for Semi-Supervised Semantic Segmentation

Learning Cross-Channel Representations for Semantic Segmentation