Abstract:The amount of multimedia data, such as images and videos, has been increasing rapidly with the development of various imaging devices and the Internet, bringing more stress and challenges to information storage and transmission. The redundancy in images can be reduced to decrease data size via lossy compression, such as the most widely used standard Joint Photographic Experts Group (JPEG). However, the decompressed images generally suffer from various artifacts (e.g., blocking, banding, ringing, and blurring) due to the loss of information, especially at high compression ratios. This article presents a feature-enriched deep convolutional neural network for compression artifacts reduction (FeCarNet, for short). Taking the dense network as the backbone, FeCarNet enriches features to gain valuable information via introducing multi-scale dilated convolutions, along with the efficient 1 ×1 convolution for lowering both parameter complexity and computation cost. Meanwhile, to make full use of different levels of features in FeCarNet, a fusion block that consists of attention-based channel recalibration and dimension reduction is developed for local and global feature fusion. Furthermore, short and long residual connections both in the feature and pixel domains are combined to build a multi-level residual structure, thereby benefiting the network training and performance. In addition, aiming at reducing computation complexity further, pixel-shuffle-based image downsampling and upsampling layers are, respectively, arranged at the head and tail of the FeCarNet, which also enlarges the receptive field of the whole network. Experimental results show the superiority of FeCarNet over state-of-the-art compression artifacts reduction approaches in terms of both restoration capacity and model complexity. The applications of FeCarNet on several computer vision tasks, including image deblurring, edge detection, image segmentation, and object detection, demonstrate the effectiveness of FeCarNet further.

CARAFE: Content-Aware ReAssembly of FEatures

CARAFE++: Unified Content-Aware ReAssembly of FEatures

Lighten CARAFE: Dynamic Lightweight Upsampling with Guided Reassemble Kernels

MFF-Net: Towards Efficient Monocular Depth Completion With Multi-Modal Feature Fusion

Retinal Vessel Segmentation Via Cross-attention Feature Fusion

ASFD: Automatic and Scalable Face Detector

Learning to Joint Remosaic and Denoise in Quad Bayer CFA via Universal Multi-scale Channel Attention Network.

M2MRF: Many-to-Many Reassembly of Features for Tiny Lesion Segmentation in Fundus Images

FFPA-Net: Efficient Feature Fusion with Projection Awareness for 3D Object Detection

Dynamic feature distillation and pyramid split large kernel attention network for lightweight image super-resolution

SIERRA: A robust bilateral feature upsampler for dense prediction

LDA-AQU: Adaptive Query-guided Upsampling via Local Deformable Attention

FeatUp: A Model-Agnostic Framework for Features at Any Resolution

Overfitting the Data: Compact Neural Video Delivery Via Content-aware Feature Modulation

AutoFocusFormer: Image Segmentation off the Grid

AMFF-Net: An Effective 3D Object Detector Based on Attention and Multi-Scale Feature Fusion

AFFNet: Attention Mechanism Network Based on Fusion Feature for Image Cloud Removal

Content-Augmented Feature Pyramid Network with Light Linear Spatial Transformers for Object Detection

Comprehensive Feature Enhancement Module For Single-Shot Object Detector

Pyramid Feature Attention Network for Monocular Depth Prediction

A Feature-Enriched Deep Convolutional Neural Network for JPEG Image Compression Artifacts Reduction and its Applications