Abstract:Image coding is one of the most fundamental techniques and is widely used in image/video processing and multimedia communications. Current image coding methods are mainly human-oriented, and the visual quality is always unsatisfactory, especially at low bitrates. Moreover, the recent emergence of machine vision goes beyond the scope of current coding. With these considerations, we proposed a sketch assisted face image coding for human and machine vision by a joint training approach. In the proposed approach, we design a new feature representation: a color sketch, which aims to satisfy both low-frequency features of human vision and high-frequency features of machine analysis. Then, we present a novel end-to-end image codec framework with joint training that consists of three models: an image-to-image translation module, a coding module, and a two-stage reconstruction module. Specifically, the input image is first translated into the edge map with the Canny edge as the auxiliary label to merely preserve the structure information. Afterward, the backpropagation from reconstruction module guides the edge map to increase or decrease the information through joint training, which results in the generation of color sketch. Then, the generated sketch is compressed into the bitstream and decompressed back to a sketch in the coding module. Finally, the decompressed sketch is reconstructed to support the machine and human tasks, respectively. In this way, the color sketch is designed to bridge the gap between human and machine vision, and the joint training strategy helps to adjust the low-frequency information in the sketch. The experimental results on challenge datasets demonstrate that our proposed algorithm offers 40.9%-86.6% bitrate savings on machine vision and is comparable to state-of-the-art image coding methods on human vision.

Automatic Face Segmentation Using Color Cues for Coding Typical Videophone Scenes

Video Conference System for Enhancing Quality of Target Region under Low Bit Rate

Real-time Detection and Tracking Algorithm Based on the Color and Character of Human Face

An algorithm for automatic segmentation of video objects based on intra-frame image partition

Multi-objects Real Time Recognition Based on Color Information

Video stream interception and erotic content filtering

Face Detection Based on Vector Quantization in Color Images

Color Correction Method Based on Histogram Segmentation for Multiview Video

Towards Coding for Human and Machine Vision: Scalable Face Image Coding

Contour Based Automatic Scene Segmentation in Image Sequences

Confidence-Based Color Modeling for Online Video Segmentation

The Study On The Layered Coding System For Very Low Bit Rate Videophone

Multi-cue-based Face and Facial Feature Detection on Video Segments

2D/3D Model-Based Facial Video Coding/Decoding at Ultra-Low Bit-Rate.

Low Complexity Depth Coding Assisted by Coding Information From Color Video

Object segmentation using stereo images

Hybrid model-and-object-based real-time conversational video coding

&Lt;title>layered Coding System for Very Low Bitrate Videophone</title>

Sketch Assisted Face Image Coding for Human and Machine Vision: A Joint Training Approach

Automatic Segmentation of Moving Objects in Video Sequences Using Multifeature

Improving Video Segmentation by Fusing Depth Cues and the Visual Background Extractor (ViBe) Algorithm