Content-aware Facial Image Compression with Deep Learning Method

Shuzhan Hu,Yiping Duan,Xiaoming Tao,Yongjia Liu,Xuming Zhang,Jianhua Lu
DOI: https://doi.org/10.1109/wcsp49889.2020.9299680
2020-01-01
Abstract:Recently, online office work and education through video conferences have become a typical part of people's daily life, increasing the need for bandwidth and storage of wireless communication networks. To avoid video stalling and blurring, this paper proposes a content-aware facial image compression method based on deep learning networks at low bit rates for higher quality of experience (QoE). In this paper, the average face calculated from the database collected by the server is incorporated into the face representation model, which is used as prior knowledge to improve the performance of the video communication system. With the face representation model, facial images are decomposed into shape components and texture components, and different compression methods are applied to these components according to different redundancy levels. Specifically, a convolutional neural network (CNN) method is proposed to process texture components. The texture components are optimized by end-to-end rate-distortion learning with mean squared error (MSE) and structural similarity (SSIM) as optimization objectives for high fidelity reconstruction. The shape components use quantizer and entropy coding to preserve the details. Compared with traditional compression methods JPEG and JPEG2000, our proposed method could achieve a better performance in terms of PSNR and SSIM values at low bit rates.
What problem does this paper attempt to address?