Joint super-resolution-based fast face image coding for human and machine vision

Wuzhen Shi,Fei Tao,Yang Wen
DOI: https://doi.org/10.1007/s00371-024-03428-w
IF: 2.835
2024-05-21
The Visual Computer
Abstract:As the Internet of Things continues to grow and thrive, more and more data are consumed by machines for intelligent analysis. How to simultaneously support fast machine vision analysis and obtain high-quality reconstructed images to serve human vision has become a problem that needs to be solved. We propose a fast face image compression scheme based on a Joint Super-Resolution Network (JSRNet), where images are encoded hierarchically to support machine vision and human vision. To support fast machine vision tasks, we compress the downsampled version of the input image at the encoding side and implement base layer decoding at the decoding side with a lightweight super-resolution network, thus effectively reducing the codec time. To obtain high-quality reconstructed images, we design a content transfer module to transfer the visual information generated by the base layer to the enhancement layer. The joint optimization of the base layer and the enhancement layer also allows them to better cooperate to achieve better performance. We verify the performance of this scheme on the facial landmark detection task, and the experimental results show that our method saves more bit-rate and achieves more accurate facial landmark detection results with faster codec time than baselines. The reconstructed images are more in line with human visual characteristics, and the peak signal-to-noise ratio (PSNR) and structural similarity (SSIM) are also improved significantly.
computer science, software engineering
What problem does this paper attempt to address?