<Emphasis Type="Italic">CodedVision</Emphasis>: Towards Joint Image Understanding and Compression via End-to-End Learning

Qiu Shen,Juanjuan Cai,Linfeng Liu,Haojie Liu,Tong Chen,Long Ye,Zhan Ma
DOI: https://doi.org/10.1007/978-3-030-00776-8_1
2018-01-01
Abstract:We present a CodedVision framework to achieve image content understanding and compression jointly, leveraging the recent advances in deep neural networks. We have introduced an eight-layer deep residual network to extract image features for compression and understanding. For compression, a scalar quantizer and an entropy coder are utilized to remove redundancy. Rate-distortion optimization is integrated to improve the coding efficiency where rate is estimated via a piecewise linear approximation. A noticeable 7.8% BD-Rate (Bjontegaard delta rate) gain is presented against the state-of-the-art HEVC intra based image compression. For content understanding, we patch another residual network-based classifier to perform the classification, with reasonable accuracy at the current stage.
What problem does this paper attempt to address?