Deep Network for Image Compressed Sensing Coding Using Local Structural Sampling

Wenxue Cui, Xingtao Wang, Xiaopeng Fan, Shaohui Liu, Xinwei Gao, Debin Zhao
2024-02-29
Abstract: Existing image compressed sensing (CS) coding frameworks usually solve an inverse problem based on measurement coding and optimization-based image reconstruction, and they still face two challenges: 1) The widely used random sampling matrix, such as the Gaussian Random Matrix (GRM), usually leads to low measurement coding efficiency. 2) Optimization-based reconstruction methods generally incur much higher computational complexity. In this paper, we propose a new CNN-based image CS coding framework using local structural sampling (dubbed CSCNet) that includes three functional modules: local structural sampling, measurement coding and Laplacian pyramid reconstruction. In the proposed framework, instead of GRM, a new local structural sampling matrix is first developed, which enhances the correlation between the measurements through a local perceptual sampling strategy. In addition, the designed local structural sampling matrix can be jointly optimized with the other functional modules during the training process. Sampling therefore produces highly correlated measurements, which are then coded into the final bitstream by a third-party image codec. Finally, a Laplacian pyramid reconstruction network is proposed to efficiently recover the target image from the measurement domain to the image domain. Extensive experimental results demonstrate that the proposed scheme outperforms existing state-of-the-art CS coding methods while maintaining fast computational speed.
Image and Video Processing, Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
This paper addresses two major challenges in image compressed sensing (CS) coding:

1. **Low measurement coding efficiency**: Existing CS coding frameworks usually use random sampling matrices, such as the Gaussian Random Matrix (GRM), which leads to low measurement coding efficiency.
2. **High computational complexity**: Optimization-based reconstruction methods usually have high computational complexity, which limits execution speed and practical use.

To meet these challenges, the authors propose a new image CS coding framework based on a Convolutional Neural Network (CNN), named CSCNet (Compressed Sensing Coding Network). The framework consists of three functional modules:

1. **Local structural sampling**: A new local structural sampling matrix is designed. It enhances the correlation between measurements through a local perceptual sampling strategy, and it can be jointly optimized with the other functional modules during training.
2. **Measurement coding**: The resulting highly correlated measurements are encoded into the final bitstream by a third-party image codec.
3. **Laplacian pyramid reconstruction**: A convolutional Laplacian pyramid network efficiently restores the target image from the measurement domain to the image domain.

### Specific contributions

1. **New CNN-based image CS coding framework**: The framework includes three functional modules: local structural sampling, measurement coding, and Laplacian pyramid reconstruction.
2. **Learnable local structural sampling matrix**: A highly sparse local structural sampling matrix is designed. It is easy to implement in hardware and generates highly correlated measurements, which improves measurement coding efficiency.
3. **Convolutional Laplacian pyramid network**: A convolutional Laplacian pyramid network is developed to progressively reconstruct the target image from the measurement domain to the image domain.

### Experimental results

The experimental results show that the proposed CSCNet outperforms existing state-of-the-art CS coding methods while maintaining fast computational speed. Specifically, CSCNet shows a clear advantage in rate-distortion performance (measured by PSNR) across different sampling rates.

### Mathematical formulas

- **CS sampling process**:
  \[ y=\Phi x \]
  where \( y\in\mathbb{R}^M \) is the vector of linear measurements, \( \Phi\in\mathbb{R}^{M\times N} \) is the sampling matrix, and \( x\in\mathbb{R}^N \) is the original signal.
- **Optimization problem**:
  \[ \hat{x}=\arg\min_x\frac{1}{2}\|\Phi x - y\|^2_2+\lambda\|\Psi x\|_p \]
  where \( \Psi x \) represents the sparse coefficients in the transform domain and \( \lambda \) is the regularization parameter that controls the weight of the sparsity term.
- **Local structural sampling**:
  \[ \tilde{W}_l=\Upsilon\circ W^*_l \]
  \[ W_l(k)=S_n(F_p(\tilde{W}_l(k))) \]
  where \( \circ \) represents element-wise multiplication, \( F_p \) is the positive mapping function, and \( S_n \) is the normalization operator.
- **Measurement generation**:
  \[ y = W_l * x \]
  where \( * \) represents the convolution operation and \( W_l \) is the learnable filter weight in the sampling network.

Through these innovations, CSCNet achieves higher efficiency and better performance in image compressed sensing coding.
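The local structural sampling and measurement generation formulas can be made concrete with a small sketch. The following pure-Python example is only an illustration and is not the authors' implementation. The summary does not define \( \Upsilon \), \( F_p \), or \( S_n \) in detail, so the sketch assumes that \( \Upsilon \) is a binary mask, that \( F_p \) is the absolute value, and that \( S_n \) rescales a filter so its entries sum to one. The names `build_local_filter` and `sample`, the 4×4 filter size, the stride, and the mask layout are all hypothetical.

```python
import random


def build_local_filter(raw_weights, mask):
    """Return W = S_n(F_p(mask ∘ raw_weights)) for a single sampling filter.

    Assumptions (not from the source): F_p is the absolute value and
    S_n rescales the filter so that its entries sum to one.
    """
    # Element-wise product of the mask and the raw (learnable) weights.
    masked = [[m * w for m, w in zip(m_row, w_row)]
              for m_row, w_row in zip(mask, raw_weights)]
    # Assumed positive mapping F_p.
    positive = [[abs(v) for v in row] for row in masked]
    # Assumed normalization S_n.
    total = sum(sum(row) for row in positive)
    if total == 0:
        raise ValueError("mask removes every weight; nothing to normalize")
    return [[v / total for v in row] for row in positive]


def sample(image, filt, stride):
    """Slide a square filter over the image with the given stride, no padding.

    As in most CNN libraries, the "convolution" here is a cross-correlation.
    Each output value is one measurement.
    """
    size = len(filt)
    rows = range(0, len(image) - size + 1, stride)
    cols = range(0, len(image[0]) - size + 1, stride)
    return [[sum(filt[i][j] * image[r + i][c + j]
                 for i in range(size) for j in range(size))
             for c in cols]
            for r in rows]


rng = random.Random(0)
BLOCK = 4  # hypothetical filter size, also used as the stride

# Hypothetical mask: only the top-left 2x2 corner of the 4x4 filter is active.
mask = [[1 if i < 2 and j < 2 else 0 for j in range(BLOCK)]
        for i in range(BLOCK)]
# Random values stand in for the learned weights W*_l.
raw = [[rng.uniform(-1.0, 1.0) for _ in range(BLOCK)] for _ in range(BLOCK)]
filt = build_local_filter(raw, mask)

# Toy 8x8 "image"; one filter with stride 4 gives a 2x2 grid of measurements.
image = [[rng.random() for _ in range(8)] for _ in range(8)]
measurements = sample(image, filt, stride=BLOCK)
```

Under these assumptions the filter is non-negative and sums to one, so each measurement is a weighted average over a small pixel neighborhood, and a constant image maps to the same constant in every measurement. In a trained network the raw weights would be learned jointly with the other modules rather than drawn at random as they are here.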