Blind microscopy image denoising with a deep residual and multiscale encoder/decoder network

Fabio Hernán Gil Zuluaga,Francesco Bardozzo,Jorge Iván Ríos Patiño,Roberto Tagliaferri
DOI: https://doi.org/10.48550/arXiv.2105.00273
2021-05-01
Abstract:In computer-aided diagnosis (CAD) focused on microscopy, denoising improves the quality of image analysis. In general, the accuracy of this process may depend both on the experience of the microscopist and on the equipment sensitivity and specificity. A medical image could be corrupted by both intrinsic noise, due to the device limitations, and, by extrinsic signal perturbations during image acquisition. Nowadays, CAD deep learning applications pre-process images with image denoising models to reinforce learning and prediction. In this work, an innovative and lightweight deep multiscale convolutional encoder-decoder neural network is proposed. Specifically, the encoder uses deterministic mapping to map features into a hidden representation. Then, the latent representation is rebuilt to generate the reconstructed denoised image. Residual learning strategies are used to improve and accelerate the training process using skip connections in bridging across convolutional and deconvolutional layers. The proposed model reaches on average 38.38 of PSNR and 0.98 of SSIM on a test set of 57458 images overcoming state-of-the-art models in the same application domain
Image and Video Processing,Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to effectively remove noise in microscopic images in computer - aided diagnosis (CAD) to improve the quality of image analysis. Specifically, the paper proposes an innovative and lightweight deep multi - scale convolutional encoder - decoder neural network for blind microscopic image denoising. This method aims to overcome the limitations of traditional denoising methods, especially in dealing with different noise levels and unknown noise characteristics. By introducing the residual learning strategy and skip connections across convolutional and deconvolutional layers, the model can accelerate the training process and improve the denoising effect. Experimental results show that the average peak signal - to - noise ratio (PSNR) of this model on the test set reaches 38.38, and the structural similarity index (SSIM) reaches 0.98, exceeding the existing state - of - the - art models in this application field. In terms of formulas, the objective of the model mentioned in the paper can be formally expressed as \(m(x | x^*; \theta)\), where \(x\) is the clean image, \(x^*\) is the noisy image, and \(\theta\) is the parameter to be learned. The loss function of the model adopts the mean absolute error (MAE), that is: \[ \text{Loss}(x, z) = \frac{1}{N} \sum_{i = 1}^{N} |x_i - z_i| \] Here, \(N\) represents the total number of pixels in the image, and \(x_i\) and \(z_i\) respectively represent the \(i\)-th pixel values of the original image and the reconstructed image. By minimizing this loss function, the model can learn the mapping from the noisy image to its denoised representation.