Abstract:Modeling and synthesizing real sRGB noise is crucial for various low-level vision tasks, such as building datasets for training image denoising systems. The distribution of real sRGB noise is highly complex and affected by a multitude of factors, making its accurate modeling extremely challenging. Therefore, recent studies have proposed methods that employ data-driven generative models, such as Generative Adversarial Networks (GAN) and Normalizing Flows. These studies achieve more accurate modeling of sRGB noise compared to traditional noise modeling methods. However, there are performance limitations due to the inherent characteristics of each generative model. To address this issue, we propose NM-FlowGAN, a hybrid approach that exploits the strengths of both GAN and Normalizing Flows. We combine pixel-wise noise modeling networks based on Normalizing Flows and spatial correlation modeling networks based on GAN. Specifically, the pixel-wise noise modeling network leverages the high training stability of Normalizing Flows to capture noise characteristics that are affected by a multitude of factors, and the spatial correlation networks efficiently model pixel-to-pixel relationships. In particular, unlike recent methods that rely on paired noisy images, our method synthesizes noise using clean images and factors that affect noise characteristics, such as easily obtainable parameters like camera type and ISO settings, making it applicable to various fields where obtaining noisy-clean image pairs is not feasible. In our experiments, our NM-FlowGAN outperforms other baselines in the sRGB noise synthesis task. Moreover, the denoising neural network trained with synthesized image pairs from our model shows superior performance compared to other baselines. Our code is available at: \url{<a class="link-external link-https" href="https://github.com/YoungJooHan/NM-FlowGAN" rel="external noopener nofollow">this https URL</a>}.

Self-Calibration Flow Guided Denoising Diffusion Model for Human Pose Transfer

Dense Intrinsic Appearance Flow for Human Pose Transfer

FDA-GAN: Flow-based Dual Attention GAN for Human Pose Transfer

Attentional pixel-wise deformation for pose-based human image generation

LSG-GAN: Latent space guided generative adversarial network for person pose transfer

PCFN: Progressive Cross-Modal Fusion Network for Human Pose Transfer

DiffuPose: Monocular 3D Human Pose Estimation via Denoising Diffusion Probabilistic Model

A Conditional Diffusion Model for 3D Human Pose Estimation

SCRN: Stepwise Change and Refine Network Based Semantic Distribution for Human Pose Transfer

Multi-Pose Virtual Try-On Via Self-Adaptive Feature Filtering

Supervised Video-To-Video Synthesis For Single Human Pose Transfer

Diffusion Based Coarse-to-Fine Network for 3D Human Pose and Shape Estimation from Monocular Video

FinePOSE: Fine-Grained Prompt-Driven 3D Human Pose Estimation via Diffusion Models

NM-FlowGAN: Modeling sRGB Noise without Paired Images using a Hybrid Approach of Normalizing Flows and GAN

Diffusion-Based Pose Refinement and Multi-Hypothesis Generation for 3D Human Pose Estimation

Cross-view Masked Diffusion Transformers for Person Image Synthesis

Human Pose Transfer by Adaptive Hierarchical Deformation

Towards Fine-Grained Human Pose Transfer With Detail Replenishing Network

Consistent Human Image and Video Generation with Spatially Conditioned Diffusion

DSFFNet: Dual-Side Feature Fusion Network for 3D Pose Transfer

PoNA: Pose-Guided Non-Local Attention for Human Pose Transfer