Natural generative noise diffusion model imputation

Ari Wibisono,Denny,Petrus Mursanto,Simon See
DOI: https://doi.org/10.1016/j.knosys.2024.112310
IF: 8.139
2024-08-25
Knowledge-Based Systems
Abstract:Imputation is a critical method for enhancing dataset quality, essential for ensuring accurate analysis and insights. This research proposes an advanced imputation algorithm utilizing a Diffusion Model enhanced with Perlin noise generation. We introduce Perlin noise at each step of the diffusion process and incorporate a cosine scheduler to optimize performance. Our approach demonstrates improvements in imputing non-normal data, validated through tests on ten real datasets. Due to the substantial slope of distribution properties produced by perlin noise, noisy data gradually contaminates nonnormally distributed data, making it more like to a Gaussian distribution. The Perlin noise distribution increases the normality of the noisy data provided noise when it enters the deep neural process of diffusion imputation. We assess our proposed approach by simulating the missing data rate using three scenarios: Missing Completely at Random (MCAR), Missing Not At Random (MNAR), and Missing at Random (MAR). Every case is handled similarly, with 20 % to 80 % missing data. Compared to other deep learning imputation methods, our proposed methods and improvements contribute to lowering the RMSE value up to 10 % on non-normal distributed data imputation.
computer science, artificial intelligence
What problem does this paper attempt to address?