High-speed parallel wavelet algorithm based on CUDA and its application in three-dimensional surface texture analysis

Wang Jianjun,Lu Wen Long,Liu Xiaojun,Jiang Xiangqian
DOI: https://doi.org/10.1109/ICEICE.2011.5778225
2011-01-01
Abstract:A new efficient parallel wavelet algorithm was presented in order to speed up wavelet transform in three dimensional surface texture analysis. It is based NVIDIA's CUDA (Compute Unified Device Architecture), a new general purpose parallel programming model and instruction set architecture that leverage computational problems on GPU more efficient than CPU. Compared with CPU, GPU has evolved into a highly parallel, multithread, multicore processor with tremendous computational horsepower and very high memory bandwidth. GPU is well-suited to address data-parallel computation problems rather than flow controlled problems. Wavelet transform can use data-parallel programming model so data elements will be mapped to parallel processing threads to speed up the computations. CUDA wavelet decomposition and reconstruction algorithms were realized based on the analysis above. Experiments show that the parallelization of the fast wavelet decomposition transform for GPU speedup 34×-38× over CPU, reconstruction transform speedup 29x-33x over CPU.
What problem does this paper attempt to address?