CUDA parallel realization of simplex growing algorithm for endmember extraction

Jia-lin ZOU,Liao-ying ZHAO,Xiao-run LI,Xiao-fen CHEN
DOI: https://doi.org/10.16208/j.issn1000-7024.2017.11.051
2017-01-01
Abstract:To solve the problem that the time complexity of the simplex growing algorithm (FNSGA) is high due to the fact that the FNSGA needs to traverse all pixels many times,the CUDA parallel realization of the FNSGA was studied on account of the high parallelism characteristic of the FNSGA.The flow chart for the parallel computing of the FNSGA was designed and the parallel realization for the key stages was discussed.Three strategies for the code implementation,namely,code conversion design directly from MATLAB code,loop unrolling and using CUBLAS library,were put forward.The influence of the target number on the computing performance of the CUDA-based FNSGA algorithm was analyzed Two real hyperspectral images were used in the experiment.The results verify that the proposed optimization efforts on the CUDA-based FNSGA algorithm are obvious.Especially when using CUBLAS,the effect can be achieved at a speedup of 100 times with respect to one CPU core.
What problem does this paper attempt to address?