Capped robust nonnegative matrix factorization algorithm
Wenkai? Lu,Liping? Jing,Liu Yang
DOI: https://doi.org/10.13232/j.cnki.jnju.2016.04.016
2016-01-01
Abstract:Nonnegative Matrix Factorization Algorithm(NMF)has been widely applied in various areas,but it is easily influenced by outliers.In order to solve this problem,researchers have proposed Robust Nonnegative Matrix Factorization Algorithm(RNMF),which uses L2,1 norm to make the normal points be approximate as much as possible and reduce the residual of the outliers by using its absolute rather than square.However,RNMF is still sensitive to the proportion of outliers,i.e.,in some datasets,RNMF can handle outliers well,but its performance in other datasets is not satisfactory.Every real dataset has its own structure,that is,it contains a different proportion of outliers.Because of this,RNMF is limited in practical application.In this article,we present a Capped Robust Nonnegative Matrix Factorization Algorithm (CRNMF)by adding a denoising rateεinto the obj ective function of RNMF.To achieve better controlling of the outliers,we use this algorithm to handle the situation which the real dataset outlier ratio is different.The main idea of CRNMF is evaluating the residual for each data point according to the input data and the factors during the iterative procedure,if the residual is larger than the given denoising rateε, we will set the residual as 0,i.e.,the corresponding data point is taken as outlier and not considered in the computing process.By introducingεtruncation,the algorithm reduced the influence of outliers on matrix F and matrix G.This paper gives the description of CRNMF and experiments on real world and synthetic data sets.Experi-mental results show that the proposed algorithm can improve the clustering accuracy,reduce the impact of outliers and then improve the robustness of the algorithm to some extent,compared with the traditional NMF and RNMF.