A Covariance-Free Iterative Principal Component Analysis for High Dimensional and Large Scale Data

LI Chen, GUO Yue-fei
2013-01-01
Abstract: Principal component analysis (PCA) is a well-established technique for dimension reduction. The principal vectors are the eigenvectors of the covariance matrix that correspond to its largest eigenvalues. The order of the covariance matrix equals the dimension of the data. When the dimension of the samples is very high, the principal vectors are instead calculated from a substitute matrix whose order equals the number of samples. However, the principal vectors are hard to calculate when both the dimension and the number of samples are very large (the high-dimensional, large-scale case). A covariance-free iterative principal component analysis (CIPCA) algorithm is presented for high-dimensional, large-scale data. It is proved that the presented algorithm converges monotonically to the exact principal vector at an exponential rate. The performance of CIPCA is demonstrated on high-dimensional, large-scale data, namely image data sets. The experimental results show that CIPCA converges very quickly.
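The abstract does not give the CIPCA update rule, so the following is only a minimal sketch of the general idea of a covariance-free iteration, not the authors' algorithm. It uses a plain power iteration that applies the covariance matrix to a vector one sample at a time, so neither the d-by-d covariance matrix nor the n-by-n substitute matrix is ever stored. The function name `leading_principal_vector` and its parameters (`n_iter`, `tol`, `seed`) are hypothetical names chosen for illustration.

```python
import math
import random


def leading_principal_vector(samples, n_iter=500, tol=1e-12, seed=0):
    """Estimate the first principal vector without forming a covariance matrix.

    Assumption: this is a generic power iteration, used here as a stand-in;
    it is not claimed to be the CIPCA update from the paper.
    `samples` is a sequence of equal-length numeric sequences.
    """
    n = len(samples)
    d = len(samples[0])
    # Per-feature mean, needed to center the data on the fly.
    mean = [sum(x[j] for x in samples) / n for j in range(d)]

    # Random unit-length starting vector.
    rng = random.Random(seed)
    w = [rng.gauss(0.0, 1.0) for _ in range(d)]
    norm = math.sqrt(sum(c * c for c in w))
    w = [c / norm for c in w]

    for _ in range(n_iter):
        # v is proportional to C w = sum_i ((x_i - mean) . w) (x_i - mean),
        # accumulated sample by sample in O(n * d) time and O(d) memory.
        v = [0.0] * d
        for x in samples:
            centered = [x[j] - mean[j] for j in range(d)]
            proj = sum(c * wj for c, wj in zip(centered, w))
            for j in range(d):
                v[j] += proj * centered[j]

        norm = math.sqrt(sum(c * c for c in v))
        if norm == 0.0:
            break  # all samples identical, or w orthogonal to the data
        v = [c / norm for c in v]

        # Stop once the direction no longer changes noticeably.
        change = math.sqrt(sum((a - b) ** 2 for a, b in zip(v, w)))
        w = v
        if change < tol:
            break
    return w
```

Each pass costs time proportional to the number of samples times the dimension, and the working memory is a few vectors of length d, which is why this style of iteration remains feasible when both quantities are large. The sign of the returned vector depends on the random start, so compare directions up to sign.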