Covariance free incremental principal component analysis with exact mean update

Xueqiang Zeng, Guozheng Li
DOI: https://doi.org/10.12733/jcisP1170
2013-01-01
Journal of Computational Information Systems
Abstract: Incremental feature extraction is an essential data preprocessing technique for large-scale and streaming data mining. Among the various covariance-matrix-free Incremental Principal Component Analysis (IPCA) methods, Candid Covariance-free Incremental Principal Component Analysis (CCIPCA) is a state-of-the-art algorithm. Because the training samples must be centred, CCIPCA applies an approximate centring to the input data, in which only the current sample is correctly centred and the historical data are not updated to reflect the new mean. In this paper, we propose a novel centred incremental principal component analysis algorithm with an exact historical mean update, in which not only is the current sample centred, but all historical data are also correctly updated by the current mean. Compared to CCIPCA, the proposed method converges more quickly, and the improvement is especially pronounced when the data's inherent covariance is not stable. Experiments on a real streaming dataset show that the proposed method converges considerably faster than CCIPCA. © 2013 Binary Information Press.
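The centring issue described in the abstract can be illustrated with a small sketch. This is not the authors' algorithm, which is covariance-free and never forms a scatter matrix; the code below builds the scatter matrix explicitly, purely to make the discrepancy visible. The function names, the `exact` flag, and the sample data are hypothetical. With `exact=True` the sketch uses the standard Welford-style rank-one update, which is algebraically the same as re-centring every historical sample on the new mean; with `exact=False` it centres only the current sample on the current mean, which is the kind of approximation the abstract attributes to CCIPCA.

```python
def batch_scatter(samples):
    """Reference: scatter matrix of all samples about their final mean."""
    n, d = len(samples), len(samples[0])
    mean = [sum(x[i] for x in samples) / n for i in range(d)]
    S = [[0.0] * d for _ in range(d)]
    for x in samples:
        c = [x[i] - mean[i] for i in range(d)]
        for i in range(d):
            for j in range(d):
                S[i][j] += c[i] * c[j]
    return mean, S


def incremental_scatter(samples, exact=True):
    """One-pass mean and scatter update (illustrative, not the paper's method).

    exact=True : S_n = S_{n-1} + (n-1)/n * delta * delta^T, where
                 delta = x_n - mean_{n-1}. This equals re-centring all
                 earlier samples on the updated mean.
    exact=False: adds (x_n - mean_n)(x_n - mean_n)^T, i.e. only the
                 current sample is centred; earlier terms are left as is.
    """
    d = len(samples[0])
    mean = [0.0] * d
    S = [[0.0] * d for _ in range(d)]
    for n, x in enumerate(samples, start=1):
        delta = [x[i] - mean[i] for i in range(d)]         # deviation from old mean
        mean = [mean[i] + delta[i] / n for i in range(d)]  # running mean update
        if exact:
            weight, vec = (n - 1) / n, delta
        else:
            weight, vec = 1.0, [x[i] - mean[i] for i in range(d)]
        for i in range(d):
            for j in range(d):
                S[i][j] += weight * vec[i] * vec[j]
    return mean, S


# Hypothetical stream whose mean drifts steadily.
samples = [[0.0, 0.0], [2.0, 1.0], [4.0, 2.0], [6.0, 3.0]]
```

On this drifting stream, `incremental_scatter(samples, exact=True)` reproduces the batch scatter matrix (top-left entry 20), while `exact=False` underestimates it (top-left entry 14), because the earlier terms were centred on means that have since moved.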