Enhancing Information Discriminant Analysis: Feature Extraction with Linear Statistical Model and Information-Theoretic Criteria

Liling Li, Lan Du, Wei Zhang, Hua He, Penghui Wang
DOI: https://doi.org/10.1016/j.patcog.2016.06.004
IF: 8
2016-01-01
Pattern Recognition
Abstract: In this paper, we develop a novel feature transformation method for supervised linear dimensionality reduction. Existing methods, e.g., Information Discriminant Analysis (IDA), estimate the first- and second-order statistics of the data in the original high-dimensional space and then design the transformation matrix according to information-theoretic criteria. Unfortunately, such methods are sensitive to the accuracy of the estimated statistics. To overcome this disadvantage, our method describes the statistical structure of the transformed low-dimensional subspace via a linear statistical model, which reduces the number of unknown parameters, while simultaneously maximizing the mutual information (MI) between the transformed data and their class labels, which ensures between-class separability from an information-theoretic standpoint. The key idea is to seek the optimal model parameters, including the transformation matrix, by jointly optimizing the MI function and the log-likelihood function; the method can therefore both reduce estimation errors and maximize between-class separability. Experimental results on a synthetic dataset and benchmark datasets show that our method outperforms other related methods.
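As a rough illustration of the kind of information-theoretic criterion the abstract refers to, the sketch below scores a one-dimensional projection by a Gaussian surrogate for the MI between the projected data and the class labels, built only from second-order statistics. This is an assumption-laden toy, not the paper's joint MI and log-likelihood objective: the function names, the restriction to a single projection direction, and the Gaussian entropy approximation are all chosen here for illustration.

```python
import math
from collections import defaultdict


def project(X, w):
    """Project each sample (a list of floats) onto the direction w."""
    return [sum(xi * wi for xi, wi in zip(x, w)) for x in X]


def variance(values):
    """Population variance (a second-order statistic) of a list of floats."""
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values) / len(values)


def gaussian_mi_surrogate(X, y, w):
    """Gaussian surrogate for MI between the 1-D projection and the labels.

    Assumption: every entropy is replaced by that of a Gaussian with the
    same variance, so the score is
        0.5 * log(total variance) - sum_c p_c * 0.5 * log(class variance).
    A class with zero projected variance makes log() fail; this toy does
    not guard against that case.
    """
    z = project(X, w)
    by_class = defaultdict(list)
    for zi, yi in zip(z, y):
        by_class[yi].append(zi)
    n = len(z)
    score = 0.5 * math.log(variance(z))
    for values in by_class.values():
        score -= (len(values) / n) * 0.5 * math.log(variance(values))
    return score


# Toy data (hypothetical): the two classes differ only along the first axis.
X = [[0.0, 0.0], [0.2, 1.0], [-0.2, -1.0],
     [4.0, 0.0], [4.2, 1.0], [3.8, -1.0]]
y = [0, 0, 0, 1, 1, 1]

score_good = gaussian_mi_surrogate(X, y, [1.0, 0.0])  # separating direction
score_poor = gaussian_mi_surrogate(X, y, [0.0, 1.0])  # non-separating direction
```

On this toy data the direction along the first axis scores higher than the direction along the second axis, where the class-conditional and total variances coincide. Because the score depends entirely on estimated variances, poor estimates from few samples would distort it, which is the sensitivity the abstract attributes to methods of this kind.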