Abstract:Linear discriminant analysis (LDA) is a widely used technique for data classification. The method offers adequate performance in many classification problems, but it becomes inefficient when the data covariance matrix is ill-conditioned. This often occurs when the feature space's dimensionality is higher than or comparable to the training data size. Regularized LDA (RLDA) methods based on regularized linear estimators of the data covariance matrix have been proposed to cope with such a situation. The performance of RLDA methods is well studied, with optimal regularization schemes already proposed. In this paper, we investigate the capability of a positive semidefinite ridge-type estimator of the inverse covariance matrix that coincides with a nonlinear (NL) covariance matrix estimator. The estimator is derived by reformulating the score function of the optimal classifier utilizing linear estimation methods, which eventually results in the proposed NL-RLDA classifier. We derive asymptotic and consistent estimators of the proposed technique's misclassification rate under the assumptions of a double-asymptotic regime and multivariate Gaussian model for the classes. The consistent estimator, coupled with a one-dimensional grid search, is used to set the value of the regularization parameter required for the proposed NL-RLDA classifier. Performance evaluations based on both synthetic and real data demonstrate the effectiveness of the proposed classifier. The proposed technique outperforms state-of-art methods over multiple datasets. When compared to state-of-the-art methods across various datasets, the proposed technique exhibits superior performance.

What problem does this paper attempt to address?

This paper attempts to solve the problem of performance degradation of Linear Discriminant Analysis (LDA) in high - dimensional data classification. Specifically, when the dimension of the feature space is comparable to or higher than the amount of training data, the data covariance matrix may become ill - conditioned, which will lead to a significant decline in the performance of the LDA method. To meet this challenge, the paper proposes a regularized LDA method (RLDA), which is based on a non - linear covariance matrix estimator to improve classification performance. ### Main Problems 1. **Ill - conditioned Covariance Matrix in High - Dimensional Data** - When the dimension of the feature space is comparable to or higher than the amount of training data, the sample covariance matrix may become ill - conditioned, resulting in a decline in the performance of the LDA method. - An ill - conditioned covariance matrix may lead to numerical instability, thus affecting the accuracy of the classifier. 2. **Limitations of Existing Regularization Methods** - Existing regularization methods are mainly based on linear estimators, and these methods may not be effective enough in some cases. - A new method is needed to deal with the covariance matrix estimation problem in high - dimensional data more effectively. ### Solutions The paper proposes a new regularized LDA method (NL - RLDA), which uses a non - linear covariance matrix estimator. The specific steps are as follows: 1. **Non - linear Covariance Matrix Estimator** - By redefining the score function of the optimal classifier and using the linear estimation method, a non - linear covariance matrix estimator is derived. - This estimator can better handle the ill - conditioned covariance matrix problem in high - dimensional data. 2. **Asymptotic Performance Analysis** - Under the double - asymptotic assumption (that is, the data dimension and the number of samples grow at a fixed ratio), the asymptotic performance of the proposed classifier is derived. - The misclassification rate of the classifier is analyzed by the method of Random Matrix Theory (RMT). 3. **Consistent Misclassification Rate Estimator** - A consistent misclassification rate estimator is derived to set the regularization parameter. - The optimal value of the regularization parameter is found by using the one - dimensional grid search method. 4. **Performance Evaluation** - The performance of the proposed classifier is evaluated through synthetic data and real - world data sets. - The results show that the proposed classifier outperforms existing LDA - based classifiers and other types of classifiers on multiple data sets. ### Main Contributions 1. **Proposing a New Non - linear Covariance Matrix Estimator** - This estimator can deal with the ill - conditioned covariance matrix problem in high - dimensional data more effectively. 2. **Deriving the Asymptotic Performance of the Classifier** - Under the double - asymptotic assumption, the misclassification rate of the classifier is analyzed, providing theoretical support. 3. **Providing a Consistent Misclassification Rate Estimator** - Through the consistent misclassification rate estimator, the regularization parameter can be set effectively, improving the performance of the classifier. 4. **Experimentally Proving the Effectiveness of the Method** - Through experiments on synthetic data and real - world data sets, the superior performance of the proposed classifier on multiple data sets is verified. In conclusion, by proposing a new non - linear covariance matrix estimator, this paper solves the problem of performance degradation of the LDA method in high - dimensional data, providing an effective solution for high - dimensional data classification.

Regularized Linear Discriminant Analysis Using a Nonlinear Covariance Matrix Estimator

Regularized Linear Discriminant Analysis Using a Nonlinear Covariance Matrix Estimator

Regularized Deep Linear Discriminant Analysis

Linear Discriminant Regularized Regression

Two-Stage Regularized Linear Discriminant Analysis for 2-D Data.

Linear Discriminant Analysis Using Rotational Invariant L1 Norm

Regularized Bilinear Discriminant Analysis for Multivariate Time Series Data

Robust Linear Discriminant Analysis Using Ratio Minimization of L1,2-Norms

Efficient and Robust Sparse Linear Discriminant Analysis for Data Classification

Constrained Linear Discrimination Analysis and Face Recognition

Generalized robust linear discriminant analysis for jointly sparse learning

Sr-LDA:Sparse and Reduced-Rank Linear Discriminant Analysis for High Dimensional Matrix

Generalized two-dimensional linear discriminant analysis with regularization

Multimodal Linear Discriminant Analysis Via Structural Sparsity

High-Dimensional Regularized Discriminant Analysis

Network linear discriminant analysis

A Flexible and Efficient Algorithm for Regularized Fisher Discriminant Analysis.

Covariance Discriminative Learning: A Natural and Efficient Approach to Image Set Classification

L2,1-Norm Regularized Robust and Sparse Linear Discriminant Analysis via an Alternating Direction Method of Multipliers

Locality-regularized Linear Regression Discriminant Analysis for Feature Extraction

Kernel Generalized Nonlinear Discriminant Analysis Algorithm For Pattern Recognition