Abstract:We consider the linear discriminant analysis problem in the high-dimensional settings. In this work, we propose PANDA(PivotAl liNear Discriminant Analysis), a tuning-insensitive method in the sense that it requires very little effort to tune the parameters. Moreover, we prove that PANDA achieves the optimal convergence rate in terms of both the estimation error and misclassification rate. Our theoretical results are backed up by thorough numerical studies using both simulated and real datasets. In comparison with the existing methods, we observe that our proposed PANDA yields equal or better performance, and requires substantially less effort in parameter tuning.

What problem does this paper attempt to address?

This paper attempts to solve the problems of parameter estimation and classification error rate in high - dimensional Linear Discriminant Analysis (LDA). Specifically, the paper proposes a new method named PANDA (PivotA l liNear D iscriminant A nalysis), aiming to reduce the effort of parameter tuning and achieve the optimal convergence rate under high - dimensional data. The following are the main concerns of the paper: 1. **Linear Discriminant Analysis in High - Dimensional Data**: - In the case of high - dimensional data (i.e., the number of features \( p \) is much larger than the number of samples \( n \)), the traditional LDA method is difficult to effectively estimate the covariance matrix \( \Sigma \), resulting in performance degradation. - The paper proposes a new method, PANDA, to meet this challenge. 2. **Features of the PANDA Method**: - **Automatic Adaptability**: The PANDA method is insensitive to parameter tuning and can automatically adapt to different data distributions, reducing the workload of manual parameter adjustment. - **Optimal Convergence Rate**: The PANDA method achieves the optimal convergence rate in both estimation error and classification error rate. 3. **Theoretical Guarantees**: - The paper provides theoretical guarantees for the PANDA method and proves its optimal convergence properties in high - dimensional settings. - Specifically, the PANDA method achieves the same optimal convergence rate as existing methods (such as AdaLDA) in both estimation error and classification error rate. 4. **Numerical Experiments**: - The paper verifies the effectiveness of the PANDA method through experiments on simulated data and real - world data. - The experimental results show that the PANDA method performs excellently in terms of estimation error and classification error rate, and has a short calculation time. 5. **Comparison with Other Methods**: - The paper compares the PANDA method with existing high - dimensional LDA methods (such as LPD and AdaLDA). - The results show that the PANDA method performs well in most cases and even outperforms other methods. In conclusion, this paper aims to solve the problems of parameter estimation and classification performance of LDA under high - dimensional data. By proposing the PANDA method, it achieves efficient and accurate classification in high - dimensional data.

Pivotal Estimation of Linear Discriminant Analysis in High Dimensions

Optimal Feature Selection in High-Dimensional Discriminant Analysis

Two‐Stage Estimation for Ultrahigh Dimensional Sparse Quadratic Discriminant Analysis

Quadratic Discriminant Analysis by Projection

Linear Discriminant Analysis with High-dimensional Mixed Variables

Inference for High-Dimensional Linear Expectile Regression with De-Biasing Method

Adaptive Local Linear Discriminant Analysis

Understanding and Evaluating Sparse Linear Discriminant Analysis

A New Reduced-Rank Linear Discriminant Analysis Method and Its Applications

Diagonally-Dominant Principal Component Analysis

Sparse linear discriminant analysis by thresholding for high dimensional data

Penalized Independence Rule for Testing High-Dimensional Hypotheses

Parzen Discriminant Analysis

Least Square Incremental Linear Discriminant Analysis

Complete discriminant analysis

An Efficient Greedy Search Algorithm for High-Dimensional Linear Discriminant Analysis

An Incremental Linear Discriminant Analysis Using Fixed Point Method

SRDA: an Efficient Algorithm for Large-Scale Discriminant Analysis

Private Estimation and Inference in High-Dimensional Regression with FDR Control

Nonparametric Estimation via Partial Derivatives

Training Linear Discriminant Analysis in Linear Time