Abstract:Detecting out-of-distribution (OOD) data is crucial for ensuring the safe deployment of machine learning models in real-world applications. However, existing OOD detection approaches primarily rely on the feature maps or the full gradient space information to derive OOD scores neglecting the role of most important parameters of the pre-trained network over in-distribution (ID) data. In this study, we propose a novel approach called GradOrth to facilitate OOD detection based on one intriguing observation that the important features to identify OOD data lie in the lower-rank subspace of in-distribution (ID) data. In particular, we identify OOD data by computing the norm of gradient projection on the subspaces considered important for the in-distribution data. A large orthogonal projection value (i.e. a small projection value) indicates the sample as OOD as it captures a weak correlation of the ID data. This simple yet effective method exhibits outstanding performance, showcasing a notable reduction in the average false positive rate at a 95% true positive rate (FPR95) of up to 8% when compared to the current state-of-the-art methods.

What problem does this paper attempt to address?

The problem that this paper attempts to solve is how to effectively detect out - of - distribution (OOD) data in practical applications. Out - of - distribution data refers to those data samples outside the training data distribution. Such data poses challenges to the reliability and security of the model, especially in critical application areas such as healthcare, self - driving cars and physical sciences. Modern deep neural networks (DNNs) produce over - confident predictions when dealing with OOD inputs, which makes it complicated to distinguish in - distribution (ID) data from OOD data. Therefore, the main goal of OOD detection is to develop methods that can accurately detect when the model encounters OOD data, so that the model can reject these inputs or provide more informative responses, such as uncertainty indications or confidence measures. To achieve this goal, the paper proposes a new method - GradOrth, which is based on an interesting observation: the key features for identifying OOD data are located in the low - rank subspace of ID data. Specifically, GradOrth identifies OOD data by calculating the projection norm of the gradient on the subspace considered as an important part of ID data. If the orthogonal projection value of the gradient on these subspaces is large (i.e., the projection value is small), it indicates that the sample has a weak correlation with ID data and is thus classified as OOD. This method is simple and effective, and compared with the current state - of - the - art methods, the average false positive rate (FPR95) at 95% true positive rate (True Positive Rate, TPR) is reduced by up to 8%. The main contributions of the paper include: - Proposing GradOrth, a novel and efficient OOD detection method that utilizes the most important parameter space and its gradient in the pre - trained network to complete the task. - Evaluating the performance of GradOrth on widely used benchmarks and demonstrating competitive results compared with existing post - processing OOD detection baselines. In particular, GradOrth outperforms strong baseline methods in terms of FPR95, with a reduction range from 2.71% to 8.05%. - Conducting a comprehensive analysis, including ablation experiments and theoretical exploration, to enhance the understanding of the proposed OOD detection method. Through these studies, the paper not only provides a new perspective and method for OOD detection but also lays the foundation for further research and development.

GradOrth: A Simple yet Efficient Out-of-Distribution Detection with Orthogonal Projection of Gradients

Low-Dimensional Gradient Helps Out-of-Distribution Detection

RG: OUT-OF-DISTRIBUTION DETECTION WITH REACTIVATE GRADNORM

Out-of-Distribution Detection with Deep Nearest Neighbors

Advancing Out-of-Distribution Detection through Data Purification and Dynamic Activation Function Design

Out-of-distribution detection based on subspace projection of high-dimensional features output by the last convolutional layer

Look Around and Find Out: OOD Detection with Relative Angles

Density-driven Regularization for Out-of-distribution Detection

Detecting Out-of-Distribution Through the Lens of Neural Collapse

Calibrated Out-of-Distribution Detection with a Generic Representation

Out-of-Distribution Detection with Overlap Index

A Simple Test-Time Method for Out-of-Distribution Detection

Logit Scaling for Out-of-Distribution Detection

Boosting Out-of-distribution Detection with Typical Features

Going Beyond Conventional OOD Detection

Detecting Out-of-Distribution Examples with In-distribution Examples and Gram Matrices

Rethinking Out-of-Distribution Detection From a Human-Centric Perspective

Exploring Simple, High Quality Out-of-Distribution Detection with L2 Normalization

Rethinking Out-of-Distribution Detection on Imbalanced Data Distribution

Towards In-Distribution Compatible Out-of-Distribution Detection.

Exploiting Diffusion Prior for Out-of-Distribution Detection