Abstract:Detecting out-of-distribution (OOD) data is crucial for ensuring the safe deployment of machine learning models in real-world applications. However, existing OOD detection approaches primarily rely on the feature maps or the full gradient space information to derive OOD scores neglecting the role of most important parameters of the pre-trained network over in-distribution (ID) data. In this study, we propose a novel approach called GradOrth to facilitate OOD detection based on one intriguing observation that the important features to identify OOD data lie in the lower-rank subspace of in-distribution (ID) data. In particular, we identify OOD data by computing the norm of gradient projection on the subspaces considered important for the in-distribution data. A large orthogonal projection value (i.e. a small projection value) indicates the sample as OOD as it captures a weak correlation of the ID data. This simple yet effective method exhibits outstanding performance, showcasing a notable reduction in the average false positive rate at a 95% true positive rate (FPR95) of up to 8% when compared to the current state-of-the-art methods.
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to effectively detect out - of - distribution (OOD) data in practical applications. Out - of - distribution data refers to those data samples outside the training data distribution. Such data poses challenges to the reliability and security of the model, especially in critical application areas such as healthcare, self - driving cars and physical sciences. Modern deep neural networks (DNNs) produce over - confident predictions when dealing with OOD inputs, which makes it complicated to distinguish in - distribution (ID) data from OOD data. Therefore, the main goal of OOD detection is to develop methods that can accurately detect when the model encounters OOD data, so that the model can reject these inputs or provide more informative responses, such as uncertainty indications or confidence measures.
To achieve this goal, the paper proposes a new method - GradOrth, which is based on an interesting observation: the key features for identifying OOD data are located in the low - rank subspace of ID data. Specifically, GradOrth identifies OOD data by calculating the projection norm of the gradient on the subspace considered as an important part of ID data. If the orthogonal projection value of the gradient on these subspaces is large (i.e., the projection value is small), it indicates that the sample has a weak correlation with ID data and is thus classified as OOD. This method is simple and effective, and compared with the current state - of - the - art methods, the average false positive rate (FPR95) at 95% true positive rate (True Positive Rate, TPR) is reduced by up to 8%.
The main contributions of the paper include:
- Proposing GradOrth, a novel and efficient OOD detection method that utilizes the most important parameter space and its gradient in the pre - trained network to complete the task.
- Evaluating the performance of GradOrth on widely used benchmarks and demonstrating competitive results compared with existing post - processing OOD detection baselines. In particular, GradOrth outperforms strong baseline methods in terms of FPR95, with a reduction range from 2.71% to 8.05%.
- Conducting a comprehensive analysis, including ablation experiments and theoretical exploration, to enhance the understanding of the proposed OOD detection method.
Through these studies, the paper not only provides a new perspective and method for OOD detection but also lays the foundation for further research and development.