OTLP: Output Thresholding Using Mixed Integer Linear Programming

Baran Koseoglu,Luca Traverso,Mohammed Topiwalla,Egor Kraev,Zoltan Szopory
2024-05-18
Abstract:Output thresholding is the technique to search for the best threshold to be used during inference for any classifiers that can produce probability estimates on train and testing datasets. It is particularly useful in high imbalance classification problems where the default threshold is not able to refer to imbalance in class distributions and fail to give the best performance. This paper proposes OTLP, a thresholding framework using mixed integer linear programming which is model agnostic, can support different objective functions and different set of constraints for a diverse set of problems including both balanced and imbalanced classification problems. It is particularly useful in real world applications where the theoretical thresholding techniques are not able to address to product related requirements and complexity of the applications which utilize machine learning models. Through the use of Credit Card Fraud Detection Dataset, we evaluate the usefulness of the framework.
Machine Learning
What problem does this paper attempt to address?
This paper proposes a method named OTLP (Output Thresholding using Mixed Integer Linear Programming) to solve the problem of optimal threshold selection for classifiers during prediction, especially for highly imbalanced classification tasks. In imbalanced datasets, the default threshold may fail to handle the skewness of class distribution, resulting in poor performance. OTLP achieves a model-independent solution using the mixed integer linear programming (MILP) framework, which can adapt to different objective functions and constraints to address various problems, including balanced and imbalanced classification tasks. The paper introduces the working principle of OTLP, which adjusts the decision threshold on the training and validation sets to find the optimal threshold for assigning class labels to the testing data based on model probability estimation. The effectiveness of this framework is demonstrated through evaluation on a credit card fraud detection dataset. The paper also discusses related work, such as other threshold optimization methods, and points out their limitations, such as lack of support for complex constraints or model-specific requirements. The advantages of OTLP lie in its flexibility to handle different types of constraints, support for custom objective functions, and no restriction on model type. The experiments showcase the performance of OTLP under different classifiers, dataset class ratios, objective functions, and constraint settings, proving its ability to find optimal thresholds superior to the default value and its applicability to various types of classification problems. In summary, this paper addresses the optimization of classifier output thresholds, proposes a generic approach applicable to various scenarios, and validates its practicality and effectiveness through experiments.