PolieDRO: a novel classification and regression framework with non-parametric data-driven regularization

Tomás Gutierrez, Davi Valladão, Bernardo K. Pagnoncelli
DOI: https://doi.org/10.1007/s10994-024-06544-9
IF: 5.414
2024-04-16
Machine Learning
Abstract: PolieDRO is a novel analytics framework for classification and regression that harnesses the power and flexibility of data-driven distributionally robust optimization (DRO) to circumvent the need for regularization hyperparameters. Recent literature shows that traditional machine learning methods such as SVM and (square-root) LASSO can be written as Wasserstein-based DRO problems. Inspired by those results, we propose a hyperparameter-free ambiguity set that explores the polyhedral structure of data-driven convex hulls, generating computationally tractable regression and classification methods for any convex loss function. Numerical results based on 100 real-world databases and an extensive experiment with synthetically generated data show that our methods consistently outperform their traditional counterparts.
computer science, artificial intelligence