Global Benchmark Database

Markus Iser,Christoph Jabs
2024-06-27
Abstract:This paper presents Global Benchmark Database (GBD), a comprehensive suite of tools for provisioning and sustainably maintaining benchmark instances and their metadata. The availability of benchmark metadata is essential for many tasks in empirical research, e.g., for the data-driven compilation of benchmarks, the domain-specific analysis of runtime experiments, or the instance-specific selection of solvers. In this paper, we introduce the data model of GBD as well as its interfaces and provide examples of how to interact with them. We also demonstrate the integration of custom data sources and explain how to extend GBD with additional problem domains, instance formats and feature extractors.
Databases,Artificial Intelligence,Logic in Computer Science
What problem does this paper attempt to address?
The main purpose of this paper is to introduce the Global Benchmark Database (GBD), a comprehensive toolkit for providing and sustainably maintaining benchmark instances and their metadata. Specifically, the paper addresses the following key issues: 1. **Availability of Benchmark Metadata**: Benchmark metadata is crucial for many empirical research tasks, such as data-driven benchmark compilation, domain-specific runtime experimental analysis, or instance-specific solver selection. 2. **Cross-Disciplinary Data Integration**: GBD aims to serve as a bridge between data science and empirical research on NP-hard problem classes by seamlessly integrating benchmark data into existing workflows. 3. **Tool Design and Implementation**: The paper details GBD's data model and its interfaces, providing practical examples of how to interact with it. Additionally, it demonstrates methods for integrating custom data sources and how to extend GBD to support more problem domains, instance formats, and feature extractors. 4. **Application Cases**: Examples of successful applications of GBD include cleaning and selecting SAT competition benchmarks, domain-specific solver evaluation, and the analysis of solver portfolios and prediction models. In summary, this paper primarily addresses the effective management and utilization of benchmark instances and their metadata, particularly in the fields of SAT, MaxSAT, and pseudo-Boolean optimization.