Revisiting Variable Ordering for Real Quantifier Elimination using Machine Learning

John Hester, Briland Hitaj, Grant Passmore, Sam Owre, Natarajan Shankar, Eric Yeh
DOI: https://doi.org/10.48550/arXiv.2302.14038
2023-02-27
Formal Languages and Automata Theory
Abstract: Cylindrical Algebraic Decomposition (CAD) is a key proof technique for formal verification of cyber-physical systems. CAD is computationally expensive, with worst-case doubly-exponential complexity. Selecting an optimal variable ordering is paramount to efficient use of CAD. Prior work has demonstrated that machine learning can be useful in determining efficient variable orderings. Much of this work has been driven by CAD problems extracted from applications of the MetiTarski theorem prover. In this paper, we revisit this prior work and consider issues of bias in existing training and test data. We observe that the classical MetiTarski benchmarks are heavily biased towards particular variable orderings. To address this, we apply symmetries to create a new dataset containing more than 41K MetiTarski challenges designed to remove bias. Furthermore, we evaluate issues of information leakage, and test the generalizability of our models on the new dataset.
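One way to picture the symmetry-based augmentation mentioned in the abstract is to rename the variables of a problem under every permutation, so that no single ordering is favoured by how the variables happen to be named. The sketch below is illustrative only and is not taken from the paper: it assumes polynomials are stored as plain strings, that "symmetries" means permutations of variable names, and the function name permuted_variants is hypothetical.

```python
import re
from itertools import permutations


def permuted_variants(polys, variables):
    """Yield one copy of the problem per permutation of its variable names.

    Assumption: each polynomial is a plain string and variables are
    identifier-like tokens such as "x1". This is not the paper's format.
    """
    pattern = re.compile(r"\b(" + "|".join(map(re.escape, variables)) + r")\b")
    for perm in permutations(variables):
        mapping = dict(zip(variables, perm))
        # Substitute in a single pass so swaps such as x1 <-> x2 do not collide.
        yield [pattern.sub(lambda m: mapping[m.group(1)], p) for p in polys]


# Hypothetical three-variable problem, used only to exercise the function.
problem = ["x1^2 + x2 - 1", "x2*x3 - x1"]
variants = list(permuted_variants(problem, ["x1", "x2", "x3"]))
```

With three variables this produces six renamed copies of the problem, one for each ordering of the variable names, including the original under the identity permutation.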
What problem does this paper attempt to address?