OPFData: Large-scale datasets for AC optimal power flow with topological perturbations

Sean Lovett,Miha Zgubic,Sofia Liguori,Sephora Madjiheurem,Hamish Tomlinson,Sophie Elster,Chris Apps,Sims Witherspoon,Luis Piloto
2024-06-19
Abstract:Solving the AC optimal power flow problem (AC-OPF) is critical to the efficient and safe planning and operation of power grids. Small efficiency improvements in this domain have the potential to lead to billions of dollars of cost savings, and significant reductions in emissions from fossil fuel generators. Recent work on data-driven solution methods for AC-OPF shows the potential for large speed improvements compared to traditional solvers; however, no large-scale open datasets for this problem exist. We present the largest readily-available collection of solved AC-OPF problems to date. This collection is orders of magnitude larger than existing readily-available datasets, allowing training of high-capacity data-driven models. Uniquely, it includes topological perturbations - a critical requirement for usage in realistic power grid operations. We hope this resource will spur the community to scale research to larger grid sizes with variable topology.
Machine Learning
What problem does this paper attempt to address?
The paper aims to address the issue of the lack of datasets in the AC Optimal Power Flow (AC-OPF) problem. Specifically: 1. **Research Background**: AC-OPF is crucial in power network planning and operation, as it can improve efficiency and reduce emissions from fossil fuel generation. Although traditional solving methods face high computational costs or lack robustness in real-time applications for large-scale power grids, data-driven methods have shown great potential for significant speed improvements. 2. **Existing Problems**: Currently, there are no large-scale publicly available datasets to support research on the AC-OPF problem. Moreover, the few existing datasets do not adequately simulate the topological changes in real power grid operations, which is essential for practical applications. 3. **Solution**: The paper introduces the largest solved AC-OPF problem dataset to date—OPFData. This dataset is not only extensive in quantity but also includes topological changes, a critical requirement in real power grid operations. By providing this resource, researchers can train higher-capacity data-driven models and push research to expand to larger-scale power grids with variable topologies. In summary, the main goal of the paper is to promote research progress in this field by releasing a large-scale AC-OPF dataset that includes topological perturbations, and to support the development of efficient and reliable methods applicable to real power grid operations.