Introducing Thermodynamics-Informed Symbolic Regression -- A Tool for Thermodynamic Equations of State Development

Viktor Martinek,Ophelia Frotscher,Markus Richter,Roland Herzog
2023-09-06
Abstract:Thermodynamic equations of state (EOS) are essential for many industries as well as in academia. Even leaving aside the expensive and extensive measurement campaigns required for the data acquisition, the development of EOS is an intensely time-consuming process, which does often still heavily rely on expert knowledge and iterative fine-tuning. To improve upon and accelerate the EOS development process, we introduce thermodynamics-informed symbolic regression (TiSR), a symbolic regression (SR) tool aimed at thermodynamic EOS modeling. TiSR is already a capable SR tool, which was used in the research of <a class="link-external link-https" href="https://doi.org/10.1007/s10765-023-03197-z" rel="external noopener nofollow">this https URL</a>. It aims to combine an SR base with the extensions required to work with often strongly scattered experimental data, different residual pre- and post-processing options, and additional features required to consider thermodynamic EOS development. Although TiSR is not ready for end users yet, this paper is intended to report on its current state, showcase the progress, and discuss (distant and not so distant) future directions. TiSR is available at <a class="link-external link-https" href="https://github.com/scoop-group/TiSR" rel="external noopener nofollow">this https URL</a> and can be cited as <a class="link-external link-https" href="https://doi.org/10.5281/zenodo.8317547" rel="external noopener nofollow">this https URL</a>.
Machine Learning,Data Analysis, Statistics and Probability
What problem does this paper attempt to address?
The problem that this paper attempts to solve is the challenges in the development process of thermodynamic equations of state (EOS). Specifically, the development of EOS is a complex and time - consuming process, relying on expensive and extensive experimental data collection, as well as expert knowledge and iterative fine - tuning. To improve and accelerate this process, the author introduces a new tool - Thermodynamics - informed Symbolic Regression (TiSR), which is aimed at thermodynamic EOS modeling. ### Main problems: 1. **Time - consuming development process and dependence on expert experience**: - The development of EOS requires not only a large amount of experimental data, but also the experience and intuition of experts for iterative optimization. - This process is very time - consuming, which limits the rapid development and wide application of EOS. 2. **Inadequate ability to handle scattered experimental data**: - Experimental data often has a large degree of dispersion, and traditional EOS development methods are difficult to effectively handle these data. 3. **Lack of formalization and enforcement of thermodynamic constraints**: - Existing methods fail to fully consider thermodynamic constraints, resulting in models that may not conform to physical laws. ### Solutions: - **TiSR tool**: By combining symbolic regression (SR) and thermodynamic knowledge, TiSR aims to automate the development process of EOS, reduce the dependence on expert knowledge, and improve the accuracy and reliability of the model. - **Extended functions**: TiSR has the functions of handling scattered experimental data, pre - processing and post - processing options, and considering thermodynamic constraints. - **Early stopping and simplification**: To prevent over - fitting and improve generalization performance, TiSR introduces an early - stopping mechanism and simplifies expressions through methods such as LASSO regularization. ### Future directions: - **More constraint conditions**: It is planned to add more constraint conditions to ensure that the model complies with thermodynamic laws. - **Support for more variable types**: Such as support for factor variables, in order to better handle nominal variables. - **Combination of experiment and calculation**: Further combine experimental and computational methods to accelerate the development of EOS. Through these improvements, TiSR is expected to significantly improve the efficiency and accuracy of EOS development, thereby promoting research and applications in related fields.