A combined clustering/symbolic regression framework for fluid property prediction

Filippos Sofos,Avraam Charakopoulos,Konstantinos Papastamatiou,Theodoros E. Karakasidis,Theodoros E Karakasidis
DOI: https://doi.org/10.1063/5.0096669
IF: 3.5
2022-05-22
Physics Today
Abstract:Physics of Fluids, Volume 0, Issue ja, -Not available-. Symbolic regression (SR) techniques are constantly gaining ground in materials informatics, as the machine learning counterpart capable of providing analytical equations exclusively derived from data. When the feature space is unknown, unsupervised learning is incorporated to discover and explore hidden connections between data points and may suggest a regional solution, specific for a group of data. In this work, we develop a Lennard-Jones (LJ) fluid descriptor based on density and temperature values and investigate the similarity between data corresponding to diffusion coefficients. Descriptions are linked with aid of clustering algorithms which lead to fluid groups with similar behavior, bound to physical laws. Keeping in mind that the fluid data space goes over the gas, liquid, and supercritical states, we compare clustering results to this categorization, and found that the proposed methods can detect the gas and liquid states, while distinct supercritical region characteristics are discovered, where fluid density and temperature affect the diffusion coefficient in a more complex way. The incorporation of symbolic regression algorithms on each cluster provides an in-depth investigation on fluid behavior, and regional expressions are proposed.
physics, multidisciplinary
What problem does this paper attempt to address?