Jacobian-Scaled K-means Clustering for Physics-Informed Segmentation of Reacting Flows

Shivam Barwey, Venkat Raman
2024-06-24
Abstract: This work introduces Jacobian-scaled K-means (JSK-means) clustering, which is a physics-informed clustering strategy centered on the K-means framework. The method allows for the injection of underlying physical knowledge into the clustering procedure through a distance function modification: instead of leveraging conventional Euclidean distance vectors, the JSK-means procedure operates on distance vectors scaled by matrices obtained from dynamical system Jacobians evaluated at the cluster centroids. The goal of this work is to show how the JSK-means algorithm -- without modifying the input dataset -- produces clusters that capture regions of dynamical similarity, in that the clusters are redistributed towards high-sensitivity regions in phase space and are described by similarity in the source terms of samples instead of the samples themselves. The algorithm is demonstrated on a complex reacting flow simulation dataset (a channel detonation configuration), where the dynamics in the thermochemical composition space are known through the highly nonlinear and stiff Arrhenius-based chemical source terms. Interpretations of cluster partitions in both physical space and composition space reveal how JSK-means shifts clusters produced by standard K-means towards regions of high chemical sensitivity (e.g., towards regions of peak heat release rate near the detonation reaction zone). The findings presented here illustrate the benefits of utilizing Jacobian-scaled distances in clustering techniques, and the JSK-means method in particular displays promising potential for improving former partition-based modeling strategies in reacting flow (and other multi-physics) applications.
Computational Physics, Machine Learning, Fluid Dynamics
What problem does this paper attempt to address?
The paper addresses a limitation of conventional clustering when it is applied to reacting flows (such as detonating flows): the resulting clusters do not capture regions of dynamical similarity. Standard K-means groups samples by the values of their features (temperature, species concentrations, and so on) and ignores the dynamics that act on those features. As a result, the partitions may not reflect similarity in the chemical source terms, such as reaction rates, across different regions of the flow.
To overcome this limitation, the paper proposes Jacobian-scaled K-means (JSK-means) clustering. The method modifies only the distance function: the distance vector from a sample to a centroid is scaled by a matrix obtained from the dynamical system Jacobian evaluated at that centroid, which injects physical knowledge into the clustering without altering the input dataset. Clusters are then described by similarity in the source terms of the samples rather than by similarity in the samples themselves, and they shift towards high-sensitivity regions, such as the region of peak heat release rate near the detonation reaction zone.
The paper demonstrates JSK-means on a complex channel detonation simulation dataset and argues that the method shows promise for improving partition-based modeling strategies in reacting flow and other multi-physics applications.
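The distance modification described above can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes that the scaling matrix is the Jacobian itself, evaluated at each centroid, and that centroids are updated with the ordinary arithmetic mean; the paper may construct the scaling matrices and the update step differently. The motivation is a first-order expansion of the source term S around a centroid c, S(x) - S(c) ≈ J(c)(x - c), so the scaled distance ||J(c)(x - c)|| approximates a distance between source terms. The names toy_jacobian, jacobian_scaled_labels, and jsk_means_sketch are hypothetical, and the two-variable toy system stands in for real chemistry.

```python
import numpy as np


def toy_jacobian(x):
    """Jacobian of a hypothetical source term S(x) = [exp(4*x0), 0.1*x1].

    The exponential mimics a stiff, Arrhenius-like sensitivity in x0.
    """
    return np.array([[4.0 * np.exp(4.0 * x[0]), 0.0],
                     [0.0, 0.1]])


def jacobian_scaled_labels(X, centroids, jacobian):
    """Assign each sample to the centroid k minimizing ||J(c_k) (x - c_k)||."""
    dists = np.empty((X.shape[0], centroids.shape[0]))
    for k, c in enumerate(centroids):
        A = jacobian(c)             # assumed scaling matrix: Jacobian at centroid
        scaled = (X - c) @ A.T      # row i equals A @ (x_i - c)
        dists[:, k] = np.linalg.norm(scaled, axis=1)
    return np.argmin(dists, axis=1)


def jsk_means_sketch(X, n_clusters, jacobian, n_iter=50, seed=0):
    """Lloyd-style iteration with Jacobian-scaled assignment.

    The mean-based centroid update is an assumption of this sketch.
    """
    rng = np.random.default_rng(seed)
    start = rng.choice(X.shape[0], n_clusters, replace=False)
    centroids = X[start].copy()
    for _ in range(n_iter):
        labels = jacobian_scaled_labels(X, centroids, jacobian)
        for k in range(n_clusters):
            members = X[labels == k]
            if len(members) > 0:    # an empty cluster keeps its old centroid
                centroids[k] = members.mean(axis=0)
    labels = jacobian_scaled_labels(X, centroids, jacobian)
    return centroids, labels


if __name__ == "__main__":
    rng = np.random.default_rng(1)
    X = rng.uniform(0.0, 1.0, size=(200, 2))
    centroids, labels = jsk_means_sketch(X, 3, toy_jacobian)
    print(centroids)
    print(np.bincount(labels, minlength=3))
```

In this toy system, with centroids fixed at (0, 0.5) and (1, 0.5), a sample at (0.9, 0.5) is assigned to the centroid at x0 = 0 even though the centroid at x0 = 1 is closer in Euclidean terms. The Jacobian at x0 = 1 is much larger, so the cluster there only claims samples in a narrow neighborhood, which is the kind of redistribution towards high-sensitivity regions that the summary describes.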