Interpretable machine learning models: a physics-based view

Ion Matei, Johan de Kleer, Christoforos Somarakis, Rahul Rai, John S. Baras
DOI: https://doi.org/10.48550/arXiv.2003.10025
2020-03-23
Abstract: To understand changes in physical systems and facilitate decisions, explaining how model predictions are made is crucial. We use model-based interpretability, where models of physical systems are constructed by composing basic constructs that explain locally how energy is exchanged and transformed. We use the port Hamiltonian (p-H) formalism to describe the basic constructs that contain physically interpretable processes commonly found in the behavior of physical systems. We describe how we can build models out of the p-H constructs and how we can train them. In addition, we show how we can impose physical properties such as dissipativity that ensure numerical stability of the training process. We give examples of how to build and train models for describing the behavior of two physical systems: the inverted pendulum and swarm dynamics.
Artificial Intelligence, Dynamical Systems, Optimization and Control
What problem does this paper attempt to address?
This paper addresses the lack of transparency in machine-learning models of physical systems: such models rarely explain how their predictions are made, which makes it hard to understand changes in the system and to support decisions. The authors focus on constructing physically interpretable models by composing basic constructs that describe locally how energy is exchanged and transformed. Such models not only predict the dynamic behavior of a physical system but also expose the physical mechanisms behind each prediction, which improves their comprehensibility and credibility. This matters for fault detection and isolation, for system design feedback, and for debugging the machine-learning models themselves. Using the port-Hamiltonian (p-H) formalism, the paper proposes a method for building and training such models, imposes physical properties such as dissipativity to ensure the numerical stability of the training process, and demonstrates the approach on two physical systems: the inverted pendulum and swarm dynamics.
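To make the p-H formalism and the dissipativity property concrete, here is a minimal sketch of a damped pendulum written in the standard port-Hamiltonian form x' = (J - R) ∇H(x), with J skew-symmetric and R positive semidefinite. It is not the authors' implementation. The parameter values (`M`, `L`, `G_ACC`, `D`), the function names, and the choice of a fixed-step RK4 integrator are all assumptions made for illustration. With no input, the stored energy H satisfies dH/dt = -∇Hᵀ R ∇H ≤ 0, so the simulated energy should not increase.

```python
import numpy as np

# Hypothetical parameters (not from the paper): mass, length, gravity, damping.
M, L, G_ACC, D = 1.0, 1.0, 9.81, 0.5

# Interconnection matrix J (skew-symmetric) and dissipation matrix R (PSD).
J = np.array([[0.0, 1.0], [-1.0, 0.0]])
R = np.array([[0.0, 0.0], [0.0, D]])


def hamiltonian(x):
    """Total energy: kinetic plus potential, for state x = (angle q, momentum p)."""
    q, p = x
    return p**2 / (2.0 * M * L**2) + M * G_ACC * L * (1.0 - np.cos(q))


def grad_hamiltonian(x):
    """Gradient of H with respect to (q, p)."""
    q, p = x
    return np.array([M * G_ACC * L * np.sin(q), p / (M * L**2)])


def dynamics(x):
    """Unforced port-Hamiltonian dynamics: x' = (J - R) grad H(x)."""
    return (J - R) @ grad_hamiltonian(x)


def rk4_step(x, dt):
    """One classical fourth-order Runge-Kutta step."""
    k1 = dynamics(x)
    k2 = dynamics(x + 0.5 * dt * k1)
    k3 = dynamics(x + 0.5 * dt * k2)
    k4 = dynamics(x + dt * k3)
    return x + (dt / 6.0) * (k1 + 2.0 * k2 + 2.0 * k3 + k4)


def simulate(x0, dt=0.01, steps=1000):
    """Integrate from x0 and return the trajectory as a (steps + 1, 2) array."""
    xs = [np.asarray(x0, dtype=float)]
    for _ in range(steps):
        xs.append(rk4_step(xs[-1], dt))
    return np.array(xs)


# Release the pendulum from 1 rad at rest and track the stored energy.
trajectory = simulate([1.0, 0.0])
energies = np.array([hamiltonian(x) for x in trajectory])
```

Plotting `energies` against time shows the energy decaying as the damping term removes it from the system. In a learned model, the same structure could be kept by parameterizing H, J, and R so that J stays skew-symmetric and R stays positive semidefinite, which is one way to read the abstract's claim that imposing dissipativity helps keep training numerically stable.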