Efficient Online Inference and Learning in Partially Known Nonlinear State-Space Models by Learning Expressive Degrees of Freedom Offline

Jan-Hendrik Ewering, Björn Volkmann, Simon F. G. Ehlers, Thomas Seel, Michael Meindl
2024-09-14
Abstract: Intelligent real-world systems critically depend on expressive information about their system state and changing operation conditions, e.g., due to variation in temperature, location, wear, or aging. To provide this information, online inference and learning attempts to perform state estimation and (partial) system identification simultaneously. Current works combine tailored estimation schemes with flexible learning-based models but suffer from convergence problems and computational complexity due to many degrees of freedom in the inference problem (i.e., parameters to determine). To resolve these issues, we propose a procedure for data-driven offline conditioning of a highly flexible Gaussian Process (GP) formulation such that online learning is restricted to a subspace, spanned by expressive basis functions. Due to the simplicity of the transformed problem, a standard particle filter can be employed for Bayesian inference. In contrast to most existing works, the proposed method enables online learning of target functions that are nested nonlinearly inside a first-principles model. Moreover, we provide a theoretical quantification of the error, introduced by restricting learning to a subspace. A Monte-Carlo simulation study with a nonlinear battery model shows that the proposed approach enables rapid convergence with significantly fewer particles compared to a baseline and a state-of-the-art method.
Systems and Control
What problem does this paper attempt to address?
This paper attempts to address the problem of online state estimation and system identification under complex and varying conditions. Specifically, the paper focuses on the problem of online inference and learning in Partially Known Nonlinear State-Space Models. These problems typically arise in real-world intelligent systems that need to adjust their behavior based on changing operating conditions such as temperature, location, wear, or aging.

### Main Issues

1. **High-dimensional search space**: Current methods face convergence and computational complexity issues when dealing with inference problems that have many degrees of freedom (i.e., parameters that need to be determined).
2. **Lack of expert knowledge**: In many cases, expert knowledge may not be available, making online learning more difficult.
3. **Real-time adaptability**: Existing methods often rely on offline data to train highly flexible learning models, which limits their ability to adapt to changing conditions in real-time.

### Solution

To overcome the above issues, the paper proposes a new method that restricts online learning to a low-dimensional subspace by learning expressive degrees of freedom (DOF) offline. The specific steps are as follows:

1. **Offline data-driven conditioning**:
   - Use a highly flexible Gaussian Process (GP) model to capture different realizations of the system offline.
   - Extract the most important modes through Singular Value Decomposition (SVD) to construct a new set of expressive basis functions.
2. **Efficient online inference and learning**:
   - Utilize a standard Particle Filter (PF) for Bayesian inference. Due to the problem simplification, this method performs well in the low-dimensional subspace.
   - Further improve the robustness and adaptability of the algorithm through adaptive noise adjustment.

### Experimental Validation

The paper demonstrates the effectiveness of the proposed method through a Monte Carlo simulation study of a nonlinear battery model. The results show that the method can achieve rapid convergence with a significantly reduced number of particles and performs better than baseline and state-of-the-art methods.

### Conclusion

The proposed method effectively addresses the high-dimensional search space problem in online inference and learning by learning expressive degrees of freedom offline, improving computational efficiency and real-time adaptability. The method performs well in handling Partially Known Nonlinear State-Space Models without requiring expert knowledge and has broad application prospects.
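
The two-stage idea summarized above (SVD-based basis extraction offline, particle filtering over a few subspace weights online) can be illustrated with a small toy sketch. This is not the authors' implementation, and it makes several assumptions:

- Draws from a GP prior stand in for the offline system realizations.
- The model is measurement-only with static weights, rather than the paper's battery state-space model.
- Multinomial resampling with fixed random-walk jitter replaces the adaptive noise adjustment.
- The names `rbf_kernel`, `g`, `theta_true`, and all sizes and noise levels are hypothetical choices made for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)

# --- Offline: build expressive basis functions from GP realizations ---
n_grid, n_samples, r = 50, 200, 3            # hypothetical sizes
x_grid = np.linspace(0.0, 1.0, n_grid)

def rbf_kernel(a, b, length_scale=0.2):
    """Squared-exponential covariance between two 1-D input arrays."""
    d = a[:, None] - b[None, :]
    return np.exp(-0.5 * (d / length_scale) ** 2)

# Stand-in for "different realizations of the system": draws from a GP prior.
K = rbf_kernel(x_grid, x_grid) + 1e-6 * np.eye(n_grid)   # jitter for stability
snapshots = np.linalg.cholesky(K) @ rng.standard_normal((n_grid, n_samples))

# SVD of the snapshot matrix; the leading left singular vectors act as basis.
U, s, _ = np.linalg.svd(snapshots, full_matrices=False)
basis = U[:, :r]                              # shape (n_grid, r)
energy = np.sum(s[:r] ** 2) / np.sum(s ** 2)  # fraction of variance retained
# Squared Frobenius error of projecting the snapshots onto the subspace.
proj_err = np.linalg.norm(snapshots - basis @ (basis.T @ snapshots)) ** 2

# --- Online: bootstrap particle filter over the r subspace weights ---
def g(z):
    """Hypothetical known nonlinearity wrapping the unknown function."""
    return z + 0.1 * z ** 3

n_particles, n_steps = 2000, 300
sigma_y, sigma_jitter = 0.1, 0.05             # assumed noise levels
theta_true = np.array([3.0, -2.0, 1.0])       # assumed ground-truth weights
prior_std = s[:r] / np.sqrt(n_samples)        # spread of the offline coefficients
particles = prior_std * rng.standard_normal((n_particles, r))

for _ in range(n_steps):
    i = rng.integers(n_grid)                  # known operating point at this step
    y = g(basis[i] @ theta_true) + sigma_y * rng.standard_normal()
    # Propagate: random-walk jitter keeps the static weights adaptable.
    particles += sigma_jitter * rng.standard_normal((n_particles, r))
    # Weight: Gaussian measurement likelihood, computed in log space.
    log_w = -0.5 * ((y - g(particles @ basis[i])) / sigma_y) ** 2
    w = np.exp(log_w - log_w.max())
    w /= w.sum()
    # Resample (multinomial) to counter weight degeneracy.
    particles = particles[rng.choice(n_particles, size=n_particles, p=w)]

theta_hat = particles.mean(axis=0)
rmse = np.sqrt(np.mean((basis @ (theta_hat - theta_true)) ** 2))
print(f"retained energy: {energy:.3f}, function RMSE: {rmse:.3f}")
```

In this sketch the filter searches over only `r = 3` weights instead of one value per grid point, which is the kind of dimensionality reduction the summary credits for needing fewer particles. The quantity `proj_err` equals the sum of the discarded squared singular values, a standard property of truncated SVD; it is shown only as an analogy to the subspace-restriction error mentioned in the abstract and is not the paper's bound.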