Abstract: Current sensing methods often ignore the fact that their sensing targets are dynamic and can change over time. As a result, building an accurate model should not always be the first priority; what we need is an adaptive modelling framework. The lack of such adaptability hinders us from building more intelligent sensing systems. In this paper, we draw inspiration from human cognition to design a more intelligent sensing and modelling system that can adaptively detect anomalies. Based on our understanding of the free-energy and Infomax principles, the goal of sensing and modelling is not to collect as much data as possible, or to build the most accurate model, but to establish an adaptive representation of the target and to balance sensing performance against system resource consumption. Formally, from the perspective of free-energy minimization, this corresponds to a balance between accuracy and the minimization of complexity costs. To achieve this goal, we adopt a working memory mechanism to help the model evolve with the target, and we use a deep autoencoder network as the model representation, which models complex data through its nonlinear and hierarchical architecture. Since we typically have only partial observations of the sensed target, we design a variant of the autoencoder that can reconstruct corrupted input. We use an attentional surprise mechanism to control model updates: training of the deep network is driven by detected surprises (anomalies), which indicate either model failure or new behaviour of the target. Because of partial observations, we cannot minimize free energy in a single update; instead, we minimize it iteratively by finding new optimization bounds. While both random and non-random sensor selection can create new optimization bounds, non-random methods such as the surprise minimization used in this paper demonstrate better performance.
In our system, the model update frequency is controlled by several parameters, including the surprise threshold and the memory size. These parameters control the alertness as well as the resource consumption of the system in a top-down manner. For evaluation, we conducted experiments on simulated data to test whether our methodology makes the model more adaptive, and the results showed that it does. We also applied our method to a real application, EEG (electroencephalography) seizure detection, where it exhibited the features we desired.
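
As an illustration only, the surprise-gated update loop described in the abstract can be sketched as follows. This is not the authors' implementation: the deep autoencoder is replaced here by a toy per-channel mean, and the names `SurpriseGatedModel`, `surprise_threshold`, `memory_size`, and `run_demo` are hypothetical. The sketch assumes that surprise is the mean squared reconstruction error over the observed channels, that missing channels are marked with `None`, and that only surprising observations enter the bounded working memory and trigger a model update.

```python
from collections import deque


class SurpriseGatedModel:
    """Toy stand-in for the deep autoencoder: one running mean per channel."""

    def __init__(self, n_channels, surprise_threshold, memory_size):
        self.mean = [0.0] * n_channels
        self.threshold = surprise_threshold
        # Bounded working memory: the oldest entries are dropped automatically.
        self.memory = deque(maxlen=memory_size)
        self.updates = 0

    def surprise(self, observation):
        """Mean squared reconstruction error over observed channels only."""
        errors = [
            (x - m) ** 2
            for x, m in zip(observation, self.mean)
            if x is not None  # None marks an unobserved channel (assumption)
        ]
        return sum(errors) / len(errors) if errors else 0.0

    def observe(self, observation):
        """Return True if the observation is surprising; update the model if so."""
        is_anomaly = self.surprise(observation) > self.threshold
        if is_anomaly:
            self.memory.append(observation)
            self._refit()
        return is_anomaly

    def _refit(self):
        """Re-estimate each channel mean from the contents of working memory."""
        for c in range(len(self.mean)):
            values = [obs[c] for obs in self.memory if obs[c] is not None]
            if values:
                self.mean[c] = sum(values) / len(values)
        self.updates += 1


def run_demo():
    # Simulated target whose behaviour shifts abruptly halfway through.
    stream = [[0.0, 0.0]] * 20 + [[5.0, 5.0]] * 20
    model = SurpriseGatedModel(n_channels=2, surprise_threshold=1.0, memory_size=4)
    flags = [model.observe(obs) for obs in stream]
    return model, flags


if __name__ == "__main__":
    model, flags = run_demo()
    print("anomaly flagged at steps:", [i for i, f in enumerate(flags) if f])
    print("model updates:", model.updates)
```

In this sketch, a higher `surprise_threshold` makes the model less alert and triggers fewer updates, while `memory_size` bounds how much past evidence is retained, mirroring the trade-off between sensing performance and resource consumption that the abstract describes.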

Intrinsic-Motivated Sensor Management: Exploring with Physical Surprise

Cognitive Sensing: Adaptive Anomalies Detection with Deep Networks

Model-Based Robot Learning Control with Uncertainty Directed Exploration

Look Before You Leap: Safe Model-Based Reinforcement Learning with Human Intervention

Analyzing and Improving Supervised Nonlinear Dynamical Probabilistic Latent Variable Model for Inferential Sensors

Distributed Self-Monitoring Sensor Networks Via Markov Switching Dynamic Linear Models

Sensor Control for Information Gain in Dynamic, Sparse and Partially Observed Environments

Resource-Efficient Sensor Data Management for Autonomous Systems Using Deep Reinforcement Learning

Intrinsic Motivation Driven Intuitive Physics Learning using Deep Reinforcement Learning with Intrinsic Reward Normalization

Measuring and Modeling Physical Intrinsic Motivation

Learning to See Physical Properties with Active Sensing Motor Policies

Physics-informed Dyna-style model-based deep reinforcement learning for dynamic control

Learning Off-policy with Model-based Intrinsic Motivation For Active Online Exploration

Self-Supervised Exploration via Temporal Inconsistency in Reinforcement Learning

Physics-guided machine learning from simulated data with different physical parameters

Deep Reinforcement Learning Sensor Scheduling for Effective Monitoring of Dynamical Systems

Pay Attention to How You Drive: Safe and Adaptive Model-Based Reinforcement Learning for Off-Road Driving

Optimal Policies Search for Sensor Management

Learning Intuitive Physics and One-Shot Imitation Using State-Action-Prediction Self-Organizing Maps

Scheduled Intrinsic Drive: A Hierarchical Take on Intrinsically Motivated Exploration

Reliable Proactive Adaptation Via Prediction Fusion and Extended Stochastic Model Predictive Control