Abstract:An autonomous experimentation platform in manufacturing is supposedly capable of conducting a sequential search for finding suitable manufacturing conditions by itself or even for discovering new materials with minimal human intervention. The core of the intelligent control of such platforms is a policy to decide where to conduct the next experiment based on what has been done thus far. Such policy inevitably trades off between exploitation and exploration. Currently, the prevailing approach is to use various acquisition functions in the Bayesian optimization framework. We discuss whether it is beneficial to trade off exploitation versus exploration by measuring the element and degree of surprise associated with the immediate past observation. We devise a surprise-reacting policy using two existing surprise metrics, known as the Shannon surprise and Bayesian surprise. Our analysis shows that the surprise-reacting policy appears to be better suited for quickly characterizing the overall landscape of a response surface under resource constraints. We do not claim that we have a fully autonomous experimentation system but believe that the surprise-reacting capability benefits the automation of sequential decisions in autonomous experimentation.

What problem does this paper attempt to address?

The problem that this paper attempts to solve is how to achieve effective decision - making in an autonomous experimental platform during the manufacturing process. In particular, in the case of limited resources, how to quickly explore and characterize the overall landscape of the response surface. Specifically, the paper explores better ways to find a balance between exploration and exploitation, and proposes a sequential experimental strategy based on "surprise reaction". ### Background and Motivation Traditional autonomous experimental platforms mainly rely on various acquisition functions in the Bayesian Optimization (BO) framework to decide the location of the next experiment. However, these methods are often too greedy and tend to converge to local optimal solutions too quickly, ignoring the exploration of unknown areas. To overcome this limitation, the paper proposes a novel concept - guiding experimental decisions by measuring and reacting to "surprise". ### Core Problems 1. **How to define and quantify "surprise"?** - The paper adopts two existing surprise metrics: Shannon surprise and Bayesian surprise. - **Shannon surprise**: Use negative log - likelihood to measure the difference between the observed value and the current model prediction. - **Bayesian surprise**: Use Kullback - Leibler (KL) divergence to measure the degree of change in model beliefs caused by a new data point. 2. **How to react to "surprise"?** - When "surprise" is detected, the system will spend extra resources to confirm whether the observed value is valid. - If the observed value is valid, update the model; if not, maintain the current hypothesis and continue to explore. 3. **What are the impacts of using the "surprise - reaction" strategy?** - Experiments have proven that this strategy can explore the design space more effectively in the case of limited resources and performs better than traditional acquisition function methods. ### Methods and Contributions - **Statistical Model**: The paper selects the Gaussian Process (GP) as the statistical model for modeling experimental data and making predictions. - **Surprise Metrics**: It details how to calculate Shannon surprise and Bayesian surprise, and proposes how to decide the location of the next experiment based on these metrics. - **Experimental Evaluation**: By comparing with different Bayesian optimization acquisition functions (such as EI, PI, UCB, and MaxVar), it shows the advantages of the "surprise - reaction" strategy. ### Conclusion The main contribution of the paper is to propose a new "surprise - reaction" strategy, which can dynamically adjust the balance between exploration and exploitation, thereby exploring the design space more effectively in the case of limited resources. This method not only helps to quickly characterize the overall landscape of the response surface but also can improve the performance of autonomous experimental platforms in practical applications.

Towards Futuristic Autonomous Experimentation--A Surprise-Reacting Sequential Experiment Policy

Towards Futuristic Autonomous Experimentation--A Surprise-Reacting Sequential Experiment Policy

Safe Sim-to-Real Robot Exploration with Constrained Bayesian Optimization

Guiding the Sequential Experiments in Autonomous Experimentation Platforms through EI-based Bayesian Optimization and Bayesian Model Averaging

Adaptive Sequential Experiments with Unknown Information Arrival Processes

Curiosity-Driven Exploration via Latent Bayesian Surprise

Explore and Control with Adversarial Surprise

A dynamic Bayesian optimized active recommender system for curiosity-driven partially Human-in-the-loop automated experiments

Using Surprise Index for Competency Assessment in Autonomous Decision-Making

CARE: Confidence-rich Autonomous Robot Exploration Using Bayesian Kernel Inference and Optimization

Equipping data-driven experiment planning for Self-driving Laboratories with semantic memory: case studies of transfer learning in chemical reaction optimization

Experimentation Platforms Meet Reinforcement Learning: Bayesian Sequential Decision-Making for Continuous Monitoring

Emulating Expert Insight: A Robust Strategy for Optimal Experimental Design

Expecting the Unexpected: Developing Autonomous-System Design Principles for Reacting to Unpredicted Events and Conditions

Active Exploration via Experiment Design in Markov Chains

Intrinsic-Motivated Sensor Management: Exploring with Physical Surprise

Bayesian Optimization for Policy Search via Online-Offline Experimentation

Policy-Based Bayesian Experimental Design for Non-Differentiable Implicit Models

The Exploration-Exploitation Dilemma Revisited: An Entropy Perspective

Autonomous Scene Exploration Using Experience Enhancement

Optimization-Driven Adaptive Experimentation