Human-in-the-loop Reinforcement Learning for Data Quality Monitoring in Particle Physics Experiments

Olivia Jullian Parra,Julián García Pardiñas,Lorenzo Del Pianta Pérez,Maximilian Janisch,Suzanne Klaver,Thomas Lehéricy,Nicola Serra

2024-05-24

Abstract:Data Quality Monitoring (DQM) is a crucial task in large particle physics experiments, since detector malfunctioning can compromise the data. DQM is currently performed by human shifters, which is costly and results in limited accuracy. In this work, we provide a proof-of-concept for applying human-in-the-loop Reinforcement Learning (RL) to automate the DQM process while adapting to operating conditions that change over time. We implement a prototype based on the Proximal Policy Optimization (PPO) algorithm and validate it on a simplified synthetic dataset. We demonstrate how a multi-agent system can be trained for continuous automated monitoring during data collection, with human intervention actively requested only when relevant. We show that random, unbiased noise in human classification can be reduced, leading to an improved accuracy over the baseline. Additionally, we propose data augmentation techniques to deal with scarce data and to accelerate the learning process. Finally, we discuss further steps needed to implement the approach in the real world, including protocols for periodic control of the algorithm's outputs.

High Energy Physics - Experiment,Machine Learning

What problem does this paper attempt to address?

This paper discusses the importance of Data Quality Monitoring (DQM) in large-scale particle physics experiments and the current issues that exist. Currently, DQM is mainly performed by manual operators, which is both expensive and may result in limited classification accuracy. The paper proposes a human-in-the-loop reinforcement learning (RL) approach to automate the DQM process and adapt to changing operating conditions over time. They implemented a prototype based on the Proximal Policy Optimization (PPO) algorithm and validated it on a simplified artificial dataset. The study shows that multi-agent systems can be trained to continuously monitor the conditions during data collection and request human intervention only when necessary. Through this approach, random and unbiased noise in human classification can be reduced, improving accuracy. In addition, the paper proposes data augmentation techniques to address data scarcity issues and accelerate the learning process. Future implementation of this approach will require further steps, including regular checks on algorithm output. The paper divided the experiments into online and offline phases and discussed the advantages of RL in adapting to changing conditions and improving efficiency. The experimental results demonstrate that RL can reduce dependence on human resources and improve the efficiency and accuracy of DQM. In summary, the paper aims to address how to use reinforcement learning to automate data quality monitoring in particle physics experiments, while adapting to evolving operational conditions, reducing limitations of manual operation, and improving the accuracy and efficiency of data processing.

Human-in-the-loop Reinforcement Learning for Data Quality Monitoring in Particle Physics Experiments

Data Quality Aware Hierarchical Federated Reinforcement Learning Framework for Dynamic Treatment Regimes

PDRL: Multi-Agent based Reinforcement Learning for Predictive Monitoring

Proximal Policy Gradient Arborescence for Quality Diversity Reinforcement Learning

Deep Q-Learning versus Proximal Policy Optimization: Performance Comparison in a Material Sorting Task

Human-in-the-Loop Reinforcement Learning in Continuous-Action Space

Probabilistic Automata-Based Method for Enhancing Performance of Deep Reinforcement Learning Systems

Reinforcement Learning from Automatic Feedback for High-Quality Unit Test Generation

Quality-Diversity Optimisation on a Physical Robot Through Dynamics-Aware and Reset-Free Learning

Reinforcement learning-trained optimisers and Bayesian optimisation for online particle accelerator tuning

Microsecond-Latency Feedback at a Particle Accelerator by Online Reinforcement Learning on Hardware

RadDQN: a Deep Q Learning-based Architecture for Finding Time-efficient Minimum Radiation Exposure Pathway

Automating proton PBS treatment planning for head and neck cancers using policy gradient-based deep reinforcement learning

Injection Optimization at Particle Accelerators via Reinforcement Learning: From Simulation to Real-World Application

Residual Physics and Post-Posed Shielding for Safe Deep Reinforcement Learning Method

Adaptive Multi-Agent Deep Reinforcement Learning for Timely Healthcare Interventions

Trustworthy Human-AI Collaboration: Reinforcement Learning with Human Feedback and Physics Knowledge for Safe Autonomous Driving

Deep Q-learning From Demonstrations

Particle Swarm Based Reinforcement Learning.

Deep Reinforcement Learning for 2D Physics-Based Object Manipulation in Clutter

Auxiliary Task-based Deep Reinforcement Learning for Quantum Control