Abstract:Deep reinforcement learning (DRL) has demonstrated significant potential in industrial manufacturing domains such as workshop scheduling and energy system management. However, due to the model's inherent uncertainty, rigorous validation is requisite for its application in real-world tasks. Specific tests may reveal inadequacies in the performance of pre-trained DRL models, while the "black-box" nature of DRL poses a challenge for testing model behavior. We propose a novel performance improvement framework based on probabilistic automata, which aims to proactively identify and correct critical vulnerabilities of DRL systems, so that the performance of DRL models in real tasks can be improved with minimal model modifications. First, a probabilistic automaton is constructed from the historical trajectory of the DRL system by abstracting the state to generate probabilistic decision-making units (PDMUs), and a reverse breadth-first search (BFS) method is used to identify the key PDMU-action pairs that have the greatest impact on adverse outcomes. This process relies only on the state-action sequence and final result of each trajectory. Then, under the key PDMU, we search for the new action that has the greatest impact on favorable results. Finally, the key PDMU, undesirable action and new action are encapsulated as monitors to guide the DRL system to obtain more favorable results through real-time monitoring and correction mechanisms. Evaluations in two standard reinforcement learning environments and three actual job scheduling scenarios confirmed the effectiveness of the method, providing certain guarantees for the deployment of DRL models in real-world applications.

DeepDFA: Automata Learning through Neural Probabilistic Relaxations

Towards Interpreting Recurrent Neural Networks Through Probabilistic Abstraction

Integrating Regular Expressions with Neural Networks via DFA

DeepAuto: A First Step Towards Formal Verification of Deep Learning Systems (S).

i dfa: A novel deterministic finite automaton without state explosion

State-Regularized Recurrent Neural Networks to Extract Automata and Explain Predictions

Cache-Based Scalable Deep Packet Inspection with Predictive Automaton.

LLMs as Probabilistic Minimally Adequate Teachers for DFA Learning

Decision-Guided Weighted Automata Extraction from Recurrent Neural Networks.

DFAMiner: Mining minimal separating DFAs from labelled samples

A Comparative Study of Rule Extraction for Recurrent Neural Networks

A Probabilistic Framework for Deep Learning

The Neural Network Pushdown Automaton: Model, Stack and Learning Simulations

Automaton Distillation: Neuro-Symbolic Transfer Learning for Deep Reinforcement Learning

Probabilistic Automata-Based Method for Enhancing Performance of Deep Reinforcement Learning Systems

Verifying And Interpreting Neural Networks using Finite Automata

Dreaming Learning

Database-assisted automata learning

Learning minimal automata with recurrent neural networks

Deep Recurrent Stochastic Configuration Networks for Modelling Nonlinear Dynamic Systems

TFA : A Tunable Finite Automaton for Regular Expression Matching