Efficient LTL Model Checking of Deep Reinforcement Learning Systems Using Policy Extraction

Peng Jin, Yang Wang, Min Zhang
DOI: https://doi.org/10.18293/seke2022-029
2022-01-01
Abstract: Deep Reinforcement Learning (DRL) is a promising technology for solving intractable control tasks. Its applications in safety-critical fields require high-reliability guarantees. However, formal verification of DRL systems is challenging because deep neural networks (DNNs) embedded in the applications are uninterpretable. In this paper, we propose a novel approach to linear temporal logic (LTL) model checking of DRL systems by extracting interpretable policies from DNNs. The extracted policy can retain comparable performance to the original DNN. More importantly, its decision domain is finite and thus directly verifiable against LTL properties using existing model checking techniques. Experimental results on four classic control systems demonstrate the effectiveness of our approach.