Interpretable and Explainable Logical Policies via Neurally Guided Symbolic Abstraction

Quentin Delfosse, Hikaru Shindo, Devendra Dhami, Kristian Kersting
2023-10-26
Abstract: The limited priors required by neural networks make them the dominant choice to encode and learn policies using reinforcement learning (RL). However, they are also black boxes, making it hard to understand the agent's behaviour, especially when working on the image level. Therefore, neuro-symbolic RL aims at creating policies that are interpretable in the first place. Unfortunately, interpretability is not explainability. To achieve both, we introduce Neurally gUided Differentiable loGic policiEs (NUDGE). NUDGE exploits trained neural network-based agents to guide the search of candidate-weighted logic rules, then uses differentiable logic to train the logic agents. Our experimental evaluation demonstrates that NUDGE agents can induce interpretable and explainable policies while outperforming purely neural ones and adapting well to environments with different initial states and problem sizes.
Machine Learning, Artificial Intelligence, Computation and Language, Logic in Computer Science, Symbolic Computation