Novel approach for explaining the behavior of trained artificial neural networks with distributed representations

Yuanhui Zhou,Yuchang Lü,Chunyi Shi
1999-01-01
Abstract:A difficult problem for neural networks is that concepts learned by them cannot be understood. Rule extraction can offer a promising way to solve it. In this paper, a novel extraction approach composed of two phases is presented to extract rules from neural networks with distributed representations. From hidden layer to output layer, it extracts rules by identifying valid regions from the whole hidden activation space. From input layer to hidden layer, it extracts rules based on the analysis of weights between input units and hidden units so that all instances covered by these rules generate hidden activation vectors lying in the above valid regions. Experiments have demonstrated that this method generates rules of better performance than the decision tree approach under noisy condition and the fidelity of rules extracted in the presented method is higher than that in conventional decompositional approaches, especially for a distributed neural network.
What problem does this paper attempt to address?