Layerwise Change of Knowledge in Neural Networks

Xu Cheng,Lei Cheng,Zhaoran Peng,Yang Xu,Tian Han,Quanshi Zhang
2024-09-13
Abstract:This paper aims to explain how a deep neural network (DNN) gradually extracts new knowledge and forgets noisy features through layers in forward propagation. Up to now, although the definition of knowledge encoded by the DNN has not reached a consensus, Previous studies have derived a series of mathematical evidence to take interactions as symbolic primitive inference patterns encoded by a DNN. We extend the definition of interactions and, for the first time, extract interactions encoded by intermediate layers. We quantify and track the newly emerged interactions and the forgotten interactions in each layer during the forward propagation, which shed new light on the learning behavior of DNNs. The layer-wise change of interactions also reveals the change of the generalization capacity and instability of feature representations of a DNN.
Machine Learning,Artificial Intelligence,Computation and Language,Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
### Problems the Paper Attempts to Solve This paper aims to explain how deep neural networks (DNNs) progressively extract new knowledge and forget noisy features during the forward propagation process. Specifically: 1. **Definition and Quantification of Knowledge**: - The authors extend the definition of "interactions" and, for the first time, extract interactions from intermediate layers. - Researchers quantify and track the newly emerging and forgotten interactions in each layer, revealing the learning behavior of DNNs. 2. **Challenges of Knowledge Change**: - Three main challenges are proposed: alignment of interaction primitives, decomposability and countability of knowledge, and its connection to generalization ability. - By training linear classifiers to extract interactions from each layer and analyzing the fidelity of these interactions, these challenges are addressed. 3. **Fidelity of Interactions**: - The newly defined interaction primitives still belong to the typical interaction paradigm, and their fidelity can be proven through a series of theorems. 4. **Contributions**: - Redefines interactions on intermediate layers and finds that adjacent layers encode similar interactions. - Provides several theoretically verifiable metrics to quantify the newly emerging and forgotten knowledge during the forward propagation process. - Discovers that changes in interactions are also related to the generalization ability of DNNs. In summary, this paper is dedicated to revealing the learning behavior of DNNs by quantifying and tracking interactions in various layers and further understanding the generalization ability of DNNs.