To What Extent Do Different Neural Networks Learn the Same Representation: A Neuron Activation Subspace Match Approach

Liwei Wang, Lunjia Hu, Jiayuan Gu, Zhiqiang Hu, Yue Wu, Kun He, John E. Hopcroft
2018-01-01
Abstract: Studying the learned representations is important for understanding deep neural networks. In this paper, we investigate the similarity of representations learned by two networks with identical architecture but trained from different initializations. Instead of resorting to heuristic methods, we develop a rigorous theory based on the neuron activation subspace match model. The theory gives a complete characterization of the structure of neuron activation subspace matches, where the core concepts are the maximum match and the simple match, which describe the overall and the finest similarity, respectively, between sets of neurons in two networks. We also propose efficient algorithms to find the maximum match and simple matches. Finally, an experimental study using our algorithms suggests that, somewhat surprisingly, representations learned by the same convolutional layers of two networks are not as similar as prevalently expected.