Model Information As an Analysis Tool in Deep Learning

Xiao Zhang, Di Hu, Xingjian Li, Dejing Dou, Ji Wu
2021-01-01
Abstract: Information-theoretic perspectives can provide an alternative dimension for analyzing the learning process and complement the usual performance metrics. Recently, several works have proposed methods for quantifying the information content in a model (which we refer to as model information). We demonstrate the use of information as a general analysis tool to gain insight into problems that arise in deep learning. By utilizing information in different scenarios with different control variables, we are able to adapt information to analyze the fundamental elements of learning, i.e., task, data, model, and algorithm. We provide an example in each domain in which information is used as a tool to provide new solutions to problems or to gain insight into the nature of the particular learning setting. These examples help to illustrate the versatility and potential utility of information as an analysis tool in deep learning.