Towards Better Analysis of Deep Convolutional Neural Networks

Mengchen Liu, Jiaxin Shi, Zhen Li, Chongxuan Li, Jun Zhu, Shixia Liu
DOI: https://doi.org/10.48550/arXiv.1604.07043
2016-05-04
Abstract: Deep convolutional neural networks (CNNs) have achieved breakthrough performance in many pattern recognition tasks such as image classification. However, the development of high-quality deep models typically relies on a substantial amount of trial-and-error, as there is still no clear understanding of when and why a deep model works. In this paper, we present a visual analytics approach for better understanding, diagnosing, and refining deep CNNs. We formulate a deep CNN as a directed acyclic graph. Based on this formulation, a hybrid visualization is developed to disclose the multiple facets of each neuron and the interactions between them. In particular, we introduce a hierarchical rectangle packing algorithm and a matrix reordering algorithm to show the derived features of a neuron cluster. We also propose a biclustering-based edge bundling method to reduce visual clutter caused by a large number of connections between neurons. We evaluated our method on a set of CNNs and the results are generally favorable.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
This paper addresses the problem of how to better understand and analyze deep convolutional neural networks (CNNs). The authors point out that although deep CNNs have achieved breakthrough performance in pattern recognition tasks such as image classification, developing these models usually relies on extensive trial-and-error, because there is still no clear understanding of when and why a deep model works. In addition, because CNNs have a large number of layers, neurons, and connections, it is very difficult to understand the role of each component. The paper therefore proposes a visual analytics approach that aims to help machine-learning experts better understand, diagnose, and refine deep CNNs, thereby reducing the time and resources spent on trial-and-error. To this end, the authors developed an interactive visual analytics system, CNNVis, which represents a CNN as a directed acyclic graph (DAG) and combines several techniques, including hierarchical rectangle packing, matrix reordering, and biclustering-based edge bundling, to show multiple facets of neuron clusters and the interactions between them. The approach helps reveal the roles neurons play when processing different types of images, and it reduces the visual clutter caused by the large number of connections, so that experts can explore the internal mechanisms of deep CNNs more effectively.
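The two structural ideas in the summary above, treating the network as a DAG and merging many neuron-level connections into a few bundles, can be illustrated with a minimal Python sketch. It is not the CNNVis implementation: the toy layer names, the `layer_order` and `bundle_edges` helpers, and the precomputed `cluster_of` mapping are all assumptions made for illustration. In particular, the bundling step simply groups edges by given cluster pairs and stands in for the paper's biclustering algorithm, which this sketch does not reproduce.

```python
from graphlib import TopologicalSorter  # standard library, Python 3.9+

# Hypothetical toy network: each key is a layer, each value is the set of
# layers that feed into it (its predecessors in the DAG).
predecessors = {
    "input": set(),
    "conv1": {"input"},
    "pool1": {"conv1"},
    "conv2": {"pool1"},
    "fc": {"conv2"},
    "softmax": {"fc"},
}


def layer_order(preds):
    """Return the layers in a valid feed-forward (topological) order."""
    return list(TopologicalSorter(preds).static_order())


def bundle_edges(edges, cluster_of):
    """Collapse neuron-to-neuron edges into cluster-to-cluster bundles.

    edges: iterable of (source_neuron, target_neuron, weight) tuples.
    cluster_of: dict mapping a neuron id to its (assumed, precomputed) cluster id.
    Returns a dict mapping (source_cluster, target_cluster) to the summed weight.
    """
    bundles = {}
    for src, dst, weight in edges:
        key = (cluster_of[src], cluster_of[dst])
        bundles[key] = bundles.get(key, 0.0) + weight
    return bundles


order = layer_order(predecessors)
```

Calling `bundle_edges` with three neuron-level edges whose sources fall into two clusters yields two bundles instead of three edges, which is the clutter-reduction effect the summary describes, shown here in its simplest form.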