Tree-Like Branching Network for Multi-class Classification

Mengqi Xue, Jie Song, Li Sun, Mingli Song
DOI: https://doi.org/10.1007/978-3-030-93247-3_18
2022-01-01
Abstract: In multi-task learning, network branching, i.e. specializing branches for different tasks on top of a shared trunk, has been a golden rule. In multi-class classification, however, previous work usually arranges all categories at the last layer of a deep neural network, which implies that all layers are shared by these categories regardless of their varying relationships. In this paper, we study how to convert a trained typical neural network into a branching network in which layers are properly shared or specialized for the involved categories. We propose a three-step branching strategy, dubbed Tree-Like Branching (TLB), to exploit network sharing and branching for multi-class classification. TLB first mines inherent category relationships from a trained neural network in a layer-wise manner. It then determines the appropriate layer in the network at which specialized branches grow, to reconcile the conflicting decision patterns of different categories. Finally, TLB adopts knowledge distillation to train the derived branching network. Experiments on widely used benchmarks show that the tree-like network derived by TLB achieves higher accuracy and lower cost than prior models, while also exhibiting better interpretability.
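The abstract names knowledge distillation as the final training step but does not state the objective used. As a rough illustration only, the sketch below shows the generic temperature-scaled distillation loss (KL divergence between softened teacher and student outputs). The function names, the temperature value, and the example logits are assumptions made for illustration and are not taken from the paper.

```python
import math


def softmax(logits, temperature=1.0):
    """Temperature-scaled softmax over a list of logits."""
    scaled = [z / temperature for z in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(z - peak) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(student_logits, teacher_logits, temperature=2.0):
    """Generic distillation loss: T^2 * KL(teacher || student).

    Hypothetical helper; the paper's exact objective may differ.
    """
    teacher_probs = softmax(teacher_logits, temperature)
    student_probs = softmax(student_logits, temperature)
    kl = sum(
        p * math.log(p / q)
        for p, q in zip(teacher_probs, student_probs)
        if p > 0
    )
    return temperature ** 2 * kl


# Made-up logits for a 3-class example: the trained network acts as
# teacher, the derived branching network as student.
teacher = [2.0, 0.5, -1.0]
student = [1.0, 1.0, -0.5]
loss = distillation_loss(student, teacher)
```

The loss is zero when the student reproduces the teacher's logits and grows as their softened distributions diverge. In practice this term is often combined with a cross-entropy term on the ground-truth labels.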