Hierarchical Convolutional Neural Network with Knowledge Complementation for Long-Tailed Classification

Hong Zhao,Zhengyu Li,Wenwei He,Yan Zhao
DOI: https://doi.org/10.1145/3653717
IF: 4.157
2024-03-22
ACM Transactions on Knowledge Discovery from Data
Abstract:Existing methods based on transfer learning leverage auxiliary information to help tail generalization and improve the performance of the tail classes. However, they cannot fully exploit the relationships between auxiliary information and tail classes and bring irrelevant knowledge to the tail classes. To solve this problem, we propose a hierarchical CNN with knowledge complementation, which regards hierarchical relationships as auxiliary information and transfers relevant knowledge to tail classes. First, we integrate semantics and clustering relationships as hierarchical knowledge into the CNN to guide feature learning. Then, we design a complementary strategy to jointly exploit the two types of knowledge, where semantic knowledge acts as a prior dependence and clustering knowledge reduces the negative information caused by excessive semantic dependence (i.e., semantic gaps). In this way, the CNN facilitates the utilization of the two complementary hierarchical relationships and transfers useful knowledge to tail data to improve long-tailed classification accuracy. Experimental results on public benchmarks show that the proposed model outperforms existing methods. In particular, our model improves accuracy by 3.46% compared with the second-best method on the long-tailed tieredImageNet dataset.
computer science, information systems, software engineering
What problem does this paper attempt to address?
The paper primarily proposes a new solution to the long-tail classification problem. Specifically, it focuses on handling the extremely imbalanced class sample distribution in deep learning models, particularly Convolutional Neural Networks (CNNs), where the number of samples in a few head classes far exceeds that in a large number of tail classes. This imbalance can lead the model to accurately predict head classes while ignoring tail classes, thereby affecting overall classification performance. To address the above issue, the authors propose a Hierarchical Convolutional Neural Network (HCKC) that leverages the concept of knowledge completion. The main contributions of this approach are: 1. **Exploration of Clustering and Semantic Relationships**: By establishing hierarchical superclasses (including both semantic-based and clustering-based superclasses), the method aids in feature learning for tail classes. These hierarchical structures not only provide a physical explanation of the relationships between classes but also serve as auxiliary information to improve the performance of tail classes. 2. **Addressing the Semantic Gap Problem**: It is demonstrated that clustering relationships can help alleviate the semantic gap problem present in hierarchical relationships, guiding the long-tail classification task from both data and label perspectives. 3. **Proposing a Complementary Strategy**: By combining semantic and clustering knowledge, the method helps tail classes learn more diverse knowledge, thereby enhancing feature diversity. This includes first transferring semantic knowledge to tail data to learn relevant features, and then using clustering knowledge to mitigate the negative impact caused by the semantic gap. In summary, the paper aims to improve the performance of tail classes in long-tail classification problems by introducing a hierarchical convolutional neural network and a knowledge completion strategy. Especially in deep learning models, it utilizes semantic and clustering information to enhance the generalization ability of tail classes. Experimental results show that the proposed model outperforms existing methods on multiple public benchmark datasets.