Abstract:Hierarchical structures of labels usually exist in large-scale classification tasks, where labels can be organized into a tree-shaped structure. The nodes near the root stand for coarser labels, while the nodes close to leaves mean the finer labels. We label unseen samples from the root node to a leaf node, and obtain multigranularity predictions in the hierarchical classification. Sometimes, we cannot obtain a leaf decision due to uncertainty or incomplete information. In this case, we should stop at an internal node, rather than going ahead rashly. However, most existing hierarchical classification models aim at maximizing the percentage of correct predictions, and do not take the risk of misclassifications into account. Such risk is critically important in some real-world applications, and can be measured by the distance between the ground truth and the predicted classes in the class hierarchy. In this work, we utilize the semantic hierarchy to define the classification risk and design an optimization technique to reduce such risk. By defining the conservative risk and the precipitant risk as two competing risk factors, we construct the balanced conservative/precipitant semantic (BCPS) risk matrix across all nodes in the semantic hierarchy with user-defined weights to adjust the tradeoff between two kinds of risks. We then model the classification process on the semantic hierarchy as a sequential decision-making task. We design an algorithm to derive the risk-minimized predictions. There are two modules in this model: 1) multitask hierarchical learning and 2) deep reinforce multigranularity learning. The first one learns classification confidence scores of multiple levels. These scores are then fed into deep reinforced multigranularity learning for obtaining a global risk-minimized prediction with flexible granularity. Experimental results show that the proposed model outperforms state-of-the-art methods on seven large-scale classification datasets with the sema-tic tree.

Inducing Semantic Hierarchy Structure in Empirical Risk Minimization with Optimal Transport Measures

Embedding Semantic Hierarchy in Discrete Optimal Transport for Risk Minimization

Hierarchical Semantic Risk Minimization for Large-Scale Classification

Learning Structured Representations by Embedding Class Hierarchy with Fast Optimal Transport

Relative Entropic Optimal Transport: a (Prior-Aware) Matching Perspective to (unbalanced) Classification

Learning Ultrametric Trees for Optimal Transport Regression

SP$^2$OT: Semantic-Regularized Progressive Partial Optimal Transport for Imbalanced Clustering

Tilted Cross Entropy (TCE): Promoting Fairness in Semantic Segmentation

Hierarchical Optimal Transport for Multimodal Distribution Alignment

Optimizing the Dice Score and Jaccard Index for Medical Image Segmentation: Theory & Practice

Neural Estimation Of Entropic Optimal Transport

Importance-Aware Semantic Segmentation in Self-Driving with Discrete Wasserstein Training

Semantic Correspondence as an Optimal Transport Problem

Modeling Hierarchical Structural Distance for Unsupervised Domain Adaptation

Relaxed Earth Mover's Distances for Chain- and Tree-connected Spaces and their use as a Loss Function in Deep Learning

Semi-Discrete Optimal Transport: Nearly Minimax Estimation With Stochastic Gradient Descent and Adaptive Entropic Regularization

On the potential of Optimal Transport in Geospatial Data Science

OTCMR: Bridging Heterogeneity Gap with Optimal Transport for Cross-modal Retrieval

Double-Bounded Optimal Transport for Advanced Clustering and Classification

Transferability Estimation for Semantic Segmentation Task