Abstract:Learning from a label distribution has achieved promising results on ordinal regression tasks such as facial age and head pose estimation wherein, the concept of adaptive label distribution learning (ALDL) has drawn lots of attention recently for its superiority in theory. However, compared with the methods assuming fixed form label distribution, ALDL methods have not achieved better performance. We argue that existing ALDL algorithms do not fully exploit the intrinsic properties of ordinal regression. In this paper, we emphatically summarize that learning an adaptive label distribution on ordinal regression tasks should follow three principles. First, the probability corresponding to the ground-truth should be the highest in label distribution. Second, the probabilities of neighboring labels should decrease with the increase of distance away from the ground-truth, i.e., the distribution is unimodal. Third, the label distribution should vary with samples changing, and even be distinct for different instances with the same label, due to the different levels of difficulty and ambiguity. Under the premise of these principles, we propose a novel loss function for fully adaptive label distribution learning, namely unimodal-concentrated loss. Specifically, the unimodal loss derived from the learning to rank strategy constrains the distribution to be unimodal. Furthermore, the estimation error and the variance of the predicted distribution for a specific sample are integrated into the proposed concentrated loss to make the predicted distribution maximize at the ground-truth and vary according to the predicting uncertainty. Extensive experimental results on typical ordinal regression tasks including age and head pose estimation, show the superiority of our proposed unimodal-concentrated loss compared with existing loss functions.

Computationally Efficient Wasserstein Loss for Structured Labels

An Empirical Study of Self-Supervised Learning with Wasserstein Distance

Label Distribution Learning: A Local Collaborative Mechanism

Label Distribution Learning Forests

Adaptive Weighted Ranking-Oriented Label Distribution Learning

Unimodal-Concentrated Loss: Fully Adaptive Label Distribution Learning for Ordinal Regression

Label Distribution Learning by Exploiting Label Correlations.

Label Distribution Learning by Optimal Transport.

Label Distribution Learning using the Squared Neural Family on the Probability Simplex

Latent Semantics Encoding for Label Distribution Learning.

Inaccurate Label Distribution Learning

Incomplete Label Distribution Learning Via Label Correlation Decomposition

Supervised Tree-Wasserstein Distance

Probabilistic Label Tree for Streaming Multi-Label Learning

Label distribution learning by maintaining label ranking relation

Label Distribution Learning with Noisy Labels Via Three-Way Decisions

Controlling Wasserstein Distances by Kernel Norms with Application to Compressive Statistical Learning

Distribution-Balanced Loss for Multi-label Classification in Long-Tailed Datasets

Unified Framework for Learning with Label Distribution

Fast unsupervised ground metric learning with tree-Wasserstein distance