Abstract: To make multilabel classification with many classes more tractable, recent work has pursued label space dimension reduction (LSDR). LSDR encodes high-dimensional label vectors into low-dimensional code vectors lying in a latent space, so that predictive models can be trained at much lower cost. At prediction time, it classifies an unseen instance by recovering a label vector from the instance's predicted code vector via a decoding process. In this paper, we propose a novel LSDR method, End-to-End Feature-aware label space Encoding (E2FE). Instead of requiring an encoding function like most previous works, E2FE directly learns a code matrix formed by the code vectors of the training instances in an end-to-end manner. Another distinct property of E2FE is its feature awareness: the code matrix is learned by jointly maximizing the recoverability of the label space and the predictability of the latent space. Based on the learned code matrix, E2FE then trains predictive models that map instance features to code vectors, and learns a linear decoding matrix for efficiently recovering the label vector of any unseen instance from its predicted code vector. Theoretical analyses show that both the code matrix and the linear decoding matrix in E2FE can be learned efficiently. Moreover, like previous works, E2FE can be specialized to learn an encoding function, and it can be extended with kernel tricks to handle nonlinear correlations between the feature space and the latent space. Comprehensive experiments on diverse benchmark data sets with many classes show consistent performance gains of E2FE over state-of-the-art methods.
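
As a rough illustration of the generic LSDR pipeline the abstract describes (encode labels into code vectors, train a model from features to codes, decode predicted codes back to labels), the sketch below uses a truncated SVD of the label matrix as the encoder and ridge regression as the predictive model. This is a simple stand-in, not the E2FE objective, which the abstract does not spell out. The function names `fit_lsdr` and `predict_lsdr`, the `ridge` parameter, and the 0.5 threshold are all assumptions made for illustration.

```python
import numpy as np

def fit_lsdr(X, Y, k, ridge=1e-3):
    """Generic LSDR sketch (NOT E2FE): SVD label encoding + ridge regression.

    X: (n_samples, n_features) feature matrix
    Y: (n_samples, n_labels) binary label matrix
    k: dimension of the latent (code) space
    """
    # Center the labels so codes capture variation around the mean label vector.
    y_mean = Y.mean(axis=0)
    Yc = Y - y_mean
    # Truncated SVD: the top-k right singular vectors span the latent space.
    _, _, Vt = np.linalg.svd(Yc, full_matrices=False)
    D = Vt[:k]            # (k, n_labels) linear decoding matrix
    C = Yc @ D.T          # (n_samples, k) code matrix of the training instances
    # Ridge regression mapping instance features to code vectors.
    d = X.shape[1]
    W = np.linalg.solve(X.T @ X + ridge * np.eye(d), X.T @ C)
    return W, D, y_mean

def predict_lsdr(X, W, D, y_mean, threshold=0.5):
    """Predict code vectors, decode them linearly, and threshold to labels."""
    C_hat = X @ W                  # predicted code vectors
    scores = C_hat @ D + y_mean    # decoded real-valued label scores
    return (scores >= threshold).astype(int)
```

Here the code matrix is a fixed linear function of the labels alone, so the encoding ignores the features; the feature-aware learning of the code matrix that the abstract attributes to E2FE is exactly what this baseline lacks.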

Labelset Anchored Subspace Ensemble (LASE) for Multi-Label Annotation

Multi-label Subspace Ensemble

Multi-Label Transfer Learning with Sparse Representation

Multi-label Learning via Structured Decomposition and Group Sparsity

Learning Label-Adaptive Representation for Large-Scale Multi-Label Text Classification

Leveraged Asymmetric Loss with Disambiguation for Multi-label Recognition with One-Positive Annotations

Active Learning with Label Correlation Exploration for Multi-Label Image Classification

A Label Embedding Method via Conditional Covariance Maximization for Multi-label Classification

Multi-Modal Image Annotation with Multi-Instance Multi-Label LDA

Semi-supervised Label Enhancement Via Structured Semantic Extraction

LLMaAA: Making Large Language Models as Active Annotators

Generalized Label Enhancement with Sample Correlations

End-to-End Feature-Aware Label Space Encoding for Multilabel Classification with Many Classes

Online Multi-Label Active Annotation

Label2Label: A Language Modeling Framework for Multi-attribute Learning

Label Attention Network for Structured Prediction

Label Distribution Learning on Auxiliary Label Space Graphs for Facial Expression Recognition

Sequence Multi-Labeling: A Unified Video Annotation Scheme with Spatial and Temporal Context

Ensemble Approach Based on Conditional Random Field for Multi-Label Image and Video Annotation

Data Augmentation For Label Enhancement

Label-Assemble: Leveraging Multiple Datasets with Partial Labels