A Label Embedding Algorithm Based on Maximizing Normalized Cross-Covariance Operator

Yulin Xue,Yuchen Pan,Tao Peng,Jun Li,Jianhua Xu
DOI: https://doi.org/10.1007/978-3-031-68309-1_17
2024-01-01
Abstract:Multi-label classification studies a problem where each instance is associated with multiple relevant labels, which leads to the exponential growth of output space. To address this issue of high-dimensional label space, dimensionality reduction strategy originally applied to feature space is also used in label space, known as label space dimensionality reduction (LSDR). One popular strategy to implement LSDR is label embedding (LE), which encodes the original high-dimensional label vector into a low-dimensional vector linearly or nonlinearly. In this paper, We investigate the normalized cross-covariance operator (NOCCO), which originally is a kernel-based measure of the dependency between features and labels, whose empirical estimator is described as a trace operation including two inverse matrices of feature and label kernels plus a predefined regularization constant. We specifically designed an approximate and symmetric form of this operator for linear LE, which is maximized under orthonormal constraints, resulting in a novel eigenvalue problem for linear LE. The solution to this eigenvalue problem produces our compression matrix, and its transpose as our recovery matrix. Our proposed novel linear LE method based on maximizing normalized cross-covariance operator is termed as LEMCCO for short. The experiments on four benchmark data sets with more than 100 labels demonstrate that our proposed method is statistically superior to four state-of-the-art LE methods on the basis of two performance evaluation metrics.
What problem does this paper attempt to address?