Abstract:Multi-label image classification is of significant interest due to its major role in real-world web image analysis applications such as large-scale image retrieval and browsing. Recently, matrix completion (MC) has been developed to deal with multi-label classification tasks. MC has distinct advantages, such as robustness to missing entries in the feature and label spaces and a natural ability to handle multi-label problems. However, current MC-based multi-label image classification methods only consider data represented by a single view feature, therefore, do not precisely characterize images that contain several semantic concepts. An intuitive way to utilize multiple features taken from different views is to concatenate the different features into a long vector; however, this concatenation is prone to over-fitting and leads to high time complexity in MC-based image classification. Therefore, we present a novel multi-view learning model for MC based image classification, called low-rank multi-view matrix completion (lrMMC), which first seeks a low-dimensional common representation of all views by utilizing the proposed low-rank multi-view learning (lrMVL) algorithm. In lrMVL, the common subspace is constrained to be low rank so that it is suitable for MC. In addition, combination weights are learned to explore complementarity between different views. An efficient solver based on fixed-point continuation (FPC) is developed for optimization, and the learned low-rank representation is then incorporated into MC-based image classification. Extensive experimentation on the challenging PASCAL VOC' 07 dataset demonstrates the superiority of lrMMC compared to other multi-label image classification approaches.

SADCMF: Self-Attentive Deep Consistent Matrix Factorization for Micro-Video Multi-Label Classification

Dual-domain Aligned Deep Hierarchical Matrix Factorization Method for Micro-video Multi-label Classification

Sentiment Analysis Using Deep Robust Complementary Fusion of Multi-Features and Multi-Modalities.

Common-Individual Semantic Fusion for Multi-View Multi-Label Learning

Context-aware focal alignment network for micro-video multi-label classification

Multimodal Progressive Modulation Network for Micro-video Multi-label Classification

Multimodal Attentive Representation Learning for Micro-video Multi-label Classification

Self-supervised Deep Partial Adversarial Network for Micro-Video Multimodal Classification

Self-Supervised Discriminative Feature Learning for Deep Multi-View Clustering

Ensemble Multi-Instance Multi-Label Learning Approach for Video Annotation Task

Semi-supervised multi-instance multi-label learning for video annotation task.

Fusing Multi-Stream Deep Networks for Video Classification

Sparse MDMO: Learning a Discriminative Feature for Micro-Expression Recognition

Low-Rank Multi-View Learning in Matrix Completion for Multi-Label Image Classification

Deep Collective Matrix Factorization for Augmented Multi-View Learning

Boosting Video Representation Learning with Multi-Faceted Integration

Multi-Stream Multi-Class Fusion of Deep Networks for Video Classification.

Cross-modal Fusion for Multi-Label Image Classification with Attention Mechanism

MULTI-LABEL IMAGE RECOGNITION WITH JOINT CLASS-AWARE MAP DISENTANGLING AND LABEL CORRELATION EMBEDDING

Sparse Concept Discriminant Matrix Factorization for Image Representation.

Deep Multi-task Multi-label CNN for Effective Facial Attribute Classification