Incomplete Multi-View Weak-Label Learning with Noisy Features and Imbalanced Labels

Zhiwei Li, Zijian Yang, Lu Sun, Mineichi Kudo, Keigo Kimura
2023-08-29
Abstract: A variety of modern applications exhibit multi-view multi-label learning, where each sample has multi-view features, and multiple labels are correlated via common views. Current methods usually fail to directly deal with the setting where only a subset of features and labels are observed for each sample, and ignore the presence of noisy views and imbalanced labels in real-world problems. In this paper, we propose a novel method to overcome the limitations. It jointly embeds incomplete views and weak labels into a low-dimensional subspace with adaptive weights, and facilitates the difference between embedding weight matrices via auto-weighted Hilbert-Schmidt Independence Criterion (HSIC) to reduce the redundancy. Moreover, it adaptively learns view-wise importance for embedding to detect noisy views, and mitigates the label imbalance problem by focal loss. Experimental results on four real-world multi-view multi-label datasets demonstrate the effectiveness of the proposed method.
Machine Learning
What problem does this paper attempt to address?
The paper proposes a new method for multi-view multi-label learning. It tackles three key problems:

1. **Incomplete Data Handling**: In practical applications, only a subset of the features (views) and labels may be observed for each sample, and existing methods often cannot handle this setting directly.
2. **Presence of Noisy Views**: Real-world datasets often contain noisy views, which can interfere with the model's learning process.
3. **Label Imbalance Problem**: In multi-label datasets, positive and negative labels are often far from evenly distributed, which can degrade model performance.

To address these issues, the paper proposes a method named NAIL (Noisy features and Imbalanced labels). The main contributions of NAIL include:

- **Adaptive Weighting for Incomplete Multi-View Embedding**: Using the L2,1 norm to measure the reconstruction error of each view, and introducing adaptive weight parameters to detect noisy views.
- **Imbalanced Weak Label Embedding**: Employing focal loss to alleviate the label imbalance problem, so that the model focuses more on samples that are difficult to classify.
- **Correlation Modeling Based on Adaptive Weighted HSIC**: Using the Hilbert-Schmidt Independence Criterion (HSIC) to reduce redundancy between the embedding weight matrices of different views, further improving the model's generalization ability.

Experimental results on four real-world datasets show that NAIL performs well, especially in the presence of noisy views and label imbalance. Ablation studies further support the effectiveness and necessity of each component of NAIL. In summary, NAIL is a flexible and effective method for handling incomplete data in multi-view multi-label learning.
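The three ingredients named in this summary (an L2,1 reconstruction error, focal loss restricted to observed labels, and HSIC as a dependence measure) can be illustrated with a minimal NumPy sketch. This is not the paper's implementation: the function names, the `mask` argument, the default `gamma=2.0`, and the choice of linear kernels for HSIC are all assumptions made for illustration, and the adaptive weighting and optimization procedure of NAIL are not reproduced here.

```python
import numpy as np


def l21_norm(residual):
    """L2,1 norm: sum of the Euclidean norms of the rows of a matrix."""
    return float(np.sum(np.linalg.norm(residual, axis=1)))


def focal_loss(y_true, y_prob, mask, gamma=2.0, eps=1e-12):
    """Binary focal loss averaged over observed label entries only.

    y_true: 0/1 label matrix; y_prob: predicted probabilities of label 1;
    mask: 1 where a label is observed, 0 where it is missing (assumed encoding).
    """
    # p_t is the probability the model assigns to the true value of each entry.
    p_t = np.where(y_true == 1, y_prob, 1.0 - y_prob)
    # (1 - p_t)^gamma shrinks the loss of entries that are already well classified.
    per_entry = -((1.0 - p_t) ** gamma) * np.log(p_t + eps)
    return float(np.sum(per_entry * mask) / max(np.sum(mask), 1))


def linear_hsic(a, b):
    """Biased empirical HSIC with linear kernels.

    a and b must have the same number of rows; a value near 0 indicates
    little linear dependence between them.
    """
    n = a.shape[0]
    h = np.eye(n) - np.ones((n, n)) / n  # centering matrix
    k = a @ a.T                          # linear kernel on a
    l = b @ b.T                          # linear kernel on b
    return float(np.trace(k @ h @ l @ h) / (n - 1) ** 2)
```

With `gamma=0.0` the focal loss reduces to ordinary binary cross-entropy over the observed entries, which makes the down-weighting effect of larger `gamma` values easy to compare. An objective in the spirit of the summary would combine per-view `l21_norm` terms, the masked `focal_loss`, and pairwise `linear_hsic` penalties, each scaled by its own weight.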