Abstract: Multi-label text classification faces challenges such as sample diversity, sample complexity, and the need to make effective use of label correlations. In this paper, we propose a model that integrates multi-granularity fusion of text sequence features with label semantic correlation information. The model uses graph convolutional networks to extract label semantic correlations, which improves classification of samples with similar labels and mitigates label omission. In addition, text convolutional neural networks extract multi-granularity sense group features from text sequences; the model computes the similarity between these features and the semantically correlated label distribution, and dynamically adjusts the similarity between text context and label information. This addresses the limitations of feature extraction in short texts and reduces label confusion. During training, we replace the original multi-hot label encoding with a label distribution that fuses the multi-granularity sense group features of the text with label correlation information, giving a more precise encoding for soft alignment based on label probability distributions. This makes the model more resilient to noisy data, because it avoids assigning high-confidence probabilities to incorrect categories as a result of hard-coded supervision. On noisy datasets, the model's performance improvement significantly surpasses that achieved by label smoothing. Extensive experiments on three legal text datasets and two general multi-label datasets demonstrate the model's excellent performance. The approach is applicable to real-world scenarios such as legal judgment prediction, news categorization, and recommendation systems, where accurate multi-label classification is crucial. Ablation studies and experiments on noisy datasets validate the model's effectiveness and robustness.
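The replacement of multi-hot targets by a fused label distribution can be illustrated with a small NumPy sketch. This is not the authors' implementation: the abstract gives no formulas, so the single graph-convolution step, the dot-product similarity, the mixing weight `alpha`, the KL-divergence loss, and all function and variable names below are assumptions chosen for illustration. Random vectors stand in for the text CNN features and the initial label embeddings.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def gcn_layer(adj, label_emb, weight):
    # One graph-convolution step over the label graph (assumed form):
    # symmetric-normalized adjacency with self-loops, then a linear map and ReLU.
    a_hat = adj + np.eye(adj.shape[0])
    d_inv_sqrt = 1.0 / np.sqrt(a_hat.sum(axis=1))
    a_norm = a_hat * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]
    return np.maximum(a_norm @ label_emb @ weight, 0.0)

def soft_label_distribution(text_feat, label_feat, multi_hot, alpha=4.0):
    # Similarity between each text and each label, biased toward the
    # annotated labels by alpha (hypothetical weight), then normalized.
    sim = text_feat @ label_feat.T  # shape: (batch, num_labels)
    return softmax(sim + alpha * multi_hot, axis=-1)

def kl_divergence(target, pred, eps=1e-12):
    # Mean KL(target || pred) over the batch.
    return np.sum(target * (np.log(target + eps) - np.log(pred + eps)), axis=-1).mean()

rng = np.random.default_rng(0)
num_labels, dim, batch = 4, 8, 2

# Toy label co-occurrence graph: labels 0-1 and 1-2 co-occur, label 3 is isolated.
adj = np.array([[0, 1, 0, 0],
                [1, 0, 1, 0],
                [0, 1, 0, 0],
                [0, 0, 0, 0]], dtype=float)

label_emb = rng.normal(size=(num_labels, dim))   # stand-in label embeddings
weight = rng.normal(size=(dim, dim)) * 0.1       # stand-in GCN weight
label_feat = gcn_layer(adj, label_emb, weight)   # correlation-aware label features

text_feat = rng.normal(size=(batch, dim))        # stand-in for text CNN features
multi_hot = np.array([[1, 1, 0, 0],
                      [0, 0, 0, 1]], dtype=float)

target = soft_label_distribution(text_feat, label_feat, multi_hot)
pred = softmax(rng.normal(size=(batch, num_labels)))  # stand-in model output
loss = kl_divergence(target, pred)
```

In this sketch each row of `target` is a probability distribution over labels rather than a 0/1 vector, so a mislabeled sample does not force all of the probability mass onto the wrong categories, which is the intuition behind the robustness to noise described above.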

Multi-Field Learning For Email Spam Filtering

Large-Margin Classification for Combating Disguise Attacks on Spam Filters

Extracting discriminative information from e-mail for spam detection inspired by Immune System

A Local-Concentration-Based Feature Extraction Approach for Spam Filtering

Feature Construction Approach for Email Categorization Based on Term Space Partition

Ensemble Decision for Spam Detection Using Term Space Partition Approach

Classification of Spam Emails through Hierarchical Clustering and Supervised Learning

A Late Multi-Modal Fusion Model for Detecting Hybrid Spam E-mail

Online Learning of Multiple Tasks and Their Relationships: Testing on Spam Email Data and EEG Signals Recorded in Construction Fields

Spam-T5: Benchmarking Large Language Models for Few-Shot Email Spam Detection

Multi-task Vector Field Learning

Effective spam filter based on a hybrid method of header checking and content parsing

An Advanced Deep Attention Collaborative Mechanism for Secure Educational Email Services

Enhancing Label Correlation Feedback in Multi-Label Text Classification via Multi-Task Learning

Training SpamAssassin with Active Semi-supervised Learning

Variable Length Concentration Based Feature Construction Method for Spam Detection

Term Space Partition Based Ensemble Feature Construction For Spam Detection

MFLSCI: Multi-granularity fusion and label semantic correlation information for multi-label legal text classification

An Optimized Approach for Detection and Classification of Spam Email's Using Ensemble Methods

Intelligent Detection Approaches for Spam

An Adaptive Concentration Selection Model for Spam Detection