A Hybrid Model Based on Convolutional Neural Network and Long Short-Term Memory for Multi-label Text Classification

Hamed Khataei Maragheh,Farhad Soleimanian Gharehchopogh,Kambiz Majidzadeh,Amin Babazadeh Sangar

DOI: https://doi.org/10.1007/s11063-024-11500-8

IF: 2.565

2024-02-17

Neural Processing Letters

Abstract:Multi-label text classification (MLTC) is a popular method for organizing electronic documents, which is crucial for accessing and processing data. As the number of classes increases, learning multi-label data will be challenging. The number of possible states for various labels increases exponentially, and learning algorithms in single-label data cannot be used to solve these problems. In the meantime, using single-label data algorithms could be very time-consuming. In MLTC, complexity costs should be reduced. Deep-learning neural networks that can learn intricate patterns are used in many real-world problems because of their high power and accuracy. This paper proposed a hybridization of the long short-term memory (LSTM) neural network and the convolutional neural network (CNN) method for MLTC. The proposed model uses LSTM to enhance CNN to improve the proposed model's accuracy. Also, the competitive search algorithm (CSA) is used to improve the LSTM hyperparameters. The LSTM hyperparameters play an important role in increasing the detection accuracy. The CSA algorithm finds the best values for the hyperparameters by searching the problem space. It was tested on four different datasets of multi-label texts: Reuters-21578, RCV1-v2, EUR-Lex, and Bookmarks. The result showed that the proposed model performed better than CNN and LSTM-CSA in terms of accuracy percentage and that it has improved by an average of more than 10%. Also, the results show that the LSTM-CSA model has higher detection accuracy compared to LSTM—Gradient-based optimizer (GBO) and LSTM—whale optimization algorithm (WOA).

computer science, artificial intelligence

What problem does this paper attempt to address?

The problem that this paper attempts to solve is the complexity and accuracy challenges in multi - label text classification (MLTC). As the number of categories increases, it becomes more and more difficult to learn multi - label data, because the number of possible states will grow exponentially, and traditional single - label data learning algorithms cannot effectively solve these problems. In addition, using single - label data algorithms can be very time - consuming. Therefore, the paper proposes a hybrid model based on convolutional neural network (CNN) and long - short - term memory network (LSTM) to solve the multi - label text classification problem, aiming to improve the accuracy and efficiency of the model while reducing the prediction error. Specifically, the goals of the paper include: - Using a hybrid model of CNN and LSTM to improve the accuracy of multi - label text classification. - Optimizing the hyper - parameters of LSTM through the competitive search algorithm (CSA) to further improve the detection accuracy. - Evaluating the performance of the proposed model on different multi - label text data sets, including Reuters - 21578, RCV1 - v2, EUR - Lex and Bookmarks. Through these goals, the paper hopes to provide a more efficient and accurate multi - label text classification method to meet the growing demand for electronic document organization.

A Hybrid Model Based on Convolutional Neural Network and Long Short-Term Memory for Multi-label Text Classification

A New Hybrid Based on Long Short-Term Memory Network with Spotted Hyena Optimization Algorithm for Multi-Label Text Classification

Research of multi-label text classification based on label attention and correlation networks

Multi-label text classification based on semantic-sensitive graph convolutional network

A Label Information Aware Model for Multi-label Text Classification

MsCoa: Multi-Step Co-Attention Model for Multi-Label Classification

Bi-LSTM Model to Increase Accuracy in Text Classification: Combining Word2vec CNN and Attention Mechanism

Semantic-Unit-Based Dilated Convolution for Multi-Label Text Classification.

A text classification method based on a convolutional and bidirectional long short-term memory model

Contrastive Learning-Enhanced Nearest Neighbor Mechanism for Multi-Label Text Classification.

Hierarchical Multi-Granularity Attention- Based Hybrid Neural Network for Text Classification.

MLGN:A Multi-Label Guided Network for Improving Text Classification

Multilabel Text Classification Using Multilayer DGAT

Enhancing Label Correlation Feedback in Multi-Label Text Classification via Multi-Task Learning

LA-HCN: Label-based Attention for Hierarchical Multi-label TextClassification Neural Network

A hybrid principal label space transformation-based ridge regression and decision tree for multi-label classification

Multi-Task Learning Model Based on Multi-Scale CNN and LSTM for Sentiment Classification

Hierarchical Multi-label Text Classification: An Attention-based Recurrent Network Approach

Hybrid Multi-Label Classification Model for Medical Applications Based on Adaptive Synthetic Data and Ensemble Learning

Hybrid embedding-based text representation for hierarchical multi-label text classification

Medical Text Classification Using Hybrid Deep Learning Models with Multihead Attention