Abstract:Text categorization has become an increasingly important issue for businesses that handle massive volumes of data generated online, and it has found substantial use in the field of NLP. The capacity to group texts into separate categories is crucial for users to effectively retain and utilize important information. Our goal is to improve upon existing recurrent neural network (RNN) techniques for text classification by creating a deep learning strategy through our study. Raising the quality of the classifications made is the main difficulty in text classification, nevertheless, as the overall efficacy of text classification is often hampered by the data semantics' inadequate context sensitivity. Our study presents a unified approach to examine the effects of word embedding and the GRU on text classification to address this difficulty. In this study, we use the TREC standard dataset. RCNN has four convolution layers, four LSTM levels, and two GRU layers. RNN, on the other hand, has four GRU layers and four LSTM levels. One kind of recurrent neural network (RNN) that is well-known for its comprehension of sequential data is the gated recurrent unit (GRU). We found in our tests that words with comparable meanings are typically found near each other in embedding spaces. The trials' findings demonstrate that our hybrid GRU model is capable of efficiently picking up word usage patterns from the provided training set. Remember that the depth and breadth of the training data greatly influence the model's effectiveness. Our suggested method performs remarkably well when compared to other well-known recurrent algorithms such as RNN, MV-RNN, and LSTM on a single benchmark dataset. In comparison to the hybrid GRU's F-measure 0.952, the proposed model's F-measure is 0.982%. We compared the performance of the proposed method to that of the three most popular recurrent neural network designs at the moment RNNs, MV-RNNs, and LSTMs, and found that the new method achieved better results on two benchmark datasets, both in terms of accuracy and error rate.

Text classification framework for short text based on TFIDF-FastText

A multiclass classification framework for document categorization

Fast text categorization based on collaborative work in the semantic and class spaces

Improving Short Text Classification Through Better Feature Space Selection

A hybrid approach of Weighted Fine-Tuned BERT extraction with deep Siamese Bi – LSTM model for semantic text similarity identification

Short Text Classification Based on Strong Feature Thesaurus

A Fuzzy Similarity Based Concept Mining Model for Text Classification

Chinese text classification method using FastText and term frequency-inverse document frequency optimization

Context‐aware text classification system to improve the quality of text: A detailed investigation and techniques

FastClass: A Time-Efficient Approach to Weakly-Supervised Text Classification

Context-Preserving Hashing for Fast Text Classification.

A technical study and analysis on fuzzy similarity based models for text classification

Short text classification by detecting information path.

A multi-semantic passing framework for semi-supervised long text classification

Does a Hybrid Neural Network based Feature Selection Model Improve Text Classification?

Semantic similarity-aware feature selection and redundancy removal for text classification using joint mutual information

A Multimodel-Based Deep Learning Framework for Short Text Multiclass Classification with the Imbalanced and Extremely Small Data Set

A Comparative Study on TF-IDF feature Weighting Method and its Analysis using Unstructured Dataset

A Framework For Refining Text Classification and Object Recognition from Academic Articles

A hybrid medical text classification framework: Integrating attentive rule construction and neural network

A Hybrid Deep Learning GRU based Approach for Text Classification using Word Embedding