Abstract:Text categorization has become an increasingly important issue for businesses that handle massive volumes of data generated online, and it has found substantial use in the field of NLP. The capacity to group texts into separate categories is crucial for users to effectively retain and utilize important information. Our goal is to improve upon existing recurrent neural network (RNN) techniques for text classification by creating a deep learning strategy through our study. Raising the quality of the classifications made is the main difficulty in text classification, nevertheless, as the overall efficacy of text classification is often hampered by the data semantics' inadequate context sensitivity. Our study presents a unified approach to examine the effects of word embedding and the GRU on text classification to address this difficulty. In this study, we use the TREC standard dataset. RCNN has four convolution layers, four LSTM levels, and two GRU layers. RNN, on the other hand, has four GRU layers and four LSTM levels. One kind of recurrent neural network (RNN) that is well-known for its comprehension of sequential data is the gated recurrent unit (GRU). We found in our tests that words with comparable meanings are typically found near each other in embedding spaces. The trials' findings demonstrate that our hybrid GRU model is capable of efficiently picking up word usage patterns from the provided training set. Remember that the depth and breadth of the training data greatly influence the model's effectiveness. Our suggested method performs remarkably well when compared to other well-known recurrent algorithms such as RNN, MV-RNN, and LSTM on a single benchmark dataset. In comparison to the hybrid GRU's F-measure 0.952, the proposed model's F-measure is 0.982%. We compared the performance of the proposed method to that of the three most popular recurrent neural network designs at the moment RNNs, MV-RNNs, and LSTMs, and found that the new method achieved better results on two benchmark datasets, both in terms of accuracy and error rate.

Turkish Text Classification: From Lexicon Analysis to Bidirectional Transformer

Improving Turkish Text Sentiment Classification Through Task-Specific and Universal Transformations: An Ensemble Data Augmentation Approach

Turkish Medical Text Classification Using BERT

Albanian Text Classification: Bag of Words Model and Word Analogies

Advancing natural language processing (NLP) applications of morphologically rich languages with bidirectional encoder representations from transformers (BERT): an empirical case study for Turkish

Fine-tuning Transformer-based Encoder for Turkish Language Understanding Tasks

Switching Self-Attention Text Classification Model with Innovative Reverse Positional Encoding for Right-to-Left Languages: A Focus on Arabic Dialects

Comparison of Turkish Word Representations Trained on Different Morphological Forms

Large Scale Legal Text Classification Using Transformer Models

Development of Deep Learning Based Natural Language Processing Model for Turkish

Expanding the Text Classification Toolbox with Cross-Lingual Embeddings

Comparison of Pre-trained Language Models for Turkish Address Parsing

A Survey of Text Classification With Transformers: How Wide? How Large? How Long? How Accurate? How Expensive? How Safe?

A Survey on Text Classification Algorithms: From Text to Predictions

BERT2D: Two Dimensional Positional Embeddings for Efficient Turkish NLP

Harnessing the Intrinsic Knowledge of Pretrained Language Models for Challenging Text Classification Settings

ConvTextTM: An Explainable Convolutional Tsetlin Machine Framework for Text Classification.

A Hybrid Deep Learning GRU based Approach for Text Classification using Word Embedding

Transformers for Multi-label Classification of Medical Text: An Empirical Comparison

CONSENT: Context Sensitive Transformer for Bold Words Classification

Optimizing Multi-Class Text Classification: A Diverse Stacking Ensemble Framework Utilizing Transformers