Abstract:Sentiment analysis also referred to as opinion mining, plays a significant role in automating the identification of negative, positive, or neutral sentiments expressed in textual data. The proliferation of social networks, review sites, and blogs has rendered these platforms valuable resources for mining opinions. Sentiment analysis finds applications in various domains and languages, including English and Arabic. However, Arabic presents unique challenges due to its complex morphology characterized by inflectional and derivation patterns. To effectively analyze sentiment in Arabic text, sentiment analysis techniques must account for this intricacy. This paper proposes a model designed using the transformer model and deep learning (DL) techniques. The word embedding is represented by Transformer-based Model for Arabic Language Understanding (ArabBert), and then passed to the AraBERT model. The output of AraBERT is subsequently fed into a Long Short-Term Memory (LSTM) model, followed by feedforward neural networks and an output layer. AraBERT is used to capture rich contextual information and LSTM to enhance sequence modeling and retain long-term dependencies within the text data. We compared the proposed model with machine learning (ML) algorithms and DL algorithms, as well as different vectorization techniques: term frequency-inverse document frequency (TF-IDF), ArabBert, Continuous Bag-of-Words (CBOW), and skipGrams using four Arabic benchmark datasets. Through extensive experimentation and evaluation of Arabic sentiment analysis datasets, we showcase the effectiveness of our approach. The results underscore significant improvements in sentiment analysis accuracy, highlighting the potential of leveraging transformer models for Arabic Sentiment Analysis. The outcomes of this research contribute to advancing Arabic sentiment analysis, enabling more accurate and reliable sentiment analysis in Arabic text. The findings reveal that the proposed framework exhibits exceptional performance in sentiment classification, achieving an impressive accuracy rate of over 97%.

Arabic dialect identification in social media: A hybrid model with transformer models and BiLSTM

Yet Another Model for Arabic Dialect Identification

Multi-Dialect Arabic BERT for Country-Level Dialect Identification

Arabic Tweet Act: A Weighted Ensemble Pre-Trained Transformer Model for Classifying Arabic Speech Acts on Twitter

Arabic Dialect Identification in the Wild

OSN-MDAD: Machine Translation Dataset for Arabic Multi-Dialectal Conversations on Online Social Media

Automatic Dialect Detection in Arabic Broadcast Speech

Exploiting Dialect Identification in Automatic Dialectal Text Normalization

Annotation and evaluation of a dialectal Arabic sentiment corpus against benchmark datasets using transformers

Adapting MARBERT for Improved Arabic Dialect Identification: Submission to the NADI 2021 Shared Task

On the Robustness of Arabic Speech Dialect Identification

MIT-QCRI Arabic Dialect Identification System for the 2017 Multi-Genre Broadcast Challenge

Heterogeneous Ensemble Deep Learning Model for Enhanced Arabic Sentiment Analysis

ArabBert-LSTM: improving Arabic sentiment analysis based on transformer model and Long Short-Term Memory

Automatic Arabic Dialect Identification Systems for Written Texts: A Survey

A Comparative Study of Deep Learning Approaches for Arabic Language Processing

Designing a System to Recognize Main Arabic Dialects

ALDi: Quantifying the Arabic Level of Dialectness of Text

A Parameter-Efficient Learning Approach to Arabic Dialect Identification with Pre-Trained General-Purpose Speech Model

Post-hoc analysis of Arabic transformer models

BERT-Based Arabic Social Media Author Profiling