Abstract:Technological developments over the past few decades have changed the way people communicate, with platforms like social media and blogs becoming vital channels for international conversation. Even though hate speech is vigorously suppressed on social media, it is still a concern that needs to be constantly recognized and observed. The Arabic language poses particular difficulties in the detection of hate speech, despite the considerable efforts made in this area for English-language social media content. Arabic calls for particular consideration when it comes to hate speech detection because of its many dialects and linguistic nuances. Another degree of complication is added by the widespread practice of "code-mixing," in which users merge various languages smoothly. Recognizing this research vacuum, the study aims to close it by examining how well machine learning models containing variation features can detect hate speech, especially when it comes to Arabic tweets featuring code-mixing. Therefore, the objective of this study is to assess and compare the effectiveness of different features and machine learning models for hate speech detection on Arabic hate speech and code-mixing hate speech datasets. To achieve the objectives, the methodology used includes data collection, data pre-processing, feature extraction, the construction of classification models, and the evaluation of the constructed classification models. The findings from the analysis revealed that the TF-IDF feature, when employed with the SGD model, attained the highest accuracy, reaching 98.21%. Subsequently, these results were contrasted with outcomes from three existing studies, and the proposed method outperformed them, underscoring the significance of the proposed method. Consequently, our study carries practical implications and serves as a foundational exploration in the realm of automated hate speech detection in text.

An Effective Approach for Rumor Detection of Arabic Tweets Using eXtreme Gradient Boosting Method

A Gradient Tree Boosting based Approach to Rumor Detecting on Sina Weibo

Comparing the Random Forest vs. Extreme Gradient Boosting using Cuckoo Search Optimizer for Detecting Arabic Cyberbullying

Enhancing prediction of user stance for social networks rumors

DETECTION OF FAKE NEWS ON TWITTER USING MACHINE LEARNING: AN XGBOOST-BASED APPROACH WITH SENTIMENT AND SOURCE CHARACTERISTIC ANALYSIS

Sentiment Analysis of Arab Tweets: Unveiling Public Opinion Trends Using Machine Learning

A Comprehensive Low and High-level Feature Analysis for Early Rumor Detection on Twitter

Code-mixing unveiled: Enhancing the hate speech detection in Arabic dialect tweets using machine learning models

Modified Genetic Algorithm for Feature Selection and Hyper Parameter Optimization: Case of XGBoost in Spam Prediction

Calling to CNN-LSTM for Rumor Detection: A Deep Multi-channel Model for Message Veracity Classification in Microblogs

Detection on early dynamic rumor influence and propagation using biogeography-based optimization with deep learning approaches

Arabic Sentiment Analysis for ChatGPT Using Machine Learning Classification Algorithms: A Hyperparameter Optimization Technique

A machine learning-based approach for sentiment analysis on distance learning from Arabic Tweets

Rumor Detection and Classification for Twitter Data

A Bi-GRU-DSA-based social network rumor detection approach

Rumour Detection and Analysis on Twitter

Detecting Arabic Cyberbullying Tweets Using Machine Learning

Identifying Possible Rumor Spreaders on Twitter: A Weak Supervised Learning Approach

Using Gaussian Processes for Rumour Stance Classification in Social Media

Ensemble based high performance deep learning models for fake news detection

An Information Diffusion Approach to Rumor Propagation and Identification on Twitter