Abstract:The rise in web and social media interactions has resulted in the efortless proliferation of offensive language and hate speech. Such online harassment, insults, and attacks are commonly termed cyberbullying. The sheer volume of user-generated content has made it challenging to identify such illicit content. Machine learning has wide applications in text classification, and researchers are shifting towards using deep neural networks in detecting cyberbullying due to the several advantages they have over traditional machine learning algorithms. This paper proposes a novel neural network framework with parameter optimization and an algorithmic comparative study of eleven classification methods: four traditional machine learning and seven shallow neural networks on two real world cyberbullying datasets. In addition, this paper also examines the effect of feature extraction and word-embedding-techniques-based natural language processing on algorithmic performance. Key observations from this study show that bidirectional neural networks and attention models provide high classification results. Logistic Regression was observed to be the best among the traditional machine learning classifiers used. Term Frequency-Inverse Document Frequency (TF-IDF) demonstrates consistently high accuracies with traditional machine learning techniques. Global Vectors (GloVe) perform better with neural network models. Bi-GRU and Bi-LSTM worked best amongst the neural networks used. The extensive experiments performed on the two datasets establish the importance of this work by comparing eleven classification methods and seven feature extraction techniques. Our proposed shallow neural networks outperform existing state-of-the-art approaches for cyberbullying detection, with accuracy and F1-scores as high as ~95% and ~98%, respectively.

Detecting Hostile Posts using Relational Graph Convolutional Network

Hostility Detection in Hindi leveraging Pre-Trained Language Models

Divide and Conquer: An Ensemble Approach for Hostile Post Detection in Hindi

Coarse and Fine-Grained Hostility Detection in Hindi Posts using Fine Tuned Multilingual Embeddings

Evaluation of Deep Learning Models for Hostility Detection in Hindi Text

Walk in Wild: An Ensemble Approach for Hostility Detection in Hindi Posts

Combating Hostility: Covid-19 Fake News and Hostile Post Detection in Social Media

Hostility Detection and Covid-19 Fake News Detection in Social Media

Relational Graph Convolutional Networks for Sentiment Analysis

BotRGCN: Twitter Bot Detection with Relational Graph Convolutional Networks

A hybrid convolutional neural network for sarcasm detection from multilingual social media posts

An advanced learning approach for detecting sarcasm in social media posts: Theory and solutions

Social media content classification and community detection using deep learning and graph analytics

Hostility Detection Dataset in Hindi

Harnessing Pre-Trained Sentence Transformers for Offensive Language Detection in Indian Languages

Hypers at ComMA@ICON: Modelling Aggressiveness, Gender Bias and Communal Bias Identification

Modelling Social Context for Fake News Detection: A Graph Neural Network Based Approach

Task Adaptive Pretraining of Transformers for Hostility Detection

Cyberbullying Detection: Hybrid Models Based on Machine Learning and Natural Language Processing Techniques

ALPHA-CHLORALOSE AS A CAPTURE AND RESTRAINT AGENT OF BIRDS: THERAPEUTIC INDEX DETERMINATION IN THE CHICKEN

Misogynistic Meme Detection using Early Fusion Model with Graph Network