Abstract:Offensive language detection is an ever-growing natural language processing (NLP) application. This growth is mainly because of the widespread usage of social networks, which becomes a mainstream channel for people to communicate, work, and enjoy entertainment content. Many incidents of sharing aggressive and offensive content negatively impacted society to a great extend. We believe contributing to improving and comparing different machine learning models to fight such harmful contents is an important and challenging goal for this thesis. We targeted the problem of offensive language detection for building efficient automated models for offensive language detection. With the recent advancements of NLP models, specifically, the Transformer model, which tackled many shortcomings of the standard seq-to-seq techniques. The BERT model has shown state-of-the-art results on many NLP tasks. Although the literature still exploring the reasons for the BERT achievements in the NLP field. Other efficient variants have been developed to improve upon the standard BERT, such as RoBERTa and ALBERT. Moreover, due to the multilingual nature of text on social media that could affect the model decision on a given tween, it is becoming essential to examine multilingual models such as XLM-RoBERTa trained on 100 languages and how did it compare to unilingual models. The RoBERTa based model proved to be the most capable model and achieved the highest F1 score for the tasks. Another critical aspect of a well-rounded offensive language detection system is the speed at which a model can be trained and make inferences. In that respect, we have considered the model run-time and fine-tuned the very efficient implementation of FastText called BlazingText that achieved good results, which is much faster than BERT-based models.

Chinese offensive language analysis based on Bidirectional Encoder Representation Transformer (BERT)

COLD: A Benchmark for Chinese Offensive Language Detection

Towards Evaluating the Robustness of Chinese BERT Classifiers

Enhancing Offensive Language Detection with Data Augmentation and Knowledge Distillation

Pashto offensive language detection: a benchmark dataset and monolingual Pashto BERT

Unsupervised offensive speech detection for multimedia based on multilingual BERT

KOLD: Korean Offensive Language Dataset

ToxiCloakCN: Evaluating Robustness of Offensive Language Detection in Chinese with Cloaking Perturbations

Speech Detection Task Against Asian Hate: BERT the Central, While Data-Centric Studies the Crucial

Neural Models for Offensive Language Detection

Cross-Cultural Transfer Learning for Chinese Offensive Language Detection

RoChBert: Towards Robust BERT Fine-tuning for Chinese

Chinese MentalBERT: Domain-Adaptive Pre-training on Social Media for Chinese Mental Health Text Analysis

UPB at SemEval-2020 Task 12: Multilingual Offensive Language Detection on Social Media by Fine-tuning a Variety of BERT-based Models

Enhanced Offensive Language Detection Through Data Augmentation

Chinese Offensive Language Detection:Current Status and Future Directions

BERT-ATTACK: Adversarial Attack Against BERT Using BERT

Cross-Linguistic Offensive Language Detection: BERT-Based Analysis of Bengali, Assamese, & Bodo Conversational Hateful Content from Social Media

Cross-lingual Offensive Language Detection: A Systematic Review of Datasets, Transfer Approaches and Challenges

Semi-Meta-Supervised Hate Speech Detection

Investigating cross-lingual training for offensive language detection