Abstract:Concerns about the frequency of harmful remarks have been raised by the growth of online communication platforms, which makes it difficult to create inclusive and safe digital spaces. This study explores the creation of a strong framework that uses machine learning algorithms and natural language processing (NLP) methods to categorise harmful comments. In order to improve the accuracy and comprehensiveness of categorization, the study investigates the integration of personality trait analysis in addition to identifying hazardous language. A wide range of online comments comprised the dataset that was gathered and put through extensive preparation methods such as text cleaning, lemmatization, and feature extraction. To facilitate the training and assessment of machine learning models, textual data was converted into numerical representations by utilising TF-IDF vectorization and word embeddings. Furthermore, personality traits were extracted from comments using sentiment analysis and language clues, which linked linguistic patterns with behavioural inclinations. The study resulted in the development and assessment of complex categorization models that combined features from textual content and inferred personality traits. The findings show encouraging associations between specific personality qualities and the use of toxic language, providing opportunities to identify subtle differences in toxic comment contexts. In order to provide insights into developing more sophisticated and successful methods of reducing toxicity in online discourse, this study outlines the methodology, major findings, and consequences of incorporating personality traits analysis into the classification of toxic comments.

Toxic Comments Hunter : Score Severity of Toxic Comments

A Survey of Toxic Comment Classification Methods

Purging the Poison: A Machine Learning Approach to Filtering Toxic Comments

Toxic Comment Classification based on Personality Traits Using NLP

Facilitating Fine-grained Detection of Chinese Toxic Language: Hierarchical Taxonomy, Resources, and Benchmarks

Modeling subjectivity (by Mimicking Annotator Annotation) in toxic comment identification across diverse communities

ToxiSpanSE: An Explainable Toxicity Detection in Code Review Comments

Which one is more toxic? Findings from Jigsaw Rate Severity of Toxic Comments

Impact of Sentiment Detection to Recognize Toxic and Subversive Online Comments

SS-BERT: Mitigating Identity Terms Bias in Toxic Comment Classification by Utilising the Notion of "Subjectivity" and "Identity Terms"

Predicting Different Types of Subtle Toxicity in Unhealthy Online Conversations

Comparison of Deep Learning Models and Various Text Pre-Processing Techniques for the Toxic Comments Classification

A Study of Multilingual Toxic Text Detection Approaches under Imbalanced Sample Distribution

Reading Between the Demographic Lines: Resolving Sources of Bias in Toxicity Classifiers

Character-Level Chinese Toxic Comment Classification Algorithm Based on CNN and Bi-GRU

Understanding Longitudinal Behaviors of Toxic Accounts on Reddit

Enhancing Transparency and Interpretability in Toxic Comment Classification: A Study on the Integration of Explainable Artificial Intelligence (XAI) Techniques || Dr. Pallavi Devendra Tawde, Mr. Jadyn Dias

Toxicity Inspector: A Framework to Evaluate Ground Truth in Toxicity Detection Through Feedback

Designing Toxic Content Classification for a Diversity of Perspectives

Empirical Analysis of Multi-Task Learning for Reducing Model Bias in Toxic Comment Detection

Investigating Bias In Automatic Toxic Comment Detection: An Empirical Study