Abstract: With the growth of the modern web and the rapid adoption of social media platforms such as YouTube, Twitter, and Facebook, dealing with certain highly personal problems has become much easier. The far-reaching consequences of online harassment, however, call for immediate preventive measures, with early detection helping to safeguard psychological well-being and academic achievement. This paper aims to help eliminate online harassment and foster a criticism-free online environment. We use a variety of features to evaluate a large number of Bengali comments. We process the cleaned data with machine learning (ML) methods and natural language processing (NLP) techniques, vectorizing the text using term frequency-inverse document frequency (TF-IDF) with a count vectorizer. In addition, we use tokenization with padding to feed our deep learning (DL) models. With mathematical visualization and NLP, online bullying can be detected quickly. The ML models employed in this research are Multi-layer Perceptron (MLP), K-Nearest Neighbors (K-NN), Extreme Gradient Boosting (XGBoost), Adaptive Boosting Classifier (AdaBoost), Logistic Regression Classifier (LR), Random Forest Classifier (RF), Bagging Classifier, Stochastic Gradient Descent (SGD), Voting Classifier, and Stacking. We extended the investigation to several DL models: Deep Neural Networks (DNN), Convolutional Neural Networks (CNN), Convolutional-Long Short-Term Memory (C-LSTM), and Bidirectional Long Short-Term Memory (BiLSTM). Recognizing harassing behavior accurately requires a large amount of data. To recognize harassing text rapidly, we merged two datasets, yielding 94,000 Bengali comments from different perspectives.
After evaluating the ML and DL models, we find that a hybrid model (MLP+SGD+LR) outperforms the other models: on the multi-label classification task it achieves an accuracy of 99.34%, a precision of 99.34%, a recall of 99.33%, and an F1 score of 99.34%. For the binary classification model, we obtained an accuracy of 99.41%.
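
The two feature pipelines named in the abstract, TF-IDF weighting for the ML models and tokenization with padding for the DL models, can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation: the function names, the whitespace tokenizer, the unsmoothed `log(N / df)` form of IDF, right-side zero padding, and the toy transliterated comments are all assumptions made for illustration.

```python
import math
from collections import Counter


def tfidf(docs):
    """Return one {term: weight} dict per document using tf * idf.

    tf is the raw count divided by document length; idf is the plain
    textbook log(N / df). Libraries often apply smoothing instead.
    """
    tokenized = [doc.split() for doc in docs]
    n_docs = len(tokenized)
    # Document frequency: the number of documents each term appears in.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    weights = []
    for tokens in tokenized:
        counts = Counter(tokens)
        weights.append({
            term: (count / len(tokens)) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return weights


def build_vocab(docs):
    """Map each term to an integer id; id 0 is reserved for padding."""
    vocab = {}
    for doc in docs:
        for term in doc.split():
            vocab.setdefault(term, len(vocab) + 1)
    return vocab


def encode_and_pad(docs, vocab, max_len):
    """Encode documents as fixed-length id sequences.

    Sequences longer than max_len are truncated; shorter ones are
    right-padded with 0 (an assumed convention).
    """
    sequences = []
    for doc in docs:
        ids = [vocab[t] for t in doc.split() if t in vocab][:max_len]
        sequences.append(ids + [0] * (max_len - len(ids)))
    return sequences


# Hypothetical toy comments (transliterated placeholders, not from the dataset).
comments = ["tumi khub bhalo", "tumi kharap", "khub kharap kotha bolo"]
weights = tfidf(comments)
vocab = build_vocab(comments)
padded = encode_and_pad(comments, vocab, max_len=4)
```

In this sketch, a term that occurs in only one comment receives a higher TF-IDF weight than a term shared across comments, while the padded sequences all have the same length, which is what batch-based DL models typically require.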

Multi-class Sports News Categorization using Machine Learning Techniques: Resource Creation and Evaluation

Machine and Deep Learning Methods with Manual and Automatic Labelling for News Classification in Bangla Language

Classification Benchmarks for Under-resourced Bengali Language based on Multichannel Convolutional-LSTM Network

Urdu News Content Classification Using Machine Learning Algorithms

Traditional Bangladeshi Sports Video Classification Using Deep Learning Method

Detection of Bangla Fake News using MNB and SVM Classifier

Leveraging textual information for social media news categorization and sentiment analysis

A subjectivity classification framework for sports articles using improved cortical algorithms

Classification of Bangla Compound Characters Using a HOG-CNN Hybrid Model

InceptB: A CNN Based Classification Approach for Recognizing Traditional Bengali Games

Categorization of Bangla Web Text Documents Based on TF-IDF-ICF Text Analysis Scheme

Amplifying document categorization with advanced features and deep learning

Machine learning and deep learning-based approach to categorize Bengali comments on social networks using fused dataset

Robust Sports Image Classification Using InceptionV3 and Neural Networks

Classifying Fake News Detection Using SVM, Naive Bayes and LSTM

A Novel Approach to Enhance the Performance of Semantic Search in Bengali using Neural Net and other Classification Techniques

An Empirical Study on the Classification of Chinese News Articles by Machine Learning and Deep Learning Techniques

Detecting Racist Text in Bengali: An Ensemble Deep Learning Framework

Sentiment analysis in multilingual context: Comparative analysis of machine learning and hybrid deep learning models

Sentiment analysis in Bengali via transfer learning using multi-lingual BERT

A novel Data and Model Centric artificial intelligence based approach in developing high-performance Named Entity Recognition for Bengali Language