Abstract:In the past few decades, the internet has become an inseparable part of human life. It provides ease of access and permeates almost every aspect of human existence. One of the internet platforms that is widely used by people around the world is social media. Apart from being spoiled with the convenience and efficiency offered by social media to support daily life, it has gained popularity among a wide audience. This has positive implications when utilized effectively, but it cannot be denied that there are negative consequences if not utilized properly. One such consequence is the prevalence of cyberbullying activities on social media. Cyberbullying has become a major concern for the public and social media users, prompting researchers to leverage information technology in developing technologies that can identify the elements of cyberbullying, particularly on social media platforms. Sentiment analysis has been employed by researchers to identify the components of cyberbullying in online platforms. Sentiment analysis involves the application of natural language processing techniques and text analysis to identify and extract subjective information from text. This study aims to compare the Naive Bayes algorithm and the Support Vector Machine algorithm, while utilizing feature selection, specifically chi-square, to enhance the accuracy of both algorithms in classifying Instagram comments. The experimental results indicate that the Multinomial Naive Bayes (MNB) algorithm outperforms the Support Vector Machine (SVM) algorithm, achieving an accuracy of 83.85% without feature selection and 90.77% with feature selection. Meanwhile, SVM achieves an accuracy of 82.31% without feature selection and 90% with feature selection. Evaluation through the confusion matrix and classification report reveals that MNB exhibits better precision and recall rates compared to SVM in identifying bullying and non-bullying classes. The use of feature selection enhances the performance of both algorithms in classifying Instagram comments related to cyberbullying.

Influence of Word Normalization and Chi-Squared Feature Selection on Support Vector Machine (SVM) Text Classification

Feature Selection for Support Vector Machines in Text Categorization

An Enhanced Hybrid Feature Selection Technique Using Term Frequency-Inverse Document Frequency and Support Vector Machine-Recursive Feature Elimination for Sentiment Classification

A Study of Discriminatory Speech Classification Based on Improved Smote and SVM-RF

Sentiment Analysis of Short Texts Using SVMs and VSMs-Based Multiclass Semantic Classification

Web Page Classification Based on SVM

SVM Classification:Its Contents and Challenges

Improving sentiment reviews classification performance using support vector machine-fuzzy matching algorithm

Comparison of NB and SVM in Sentiment Analysis of Cyberbullying using Feature Selection

Chinese text classification based on character-level CNN and SVM

Sentiment analysis of Japanese text and vocabulary learning based on natural language processing and SVM

Data preprocessing approach for machine learning-based sentiment classification

Classification of Src Kinase Inhibitors Based on Support Vector Machine

Sentiment Text Classification of Customers Reviews on the Web Based on SVM

Classification of masses on mammograms using support vector machine

Performance Assessment of Multiple Classifiers Based on Ensemble Feature Selection Scheme for Sentiment Analysis

Feature Rescaling of Support Vector Machines

Feature selection based on a normalized difference measure for text classification

Adapting Feature Selection Algorithms for the Classification of Chinese Texts

An Empirical Study on the Classification of Chinese News Articles by Machine Learning and Deep Learning Techniques