Abstract:The emergence of social media has made it crucial to analyze and comprehend users' sentiments, especially on platforms like Twitter, which provide real-time data for sentiment analysis. This study proposes an innovative ensemble classifier model incorporating the latest advancements in Natural Language Processing (NLP) and machine learning technologies, such as Transformer-XL, RoBERTa, and XGBoost, for Twitter sentiment analysis. Transformer-XL is a state-of-the-art language model that can capture long-range dependencies in text data, making it ideal for handling sequential data like tweets. RoBERTa, on the other hand, is a variant of BERT that leverages a larger dataset and training time, resulting in improved performance in understanding the context and meaning of words in tweets. In addition to these powerful language models, XGBoost, an efficient gradient-boosting framework is integrated, into the ensemble. XGBoost is renowned for its ability to handle complex interactions between features and has been successfully applied in various machine learning tasks. This study leverage a diverse and comprehensive dataset extending beyond COVID-19 pandemic-related tweets to train the ensemble classifier model. It ensures the Transformer-XL-RoBERTa-XGBoost Ensemble model's versatility and generalizability for sentiment analysis across different contexts by including tweets from various domains and topics. Proposed experimental results demonstrates that the proposed ensemble classifier model exceeds individual classifiers with accuracy, precision, recall, and F1 score. Incorporating advanced NLP techniques and the power of XGBoost enables the ensemble model to achieve higher accuracy and better performance in sentiment analysis on Twitter data. By harnessing the potential of cutting-edge technologies, the study gains valuable insights into user sentiments, which can benefit various applications, including market analysis, brand perception monitoring, and social trend tracking.

Two simple and effective ensemble classifiers for Twitter sentiment analysis

Tweet sentiment analysis with classifier ensembles

A reliable sentiment analysis for classification of tweets in social networks

Ensemble of feature sets and classification algorithms for sentiment classification

NNEMBs at SemEval-2017 Task 4: Neural Twitter Sentiment Classification: a Simple Ensemble Method with Different Embeddings.

Twitter Sentiment Analysis: Lexicon Method, Machine Learning Method and Their Combination

Advancing Twitter Sentiment Analysis: An Ensemble Approach with Transformer-XL, RoBERTa, and XGBoost

Sentiment classification: The contribution of ensemble learning

Improving Sentiment Analysis for Social Media Applications Using an Ensemble Deep Learning Language Model

Feature Selection Approach for Twitter Sentiment Analysis and Text Classification Based on Chi-Square and Naïve Bayes

Harnessing Twitter for Automatic Sentiment Identification Using Machine Learning Techniques

Twitter sentiment analysis using various classification algorithms

Live Sentiment Analysis Using Multiple Machine Learning and Text Processing Algorithms

A Novel Machine Learning Approach for Sentiment Analysis on Twitter Incorporating the Universal Language Model Fine-Tuning and SVM

Analyzing Social Media Sentiment: Twitter as a Case Study

Machine Learning Techniques for Sentiment Analysis of COVID-19-Related Twitter Data

Microblog Sentiment Classification with Heterogeneous Sentiment Knowledge

Learning the Market: Sentiment-Based Ensemble Trading Agents

Using Support Vector Machine Ensembles for Target Audience Classification on Twitter

Comparative Study of Machine Learning Algorithms for Twitter Sentiment Analysis

User-level Sentiment Analysis Incorporating Social Networks