Advancing Twitter Sentiment Analysis: An Ensemble Approach with Transformer-XL, RoBERTa, and XGBoost

Dudipalli Madhu,Shaik Sharez Hussain,Chandra Nikhil Kumar,Praveen Tumuluru,Likhith Kankanala,Shaik Shoaib
DOI: https://doi.org/10.1109/ICSSAS57918.2023.10331828
2023-10-18
Abstract:The emergence of social media has made it crucial to analyze and comprehend users' sentiments, especially on platforms like Twitter, which provide real-time data for sentiment analysis. This study proposes an innovative ensemble classifier model incorporating the latest advancements in Natural Language Processing (NLP) and machine learning technologies, such as Transformer-XL, RoBERTa, and XGBoost, for Twitter sentiment analysis. Transformer-XL is a state-of-the-art language model that can capture long-range dependencies in text data, making it ideal for handling sequential data like tweets. RoBERTa, on the other hand, is a variant of BERT that leverages a larger dataset and training time, resulting in improved performance in understanding the context and meaning of words in tweets. In addition to these powerful language models, XGBoost, an efficient gradient-boosting framework is integrated, into the ensemble. XGBoost is renowned for its ability to handle complex interactions between features and has been successfully applied in various machine learning tasks. This study leverage a diverse and comprehensive dataset extending beyond COVID-19 pandemic-related tweets to train the ensemble classifier model. It ensures the Transformer-XL-RoBERTa-XGBoost Ensemble model's versatility and generalizability for sentiment analysis across different contexts by including tweets from various domains and topics. Proposed experimental results demonstrates that the proposed ensemble classifier model exceeds individual classifiers with accuracy, precision, recall, and F1 score. Incorporating advanced NLP techniques and the power of XGBoost enables the ensemble model to achieve higher accuracy and better performance in sentiment analysis on Twitter data. By harnessing the potential of cutting-edge technologies, the study gains valuable insights into user sentiments, which can benefit various applications, including market analysis, brand perception monitoring, and social trend tracking.
Computer Science
What problem does this paper attempt to address?