A hybrid convolutional neural network for sarcasm detection from multilingual social media posts
Kumar, Abhinav; Tripathi, Sudhakar
DOI: https://doi.org/10.1007/s11042-024-19672-0
IF: 2.577
2024-06-25
Multimedia Tools and Applications
Abstract: Sarcasm in social media posts significantly impairs automated sentiment extraction because it can invert the overall polarity of a phrase. Extracting genuine sentiment from informal comments about events or individuals is especially difficult when the messages mix languages, and the prevalence of noisy datasets further complicates the task. This work proposes a hybrid convolutional neural network (CNN-H) model to classify statements as sarcastic or non-sarcastic. Unlike traditional CNN models, CNN-H integrates character and word embeddings, enhancing its capacity to handle the complexities of multilingual content. Evaluation on five benchmark datasets, three English (news headlines, Ghosh Tweet, Riloff Tweet), one Hindi, and one multilingual Hindi-English (Hinglish), demonstrates the superior performance of CNN-H. The model surpasses various state-of-the-art conventional machine-learning and deep-learning models, including CNN, achieving an F1-score of 93.0% on the multilingual dataset. The combination of character and word embeddings proves pivotal in mitigating the impact of the noisy data prevalent in social media posts.
computer science, information systems, theory & methods, engineering, electrical & electronic, software engineering
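
The abstract credits the pairing of character and word embeddings for robustness to noisy text. The following is a minimal sketch of that intuition only, not the authors' pipeline: it prepares the two input views (word ids and character ids) that a hybrid model of this kind could embed and convolve over. The helper names `build_vocab` and `encode`, the reserved ids, the length limits, and the toy sentences are all hypothetical and chosen for illustration.

```python
from typing import Dict, List, Tuple

PAD, UNK = 0, 1  # reserved ids (an assumption), shared by both vocabularies


def build_vocab(items: List[str]) -> Dict[str, int]:
    """Map each distinct item to an id, starting after the reserved ids."""
    vocab: Dict[str, int] = {}
    for item in items:
        if item not in vocab:
            vocab[item] = len(vocab) + 2
    return vocab


def encode(text: str, word_vocab: Dict[str, int], char_vocab: Dict[str, int],
           max_words: int = 6, max_chars: int = 24) -> Tuple[List[int], List[int]]:
    """Return (word_ids, char_ids), each padded or truncated to a fixed length."""
    lowered = text.lower()
    word_ids = [word_vocab.get(w, UNK) for w in lowered.split()][:max_words]
    char_ids = [char_vocab.get(c, UNK) for c in lowered][:max_chars]
    word_ids += [PAD] * (max_words - len(word_ids))
    char_ids += [PAD] * (max_chars - len(char_ids))
    return word_ids, char_ids


# Toy training text, invented for illustration (not from the paper's datasets).
train = ["great job yaar", "so great"]
word_vocab = build_vocab([w for t in train for w in t.split()])
char_vocab = build_vocab([c for t in train for c in t])

# "sooo" and "greaat" are spellings never seen during vocabulary building.
word_ids, char_ids = encode("sooo greaat job", word_vocab, char_vocab)
print(word_ids)  # the two noisy words collapse to UNK at the word level
print(char_ids)  # every character is still a known id at the character level
```

In this toy case the word view loses the two elongated words entirely, while the character view still represents them with known symbols, which is one plausible reason a model that sees both views degrades less on noisy social media text.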