Sentiment analysis of video danmakus based on MIBE-RoBERTa-FF-BiLSTM

Jianbo Zhao, Huailiang Liu, Yakai Wang, Weili Zhang, Xiaojin Zhang, Bowei Li, Tong Sun, Yanwei Qi, Shanzhuang Zhang
DOI: https://doi.org/10.1038/s41598-024-56518-z
IF: 4.6
2024-03-11
Scientific Reports
Abstract: Danmakus are user-generated comments overlaid on videos, enabling real-time interaction between viewers and video content. The emotional orientation of danmakus reflects viewers' attitudes and opinions toward video segments, which can help video platforms optimize content recommendation and assess users' abnormal emotion levels. To address the low transferability of traditional sentiment analysis methods to the danmaku domain, the low accuracy of danmaku text segmentation, the poor consistency of sentiment annotation, and insufficient semantic feature extraction, this paper proposes a video danmaku sentiment analysis method based on MIBE-RoBERTa-FF-BiLSTM. The paper constructs its own "Bilibili Must-Watch List and Top Video Danmaku Sentiment Dataset", covering 10,000 positive and negative danmaku texts across 18 themes. A new-word recognition algorithm based on mutual information (MI) and branch entropy (BE) discovers 2610 irregular popular internet neologisms, ranging from trigrams to heptagrams, in the dataset; these form a domain lexicon. Maslow's hierarchy of needs theory is applied to guide consistent sentiment annotation. The domain lexicon is integrated into the feature fusion layer of the RoBERTa-FF-BiLSTM model so that the model fully learns the semantic features of the word, character, and context information in danmaku texts and performs sentiment classification. Comparative experiments on the dataset show that the proposed model has the best overall performance among mainstream models for video danmaku sentiment classification, with an F1 value of 94.06%, and that its accuracy and robustness also exceed those of the other models. The limitations of this paper are that constructing the domain lexicon still requires manual participation and review, and that the semantic information of the video content and the positive case preference are ignored.
multidisciplinary sciences
What problem does this paper attempt to address?
The paper addresses sentiment analysis of danmaku (bullet-screen comments) and proposes a method based on MIBE-RoBERTa-FF-BiLSTM. Specifically, the research aims to solve the following key problems:

1. **Low transferability of traditional sentiment analysis methods in the danmaku domain**: Existing sentiment analysis methods perform poorly on danmaku, mainly because danmaku text has a unique language style and mode of expression.
2. **Low accuracy of danmaku text segmentation**: Danmaku contains many new internet words, abbreviations, and other non-standard vocabulary that traditional segmentation tools struggle to recognize.
3. **Poor consistency in sentiment annotation**: Different annotators may interpret the sentiment of the same danmaku differently, leading to inconsistent annotation results.
4. **Insufficient semantic feature extraction**: Existing methods fail to fully utilize the contextual information in danmaku text, which reduces the accuracy of sentiment analysis.

To address these issues, the paper proposes the following innovations:

- **Construction of a danmaku domain dictionary**: A new-word discovery algorithm based on Mutual Information (MI) and Branch Entropy (BE) automatically identifies non-standard popular vocabulary in danmaku text and forms a domain dictionary that improves segmentation accuracy.
- **Consistent sentiment annotation guidance**: Maslow's hierarchy of needs theory is applied to guide the sentiment annotation process, making the annotation more reasonable and consistent.
- **Deep learning model with integrated features**: The domain dictionary is integrated into the RoBERTa-FF-BiLSTM model so that it fully learns the semantic features of danmaku text and performs sentiment classification. The RoBERTa model extracts semantic features, while the BiLSTM captures contextual information.
Through experimental validation, the proposed model demonstrates the best overall performance among mainstream models, with an F1 score of 94.06%, and also outperforms other models in terms of accuracy and robustness. Nevertheless, the paper also points out some limitations of the method, such as the need for manual review in the construction of the domain dictionary, and the neglect of semantic information from danmaku video content and positive case preference.
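For readers unfamiliar with the metric, the F1 score is the harmonic mean of precision and recall. The sketch below computes it for a binary task, assuming the positive sentiment class is labeled 1; this summary does not state which averaging scheme the paper uses, and the labels here are made-up toy data rather than results from the paper.

```python
def precision_recall_f1(y_true, y_pred, positive=1):
    """Return (precision, recall, F1) for the chosen positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    return precision, recall, 2 * precision * recall / (precision + recall)


# Toy labels: 3 true positives, 1 false positive, 1 false negative.
y_true = [1, 1, 1, 0, 0, 1]
y_pred = [1, 0, 1, 0, 1, 1]
print(precision_recall_f1(y_true, y_pred))
```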