Multi-task Prompt Words Learning for Social Media Content Generation

Haochen Xue,Chong Zhang,Chengzhi Liu,Fangyu Wu,Xiaobo Jin
2024-07-10
Abstract:The rapid development of the Internet has profoundly changed human life. Humans are increasingly expressing themselves and interacting with others on social media platforms. However, although artificial intelligence technology has been widely used in many aspects of life, its application in social media content creation is still blank. To solve this problem, we propose a new prompt word generation framework based on multi-modal information fusion, which combines multiple tasks including topic classification, sentiment analysis, scene recognition and keyword extraction to generate more comprehensive prompt words. Subsequently, we use a template containing a set of prompt words to guide ChatGPT to generate high-quality tweets. Furthermore, in the absence of effective and objective evaluation criteria in the field of content generation, we use the ChatGPT tool to evaluate the results generated by the algorithm, making large-scale evaluation of content generation algorithms possible. Evaluation results on extensive content generation demonstrate that our cue word generation framework generates higher quality content compared to manual methods and other cueing techniques, while topic classification, sentiment analysis, and scene recognition significantly enhance content clarity and its consistency with the image.
Computation and Language,Computer Vision and Pattern Recognition,Multimedia
What problem does this paper attempt to address?
The paper attempts to address the inadequacies in applying artificial intelligence technology to social media content generation. Although AI has been widely applied in many aspects of life, there is still a gap in its application in the field of social media content creation. Specifically, the paper focuses on the following aspects: 1. **Quality of content generation**: How to use multimodal information fusion to generate higher quality social media content, including the consistency and relevance of text and images. 2. **Controllability of automated generation**: How to control and optimize the generated content through Multi-task Prompt Words Learning (MPWL) to make it more in line with human aesthetics and needs. 3. **Lack of evaluation standards**: The lack of effective objective evaluation standards in the field of content generation, and how to use existing large language models (such as ChatGPT) for large-scale, standardized evaluation. To address these issues, the paper proposes a prompt word generation framework based on multimodal information fusion, combining multiple tasks such as topic classification, sentiment analysis, scene recognition, and keyword extraction to generate more comprehensive prompt words. These prompt words are then used to guide ChatGPT in generating high-quality tweets. Additionally, the paper proposes a method for evaluating the generated content, utilizing the ChatGPT tool for large-scale, standardized evaluation, thereby addressing the shortcomings of existing evaluation methods. In summary, the paper aims to improve the quality and controllability of social media content generation through a multi-task prompt word learning framework and provide an effective evaluation method.