Melanie Subbiah, Amrita Bhattacharjee, Yilun Hua, Tharindu Kumarage, Huan Liu, Kathleen McKeown
Abstract: Manipulated news online is a growing problem which necessitates the use of automated systems to curtail its spread. We argue that while misinformation and disinformation detection have been studied, there has been a lack of investment in the important open challenge of detecting harmful agendas in news articles; identifying harmful agendas is critical to flag news campaigns with the greatest potential for real world harm. Moreover, due to real concerns around censorship, harmful agenda detectors must be interpretable to be effective. In this work, we propose this new task and release a dataset, NewsAgendas, of annotated news articles for agenda identification. We show how interpretable systems can be effective on this task and demonstrate that they can perform comparably to black-box models.
What problem does this paper attempt to address?
### The Problem the Paper Aims to Solve
This paper addresses the open challenge of detecting harmful agendas in news articles. The authors argue that while misinformation and disinformation detection have been studied, detecting harmful agendas in news articles has received too little attention. Identifying harmful agendas is crucial for flagging the news campaigns with the greatest potential for real-world harm. Additionally, because of concerns about censorship, harmful agenda detectors must be interpretable to be effective.
### Background and Motivation
In recent years, the spread of misinformation and disinformation online has become a particularly persistent and harmful problem. For example, during the COVID-19 pandemic in the United States, malicious actors spread misinformation about mask mandates, vaccines, and fake treatments to discredit the government and public health officials. These actors often have harmful agendas, such as discrediting a particular individual or group, or encouraging destructive real-world actions. With the advent of new conversational language models like ChatGPT and GPT-4, malicious actors can generate human-like harmful text content at scale.
### Main Contributions
1. **Defining the Important Open Challenge of Harmful Agenda Detection**: Proposing and defining the new task of harmful agenda detection in news articles.
2. **Annotating and Releasing a Dataset**: Creating and releasing an annotated dataset, NewsAgendas, containing 506 news articles with 882 fine-grained label annotations.
3. **Developing a Harmful Agenda Detector**: Developing a harmful agenda detector that ensures interpretability while maintaining performance.
### Methods and Experiments
- **Dataset Construction**: Articles were selected from FakeNewsCorpus, Yang et al.'s dataset of satirical and real news articles, and the Proppy corpus, covering a variety of features and agendas.
- **Annotation Method**: Columbia University students with backgrounds in journalism, political science, or natural language processing were hired for annotation. Annotators needed to assess whether an article advanced a harmful agenda and annotate specific features within the article.
- **Model Training**: Feature detectors were trained on weakly labeled data, and the features they predict were combined by a linear classifier to produce the final agenda classification.
- **Results Analysis**: The system's effectiveness was validated through a series of experiments, including the prediction of overall agenda scores, the relationship between features and agenda scores, and the performance of feature detectors.
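
The interpretable design described under Model Training can be sketched as follows. This is a minimal illustration, not the authors' implementation: it assumes the feature detectors already output scores in [0, 1], uses a plain logistic regression as the linear classifier, and runs on made-up toy data. The feature names are borrowed from the features this summary mentions; the scores, labels, and hyperparameters are hypothetical.

```python
import math

# Hypothetical feature names, borrowed from the features named in this summary.
FEATURES = ["hate_speech", "negative_sentiment", "neutral_sentiment", "satire"]


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def predict_proba(weights, bias, x):
    """Probability that an article advances a harmful agenda."""
    return sigmoid(bias + sum(w * xi for w, xi in zip(weights, x)))


def train_linear_classifier(rows, labels, lr=0.5, epochs=2000):
    """Fit a logistic regression with full-batch gradient descent."""
    weights = [0.0] * len(rows[0])
    bias = 0.0
    n = len(rows)
    for _ in range(epochs):
        grad_w = [0.0] * len(weights)
        grad_b = 0.0
        for x, y in zip(rows, labels):
            err = predict_proba(weights, bias, x) - y
            for j, xj in enumerate(x):
                grad_w[j] += err * xj
            grad_b += err
        weights = [w - lr * g / n for w, g in zip(weights, grad_w)]
        bias -= lr * grad_b / n
    return weights, bias


# Made-up detector scores (one row per article) and agenda labels
# (1 = harmful, 0 = benign). These are NOT taken from NewsAgendas.
ROWS = [
    [0.9, 0.8, 0.1, 0.0],
    [0.7, 0.9, 0.0, 0.1],
    [0.8, 0.6, 0.2, 0.0],
    [0.1, 0.2, 0.9, 0.1],
    [0.0, 0.1, 0.8, 0.0],
    [0.1, 0.3, 0.3, 0.9],
]
LABELS = [1, 1, 1, 0, 0, 0]

weights, bias = train_linear_classifier(ROWS, LABELS)

# The learned weights are the explanation: a positive weight pushes the
# prediction toward "harmful", a negative weight toward "benign".
for name, w in sorted(zip(FEATURES, weights), key=lambda p: -p[1]):
    print(f"{name:>20s}: {w:+.2f}")
```

Because the final decision is a weighted sum of named features, each prediction can be traced back to the features that drove it, which is the kind of interpretability the paper argues a harmful agenda detector needs.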
### Conclusion
- **System Performance**: Interpretable models perform slightly below black-box models but remain comparable to them, so they are still effective at detecting harmful agendas.
- **Feature Importance**: Hate speech and negative sentiment are the strongest indicators of harmful agendas, while neutral sentiment and satire are more likely indicators of benign agendas.
- **Future Work**: Further improving feature detectors to enhance the accuracy of overall agenda classification, particularly by increasing the data volume and improving model interpretability.
Overall, by defining the task of harmful agenda detection in news articles and releasing a dataset and an interpretable detector for it, this paper provides new insights and tools for automatically detecting harmful content and limiting its spread.