Abstract:In our rapidly evolving digital sphere, the ability to discern media bias becomes crucial as it can shape public sentiment and influence pivotal decisions. The advent of large language models (LLMs), such as ChatGPT, noted for their broad utility in various natural language processing (NLP) tasks, invites exploration of their efficacy in media bias detection. Can ChatGPT detect media bias? This study seeks to answer this question by leveraging the Media Bias Identification Benchmark (MBIB) to assess ChatGPT's competency in distinguishing six categories of media bias, juxtaposed against fine-tuned models such as BART, ConvBERT, and GPT-2. The findings present a dichotomy: ChatGPT performs at par with fine-tuned models in detecting hate speech and text-level context bias, yet faces difficulties with subtler elements of other bias detections, namely, fake news, racial, gender, and cognitive biases.

What problem does this paper attempt to address?

The paper attempts to address the issue of evaluating the effectiveness of large language models (LLMs) such as ChatGPT in detecting media bias. Specifically, the researchers use the "Media Bias Identification Benchmark" (MBIB) to test ChatGPT's ability to distinguish between six types of media bias (hate speech, text-level contextual bias, fake news, racial bias, gender bias, and cognitive bias) and compare it with fine-tuned language models such as BART, ConvBERT, and GPT-2. ### Main Questions: 1. **Can ChatGPT effectively detect media bias?** - The researchers aim to verify ChatGPT's performance in different types of media bias detection tasks through experiments. 2. **How does ChatGPT compare to other fine-tuned models?** - By comparing the performance of ChatGPT with BART, ConvBERT, and GPT-2 in detecting media bias, the researchers evaluate its strengths and weaknesses. ### Background: - **Importance of Media Bias**: Media bias can influence public sentiment and critical decision-making, making the detection and understanding of media bias particularly important in the digital age. - **Limitations of Existing Methods**: Existing methods for detecting media bias (including manual content analysis and automated methods) face challenges in scalability and complexity, especially in detecting subtle language nuances. - **Potential of Large Language Models**: Large language models (such as GPT) have shown excellent performance in natural language processing tasks, and researchers hope to explore the potential application of these models in media bias detection. ### Experimental Design: - **Dataset**: The MBIB dataset, covering six types of media bias tasks. - **Models**: ChatGPT (zero-shot learning), BART, ConvBERT, and GPT-2 (fine-tuned models). - **Evaluation Metrics**: Micro-average F1 score and macro-average F1 score. ### Results: - **Performance of ChatGPT**: - ChatGPT's performance is comparable to fine-tuned models in detecting hate speech and text-level contextual bias. - ChatGPT performs poorly in detecting fake news, racial bias, gender bias, and cognitive bias. - **Performance of Fine-Tuned Models**: - Fine-tuned models (BART, ConvBERT, and GPT-2) perform better in all tasks, especially in tasks requiring deep contextual understanding. ### Conclusion: - **Advantages of ChatGPT**: It has some capability in detecting explicit biases (such as hate speech and text-level contextual bias). - **Disadvantages of ChatGPT**: It performs poorly in tasks requiring deep contextual understanding and subtle language details. - **Future Directions**: ChatGPT's performance in media bias detection can be further improved through few-shot prompting and human evaluation. This paper provides a preliminary evaluation of the application of large language models in the field of media bias detection, highlighting their potential advantages and limitations, and offering suggestions for future improvements.

ChatGPT v.s. Media Bias: A Comparative Study of GPT-3.5 and Fine-tuned Language Models

BiasTestGPT: Using ChatGPT for Social Bias Testing of Language Models

Statistical Analysis of Bias in ChatGPT Using Prompt Engineering

LLMs left, right, and center: Assessing GPT's capabilities to label political bias from web domains

Artificial Intelligence Tools and Bias in Journalism-related Content Generation: Comparison Between Chat GPT-3.5, GPT-4 and Bing

Is ChatGPT More Biased Than You?

Linguistic Bias in ChatGPT: Language Models Reinforce Dialect Discrimination

Taught by the Internet, Exploring Bias in OpenAIs GPT3

Quite Good, but Not Enough: Nationality Bias in Large Language Models -- A Case Study of ChatGPT

ChatGPT and Bard Responses to Polarizing Questions

Examining Multimodal Gender and Content Bias in ChatGPT-4o

Is GPT-4 Less Politically Biased than GPT-3.5? A Renewed Investigation of ChatGPT's Political Biases

Red teaming ChatGPT via Jailbreaking: Bias, Robustness, Reliability and Toxicity

Should ChatGPT be Biased? Challenges and Risks of Bias in Large Language Models

Bias of AI-generated content: an examination of news produced by large language models

A Preliminary Study of ChatGPT on News Recommendation: Personalization, Provider Fairness, Fake News

Exploring the Capability of ChatGPT to Reproduce Human Labels for Social Computing Tasks (Extended Version)

The high dimensional psychological profile and cultural bias of ChatGPT

ChatGPT Based Data Augmentation for Improved Parameter-Efficient Debiasing of LLMs

BiasAlert: A Plug-and-play Tool for Social Bias Detection in LLMs

Stars, Stripes, and Silicon: Unravelling the ChatGPT's All-American, Monochrome, Cis-centric Bias