Abstract:Media bias significantly shapes public perception by reinforcing stereotypes and exacerbating societal divisions. Prior research has often focused on isolated media bias dimensions such as \textit{political bias} or \textit{racial bias}, neglecting the complex interrelationships among various bias dimensions across different topic domains. Moreover, we observe that models trained on existing media bias benchmarks fail to generalize effectively on recent social media posts, particularly in certain bias identification tasks. This shortfall primarily arises because these benchmarks do not adequately reflect the rapidly evolving nature of social media content, which is characterized by shifting user behaviors and emerging trends. In response to these limitations, our research introduces a novel dataset collected from YouTube and Reddit over the past five years. Our dataset includes automated annotations for YouTube content across a broad spectrum of bias dimensions, such as gender, racial, and political biases, as well as hate speech, among others. It spans diverse domains including politics, sports, healthcare, education, and entertainment, reflecting the complex interplay of biases across different societal sectors. Through comprehensive statistical analysis, we identify significant differences in bias expression patterns and intra-domain bias correlations across these domains. By utilizing our understanding of the correlations among various bias dimensions, we lay the groundwork for creating advanced systems capable of detecting multiple biases simultaneously. Overall, our dataset advances the field of media bias identification, contributing to the development of tools that promote fairer media consumption. The comprehensive awareness of existing media bias fosters more ethical journalism, promotes cultural sensitivity, and supports a more informed and equitable public discourse.

Uncovering Media Bias Via Social Network Learning

Mapping the Media Landscape: Predicting Factual Reporting and Political Bias Through Web Interactions

Intertwined Biases Across Social Media Spheres: Unpacking Correlations in Media Bias Dimensions

Developing a Natural Language Understanding Model to Characterize Cable News Bias

Uncovering the Essence of Diverse Media Biases from the Semantic Embedding Space

Quantitative Analysis of Forecasting Models:In the Aspect of Online Political Bias

Balancing Transparency and Accuracy: A Comparative Analysis of Rule-Based and Deep Learning Models in Political Bias Classification

More Voices Than Ever? Quantifying Media Bias in Networks

Predicting the Politics of an Image Using Webly Supervised Data

Media Bias and Polarization through the Lens of a Markov Switching Latent Space Network Model

A Machine Learning Pipeline to Examine Political Bias with Congressional Speeches

Analysis of Media Writing Style Bias through Text-Embedding Networks

In Plain Sight: Media Bias Through the Lens of Factual Reporting

SENTINET: A DEEP SENTIMENT ANALYSIS NETWORK FOR POLITICAL MEDIA BIAS DETECTION

Machine-Learning media bias

Modeling Political Orientation of Social Media Posts: An Extended Analysis

The Effects of Media Bias on News Recommendations

NewsUnfold: Creating a News-Reading Application That Indicates Linguistic Media Bias and Collects Feedback

Learning Unbiased News Article Representations: A Knowledge-Infused Approach

MGM: Global Understanding of Audience Overlap Graphs for Predicting the Factuality and the Bias of News Media