Abstract: Prior studies have shown the existence of gender bias in job postings, performance reviews, and letters of recommendation. However, very little research has examined gender bias in mainstream news sources or how it varies across publications. Given the rapid pace of news dissemination, human editing alone is not effective enough to address these biases. Even computer programs that parse news articles for specific words and references fall short of detecting undertones and implicit references, which is why more sophisticated techniques such as Artificial Intelligence (AI) are necessary. In this study, I used Natural Language Processing (NLP) methods, implemented as a series of Python programs, to analyze how biases in news content vary in type, variety, and intensity. I used over 500,000 news articles from 15 publications, spanning over 4 years, to build and train my algorithm. Using Word2Vec, a popular NLP method, I was able to conclude that more right-leaning publications are more likely to exhibit misogynistic content that is biased against women. However, the method fell short of identifying many forms of objectification, such as Benevolent Sexism. Similarly, using VADER, a Python sentiment analysis tool, I was able to determine that metrics of positive, negative, and neutral sentiment alone are not sufficient to detect occurrences of gender bias. To gauge the breadth of sexist language effectively, I used the LIWC text analysis program, which calculates the percentage of words in a given text that fall into one or more of over 80 linguistic, psychological, and topical categories indicating various social, cognitive, and affective processes. With statistical evidence, my study concludes that implicit gender bias is present across all publications but is more prevalent in right-leaning ones.
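To make the LIWC-style measurement concrete, the following is a minimal sketch of how the percentage of words falling into each category can be computed for a text. It is not the LIWC program itself: the lexicon `TOY_LEXICON`, its category names, and the function `category_percentages` are hypothetical and chosen only for illustration, since the actual LIWC dictionaries are proprietary and far larger.

```python
import re
from collections import Counter

# Hypothetical toy lexicon for illustration only; the real LIWC
# dictionaries cover over 80 categories and are not reproduced here.
TOY_LEXICON = {
    "affect": {"happy", "sad", "angry", "lovely"},
    "female_refs": {"she", "her", "woman", "women"},
    "male_refs": {"he", "his", "him", "man", "men"},
}


def category_percentages(text, lexicon=TOY_LEXICON):
    """Return, per category, the percentage of tokens in `text` that belong to it."""
    # Simple lowercase word tokenizer (an assumption; LIWC uses its own rules).
    tokens = re.findall(r"[a-z']+", text.lower())
    if not tokens:
        return {name: 0.0 for name in lexicon}

    counts = Counter()
    for tok in tokens:
        # A word may count toward more than one category.
        for name, words in lexicon.items():
            if tok in words:
                counts[name] += 1

    return {name: 100.0 * counts[name] / len(tokens) for name in lexicon}


if __name__ == "__main__":
    sample = "She is a lovely woman and he is angry"
    for category, pct in category_percentages(sample).items():
        print(f"{category}: {pct:.1f}%")
```

Comparing such percentages across publications is one way category-level differences in language could be quantified, under the assumption that a suitable lexicon is available.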

Mitigating Gender Bias in Machine Learning Data Sets

Mitigating Gender Bias in Natural Language Processing: Literature Review

Fairness in AI Systems: Mitigating gender bias from language-vision models

Gender Bias in AI Recruitment Systems: A Sociological- and Data Science-based Case Study

Exploration, detection, and mitigation: Unveiling gender bias in NLP

AI Gender Bias, Disparities, and Fairness: Does Training Data Matter?

Multi-Dimensional Gender Bias Classification

Gender Bias in Neural Natural Language Processing

Exploring gender biases in ML and AI academic research through systematic literature review

Projective Methods for Mitigating Gender Bias in Pre-trained Language Models

Toward Gender-Inclusive Coreference Resolution: An Analysis of Gender and Bias Throughout the Machine Learning Lifecycle

Gender Bias in Big Data Analysis

Interpretable bias mitigation for textual data: Reducing gender bias in patient notes while maintaining classification performance

Evaluating Gender Bias in Natural Language Inference

Big data and AI for gender equality in health: bias is a big challenge

User Acceptance of Gender Stereotypes in Automated Career Recommendations

Fairway: SE Principles for Building Fairer Software

Analyzing the Extent to which Gender Bias Exists in News Articles Using Natural Language Processing Techniques

Reducing Gender Bias in Abusive Language Detection

Debiasing Gender Bias in Information Retrieval Models

How Far Can It Go?: On Intrinsic Gender Bias Mitigation for Text Classification