Abstract: Concerns about gender bias in word embedding models have captured substantial attention in the algorithmic bias research literature. Other bias types, however, have received less scrutiny. This work describes a large-scale analysis of sentiment associations in popular word embedding models along the lines of gender and ethnicity, but also along the less frequently studied dimensions of socioeconomic status, age, physical appearance, sexual orientation, religious sentiment and political leanings. Consistent with previous scholarly literature, this work finds systemic bias against given names popular among African-Americans in most of the embedding models examined. Gender bias in embedding models, however, appears to be multifaceted and often reversed in polarity relative to what has been regularly reported. Interestingly, using the common operationalization of the term <em>bias</em> in the fairness literature, previously unreported bias types in word embedding models have also been identified. Specifically, the popular embedding models analyzed here display negative biases against middle and working-class socioeconomic status, male children, senior citizens, plain physical appearance and intellectual phenomena such as Islamic religious faith, non-religiosity and conservative political orientation. Reasons for the paradoxical underreporting of these bias types in the relevant literature are probably manifold, but widely held blind spots when searching for algorithmic bias and the lack of widely shared technical vocabulary to unambiguously describe a variety of algorithmic associations could conceivably be playing a role. The causal origins of the many loaded associations attached to distinct demographic groups within embedding models are often unclear, but the heterogeneity of these associations and their potentially multifactorial roots raise doubts about the validity of grouping them all under the umbrella term <em>bias</em>.
Richer and more fine-grained terminology as well as a more comprehensive exploration of the bias landscape could help the fairness epistemic community to characterize and neutralize algorithmic discrimination more efficiently.

Trapped Before Clicking Enter: Digital Inequality and Search Engine Autocomplete Algorithmic Bias

Trapped in the Search Box: an Examination of Algorithmic Bias in Search Engine Autocomplete Predictions

Finding the white male: The prevalence and consequences of algorithmic gender and race bias in political Google searches

Examining Racial Stereotypes in YouTube Autocomplete Suggestions

Detecting race and gender bias in visual representation of AI on web search engines

Towards More Accountable Search Engines: Online Evaluation of Representation Bias

Algorithmic discrimination: examining its types and regulatory measures with emphasis on US legal practices

Propagation of societal gender inequality by internet search algorithms

Beyond Algorithmic Bias: A Socio-Computational Interrogation of the Google Search by Image Algorithm

Algorithmic amplification of biases on Google Search

Algorithmic Bias? An Empirical Study of Apparent Gender-Based Discrimination in the Display of STEM Career Ads

Examining bias perpetuation in academic search engines: an algorithm audit of Google and Semantic Scholar

A comparison of online search engine autocompletion in Google and Baidu

Cognitively Biased Users Interacting with Algorithmically Biased Results in Whole-Session Search on Debated Topics

Can Algorithm Knowledge Stop Women from Being Targeted by Algorithm Bias? The New Digital Divide on Weibo

Ethics and discrimination in artificial intelligence-enabled recruitment practices

Fairness in Image Search: A Study of Occupational Stereotyping in Image Retrieval and its Debiasing

Wide range screening of algorithmic bias in word embedding models using large sentiment lexicons reveals underreported bias types

Algorithms that "Don't See Color": Comparing Biases in Lookalike and Special Ad Audiences

National Origin Discrimination in Deep-learning-powered Automated Resume Screening

Evaluation Metrics for Measuring Bias in Search Engine Results