Trapped Before Clicking Enter Digital Inequality and Search Engine Autocomplete Algorithmic Bias

Yuxin Gao,Lin William Cong,Na Ta,Hongyao Fu,Kaiyu Li
DOI: https://doi.org/10.2139/ssrn.4484236
2023-01-01
Abstract:Ethical concerns of algorithmic bias arising from digital technologies have gained growing attention. Problematic predictions with defamation and discrimination by search engine autocomplete algorithms, however, have rarely been quantitatively investigated. This paper aims to examine the autocomplete algorithmic bias of leading search engines against three sensitive attributes: gender, race, and sexual orientation. By simulating user search query prefixes and calling search engine APIs, 106,896 autocomplete predictions were collected, and their semantic toxicity scores as measures of negative algorithmic bias were computed based on machine learning models. The roles of sensitive attributes and topic categories on algorithmic bias were examined. Results indicated that search engine autocomplete algorithmic bias was overall consistent with long-standing societal discrimination. Historically disadvantaged groups such as the female, the Black, and the homosexual people suffered higher levels of negative algorithmic bias. Moreover, the degree of algorithmic bias varies across topic categories. This paper contributes to the empirical evidence and operationalization of the nuanced algorithmic bias based on large-scale data. Concerning the particularities of autocomplete algorithmic bias, implications about the underlying mechanisms and potential consequences are discussed drawing from the theoretical perspective of digital inequality.
What problem does this paper attempt to address?