Are Male Candidates Better than Females? Debiasing BERT Resume Retrieval System.

Sijing Zhang,Ping Li,Ziyan Cai
DOI: https://doi.org/10.1109/SMC53654.2022.9945184
2022-01-01
Abstract:Advanced language models like BERT have great performances in various Natural Language Processing tasks. However, recent researches have shown that language models can learn gender biases from corpus, and lead to discriminatory decisions and unfair allocation of resources. We proposed a measure of gender bias in BERT resume retrieval system, by performing job searches on a group of resumes with different genders but consistent abilities. We proposed to calculate the average ranking and Discounted Cumulative Gain (DCG) scores of male and female resumes, and found that men outperformed women even though the two resumes were identical except for gender. This shows BERT has gender stereotypes, and its resume retrieval systems prefer male candidates. Therefore, we also proposed a regularized debiasing method to promote gender equity. Referring to Densifier method, we can get the subspace vector encoding bias semantics through matrix transformation of word vector difference between occupational and gender words. By defining the correlation between BERT word vector and gender bias subspace as the loss term, we can remove the bias semantics in BERT, and also avoid it learning stereotypes even trained on an unfair data set. After regularized debiased, the gender ranking gap of BERT was reduced by an average of 61.8%, while DCG scores were reduced by 53.9%.
What problem does this paper attempt to address?