Unintended bias evaluation: An analysis of hate speech detection and gender bias mitigation on social media using ensemble learning
Francimaria R.S. Nascimento, George D.C. Cavalcanti, Márjory Da Costa-Abreu
DOI: https://doi.org/10.1016/j.eswa.2022.117032
IF: 8.5
2022-09-01
Expert Systems with Applications
Abstract: Hate speech on online social media platforms has reached a level that governments, media outlets, and scientists consider a serious concern, especially because it spreads easily, harms individuals and society, and is virtually impossible to tackle using human analysis alone. Automatic approaches using machine learning and natural language processing are helpful for detection. For such applications, amongst several different approaches, it is essential to investigate the systems' robustness to biases toward identity terms (gender, race, and religion, for example). In this work, we analyse gender bias in different datasets, using unintended bias evaluation metrics, and propose an ensemble learning approach based on different feature spaces for hate speech detection, with the aim that the model can learn from different abstractions of the problem. We used nine different feature spaces to train the pool of classifiers and evaluated our approach on a publicly available corpus; our results demonstrate its effectiveness compared to state-of-the-art solutions.
computer science, artificial intelligence; engineering, electrical & electronic; operations research & management science
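
The abstract describes training a pool of classifiers on different feature spaces and combining them in an ensemble. The sketch below illustrates that general idea only; it is not the authors' method. The three feature spaces (word unigrams, word bigrams, character trigrams), the nearest-centroid classifier, the majority-vote combiner, the toy training texts, and every function and class name are assumptions chosen for illustration, whereas the paper reports nine feature spaces.

```python
from collections import Counter

# Hypothetical feature spaces: each maps a text to a bag of features.
def word_unigrams(text):
    return Counter(text.lower().split())

def word_bigrams(text):
    words = text.lower().split()
    return Counter(zip(words, words[1:]))

def char_trigrams(text):
    s = text.lower()
    return Counter(s[i:i + 3] for i in range(len(s) - 2))

def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(v * b.get(k, 0) for k, v in a.items())
    norm_a = sum(v * v for v in a.values()) ** 0.5
    norm_b = sum(v * v for v in b.values()) ** 0.5
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

class CentroidClassifier:
    """Toy nearest-centroid classifier over a single feature space.

    A stand-in for whatever base learner a real system would use.
    """
    def __init__(self, featurize):
        self.featurize = featurize
        self.centroids = {}

    def fit(self, texts, labels):
        # Summed counts per class act as the centroid; cosine ignores scale.
        for text, label in zip(texts, labels):
            self.centroids.setdefault(label, Counter()).update(self.featurize(text))
        return self

    def predict(self, text):
        feats = self.featurize(text)
        return max(sorted(self.centroids),
                   key=lambda label: cosine(feats, self.centroids[label]))

def ensemble_predict(classifiers, text):
    """Majority vote over the pool (an assumed combination rule)."""
    votes = Counter(clf.predict(text) for clf in classifiers)
    return votes.most_common(1)[0][0]

# Toy data for illustration only: 1 = abusive, 0 = not abusive.
texts = ["you are awful and stupid", "i hate you all",
         "have a lovely day", "thank you my friend"]
labels = [1, 1, 0, 0]

# One classifier per feature space forms the pool.
pool = [CentroidClassifier(f).fit(texts, labels)
        for f in (word_unigrams, word_bigrams, char_trigrams)]

print(ensemble_predict(pool, "you are stupid"))
print(ensemble_predict(pool, "have a lovely day friend"))
```

Using an odd number of classifiers avoids ties in a binary majority vote. A real system would replace the toy base learner and features with trained models and would also score predictions with bias metrics on identity-term subsets, which this sketch does not attempt.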