Evaluating and Mitigating Linguistic Discrimination in Large Language Models

Guoliang Dong,Haoyu Wang,Jun Sun,Xinyu Wang
2024-05-10
Abstract:By training on text in various languages, large language models (LLMs) typically possess multilingual support and demonstrate remarkable capabilities in solving tasks described in different languages. However, LLMs can exhibit linguistic discrimination due to the uneven distribution of training data across languages. That is, LLMs are hard to keep the consistency of responses when faced with the same task but depicted in different languages.
Computation and Language,Artificial Intelligence,Cryptography and Security,Software Engineering
What problem does this paper attempt to address?