Knowledge of cultural moral norms in large language models

Aida Ramezani,Yang Xu
2023-06-03
Abstract: Moral norms vary across cultures. A recent line of work suggests that English large language models contain human-like moral biases, but these studies typically do not examine moral variation in a diverse cultural setting. We investigate the extent to which monolingual English language models contain knowledge about moral norms in different countries. We consider two levels of analysis: 1) whether language models capture fine-grained moral variation across countries over a variety of topics such as "homosexuality" and "divorce"; 2) whether language models capture cultural diversity and shared tendencies in which topics people around the globe tend to diverge or agree on in their moral judgment. We perform our analyses with two public datasets from the World Values Survey (across 55 countries) and PEW global surveys (across 40 countries) on morality. We find that pre-trained English language models predict empirical moral norms across countries worse than the English moral norms reported previously. However, fine-tuning language models on the survey data improves inference across countries at the expense of a less accurate estimate of the English moral norms. We discuss the relevance and challenges of incorporating cultural knowledge into the automated inference of moral norms.
Computation and Language
What problem does this paper attempt to address?
The paper examines how well large English pre-trained language models (EPLMs) represent moral norms across cultures. Specifically, the study covers the following aspects:

1. **Research Background and Motivation**: Moral norms vary across cultures. Previous studies have shown that large English language models capture human-like moral biases, but they typically do not examine how moral norms vary in a culturally diverse setting. This paper therefore investigates whether monolingual English language models contain knowledge about moral norms in different countries.
2. **Research Questions**:
   - **Level 1**: Do EPLMs encode knowledge that reflects the moral norms of different countries? For example, "divorce" might be frowned upon in one country but considered acceptable in another.
   - **Level 2**: Can EPLMs infer the diversity and commonality in moral judgments across cultures? For instance, people in different countries might unanimously consider one behavior (X) immoral but hold differing views on another behavior (Y).
3. **Datasets and Methods**:
   - Two public datasets were used: the World Values Survey (WVS) and the PEW Global Attitudes Survey, which cover moral norms in 55 and 40 countries, respectively.
   - The paper evaluates how much EPLMs know about moral norms in different cultural contexts by comparing model-derived moral scores against the human ratings from the surveys.
   - The study also explores fine-tuning language models on the survey data to improve their inference of cross-cultural moral norms, and discusses the trade-offs this entails.
4. **Research Findings**:
   - EPLMs capture some knowledge about moral norms in different cultures, but they predict these norms less accurately than they predict moral norms in an English context.
   - EPLMs represent the moral norms of affluent Western countries more accurately than those of non-Western or less affluent countries.
   - Fine-tuning EPLMs on survey data improves their inference of cross-cultural moral norms, but at the cost of a less accurate estimate of English moral norms.

In summary, this paper analyzes how well English language models represent cross-cultural moral norms and proposes a fine-tuning strategy to improve them, while also discussing the limitations and challenges this approach brings.
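
The two levels of analysis above can be illustrated with a small sketch. This is not the authors' implementation. It assumes that agreement between survey ratings and model scores is measured with Pearson correlation, and that cultural diversity on a topic is measured as the standard deviation of its scores across countries. The function names (`pearson`, `fine_grained_agreement`, `topic_spread`, `diversity_agreement`), the country labels, and all numeric values are hypothetical placeholders, not data from the World Values Survey or PEW.

```python
from math import sqrt
from statistics import mean, pstdev


def pearson(xs, ys):
    """Pearson correlation between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    if vx == 0 or vy == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / sqrt(vx * vy)


def fine_grained_agreement(survey, model):
    """Level 1: correlate scores over all (country, topic) pairs."""
    keys = sorted(survey)
    return pearson([survey[k] for k in keys], [model[k] for k in keys])


def topic_spread(scores):
    """Standard deviation of each topic's scores across countries."""
    by_topic = {}
    for (_country, topic), value in scores.items():
        by_topic.setdefault(topic, []).append(value)
    return {topic: pstdev(values) for topic, values in by_topic.items()}


def diversity_agreement(survey, model):
    """Level 2: correlate per-topic spread in the survey and the model."""
    s, m = topic_spread(survey), topic_spread(model)
    topics = sorted(s)
    return pearson([s[t] for t in topics], [m[t] for t in topics])


# Hypothetical placeholder scores in [-1, 1]; higher means "more acceptable".
SURVEY = {
    ("country_A", "divorce"): 0.4, ("country_B", "divorce"): -0.6,
    ("country_A", "topic_X"): -0.9, ("country_B", "topic_X"): -0.8,
    ("country_A", "topic_Y"): 0.7, ("country_B", "topic_Y"): 0.1,
}
MODEL = {
    ("country_A", "divorce"): 0.2, ("country_B", "divorce"): -0.1,
    ("country_A", "topic_X"): -0.5, ("country_B", "topic_X"): -0.6,
    ("country_A", "topic_Y"): 0.3, ("country_B", "topic_Y"): 0.2,
}

print("level 1 correlation:", fine_grained_agreement(SURVEY, MODEL))
print("level 2 correlation:", diversity_agreement(SURVEY, MODEL))
```

In this framing, a high level 1 correlation would mean the model tracks which country finds which topic acceptable, while a high level 2 correlation would mean the model tracks which topics are contested across countries, even if its individual country estimates are off.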