Exploring Large Language Models on Cross-Cultural Values in Connection with Training Methodology

Minsang Kim, Seungjun Baek
2024-12-12
Abstract: Large language models (LLMs) closely interact with humans, and thus need an intimate understanding of the cultural values of human society. In this paper, we explore how open-source LLMs make judgments on diverse categories of cultural values across countries, and its relation to training methodology such as model sizes, training corpus, alignment, etc. Our analysis shows that LLMs can judge socio-cultural norms similar to humans but less so on social systems and progress. In addition, LLMs tend to judge cultural values biased toward Western culture, which can be improved with training on the multilingual corpus. We also find that increasing model size helps a better understanding of social values, but smaller models can be enhanced by using synthetic data. Our analysis reveals valuable insights into the design methodology of LLMs in connection with their understanding of cultural values.
Computation and Language, Artificial Intelligence
What problem does this paper attempt to address?
This paper explores how large language models (LLMs) perform on cross-cultural value judgments and how that performance relates to training methodology. Specifically, the research aims to answer the following key questions:

1. **How do LLMs understand and judge the cultural values of different countries?**
   - LLM judgments on sociocultural norms are similar to those of humans, but their judgments on social systems and progress are less consistent with human judgments.
2. **Are there biases in the cultural value judgments of LLMs?**
   - LLM judgments tend to be biased toward Western culture, which may be because the models are mainly trained on English corpora.
3. **How can training methods improve LLMs' understanding of different cultures?**
   - Training on multilingual corpora can improve LLMs' understanding of non-Western cultures.
   - Larger models perform better at understanding cultural values.
   - Synthetic data can help smaller models improve their cultural understanding.
   - Alignment can bring LLMs closer to human cultural judgments.

### Summary of main research findings:

- **Sociocultural norms**: LLM judgments on sociocultural norms are similar to those of humans, but their judgments on social systems and progress are less consistent with human judgments.
- **Cultural bias**: LLM judgments tend to be biased toward Western culture, but this can be improved by training on multilingual corpora.
- **Model scale**: Larger models perform better at understanding cultural values.
- **Synthetic data**: Synthetic data can enhance the cultural understanding of smaller models.
- **Alignment**: Alignment enables LLMs to better match human cultural judgments.

Through these findings, the research provides important insights for the design and improvement of LLMs, especially in cross-cultural communication and understanding.