Abstract:This paper provides a comprehensive survey of the latest research on multilingual large language models (MLLMs). MLLMs not only are able to understand and generate language across linguistic boundaries, but also represent an important advancement in artificial intelligence. We first discuss the architecture and pre-training objectives of MLLMs, highlighting the key components and methodologies that contribute to their multilingual capabilities. We then discuss the construction of multilingual pre-training and alignment datasets, underscoring the importance of data quality and diversity in enhancing MLLM performance. An important focus of this survey is on the evaluation of MLLMs. We present a detailed taxonomy and roadmap covering the assessment of MLLMs' cross-lingual knowledge, reasoning, alignment with human values, safety, interpretability and specialized applications. Specifically, we extensively discuss multilingual evaluation benchmarks and datasets, and explore the use of LLMs themselves as multilingual evaluators. To enhance MLLMs from black to white boxes, we also address the interpretability of multilingual capabilities, cross-lingual transfer and language bias within these models. Finally, we provide a comprehensive review of real-world applications of MLLMs across diverse domains, including biology, medicine, computer science, mathematics and law. We showcase how these models have driven innovation and improvements in these specialized fields while also highlighting the challenges and opportunities in deploying MLLMs within diverse language communities and application scenarios. We listed the paper related in this survey and publicly available at <a class="link-external link-https" href="https://github.com/tjunlp-lab/Awesome-Multilingual-LLMs-Papers" rel="external noopener nofollow">this https URL</a>.

Evaluating and Mitigating Linguistic Discrimination in Large Language Models

Benchmarking Linguistic Diversity of Large Language Models

A Survey on Multilingual Large Language Models: Corpora, Alignment, and Bias

Do Multilingual Large Language Models Mitigate Stereotype Bias?

How do Large Language Models Handle Multilingualism?

Getting More from Less: Large Language Models are Good Spontaneous Multilingual Learners

A Survey on Evaluation of Large Language ModelsJust Accepted

A Survey on Large Language Models with Multilingualism: Recent Advances and New Frontiers

A Survey on Evaluation of Large Language Models

Mitigating the Bias of Large Language Model Evaluation

Exploring Accuracy-Fairness Trade-off in Large Language Models

Guardians of Discourse: Evaluating LLMs on Multilingual Offensive Language Detection

Converging to a Lingua Franca: Evolution of Linguistic Regions and Semantics Alignment in Multilingual Large Language Models

Locating and Mitigating Gender Bias in Large Language Models

Multilingual Large Language Models: A Systematic Survey

A Survey on Fairness in Large Language Models

Unveiling Linguistic Regions in Large Language Models

LLM for Everyone: Representing the Underrepresented in Large Language Models

Is Translation All You Need? A Study on Solving Multilingual Tasks with Large Language Models