Abstract: This paper provides a comprehensive survey of the latest research on multilingual large language models (MLLMs). MLLMs can understand and generate language across linguistic boundaries, and they represent an important advancement in artificial intelligence. We first discuss the architecture and pre-training objectives of MLLMs, highlighting the key components and methodologies that contribute to their multilingual capabilities. We then discuss the construction of multilingual pre-training and alignment datasets, underscoring the importance of data quality and diversity in enhancing MLLM performance. An important focus of this survey is the evaluation of MLLMs. We present a detailed taxonomy and roadmap covering the assessment of MLLMs' cross-lingual knowledge, reasoning, alignment with human values, safety, interpretability and specialized applications. Specifically, we extensively discuss multilingual evaluation benchmarks and datasets, and explore the use of LLMs themselves as multilingual evaluators. To move MLLMs from black boxes toward white boxes, we also address the interpretability of multilingual capabilities, cross-lingual transfer and language bias within these models. Finally, we provide a comprehensive review of real-world applications of MLLMs across diverse domains, including biology, medicine, computer science, mathematics and law. We showcase how these models have driven innovation and improvements in these specialized fields while also highlighting the challenges and opportunities in deploying MLLMs within diverse language communities and application scenarios. The papers covered in this survey are listed and publicly available at https://github.com/tjunlp-lab/Awesome-Multilingual-LLMs-Papers.

EMMA-500: Enhancing Massively Multilingual Adaptation of Large Language Models

MaLA-500: Massive Language Adaptation of Large Language Models

SambaLingo: Teaching Large Language Models New Languages

Extrapolating Large Language Models to Non-English by Aligning Languages

Bootstrapping Multilingual Semantic Parsers using Large Language Models

P-MMEval: A Parallel Multilingual Multitask Benchmark for Consistent Evaluation of LLMs

LLaMAX: Scaling Linguistic Horizons of LLM by Enhancing Translation Capabilities Beyond 100 Languages

Getting More from Less: Large Language Models are Good Spontaneous Multilingual Learners

Multilingual Machine Translation with Large Language Models: Empirical Results and Analysis

Multilingual Large Language Models: A Systematic Survey

BigTranslate: Augmenting Large Language Models with Multilingual Translation Capability over 100 Languages

SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models

Responsible Multilingual Large Language Models: A Survey of Development, Applications, and Societal Impact

A Paradigm Shift in Machine Translation: Boosting Translation Performance of Large Language Models

MindLLM: Pre-training Lightweight Large Language Model from Scratch, Evaluations and Domain Applications

PolyLM: An Open Source Polyglot Large Language Model

A Novel Paradigm Boosting Translation Capabilities of Large Language Models

McEval: Massively Multilingual Code Evaluation

Marco-LLM: Bridging Languages via Massive Multilingual Training for Cross-Lingual Enhancement

Chinese Tiny LLM: Pretraining a Chinese-Centric Large Language Model

Multilingual Large Language Model: A Survey of Resources, Taxonomy and Frontiers