Abstract:Large language models (LLMs) have learned vast amounts of factual knowledge through self-supervised pre-training on large-scale corpora. Meanwhile, LLMs have also demonstrated excellent multilingual capabilities, which can express the learned knowledge in multiple languages. However, the knowledge storage mechanism in LLMs still remains mysterious. Some researchers attempt to demystify the factual knowledge in LLMs from the perspective of knowledge neurons, and subsequently discover language-agnostic knowledge neurons that store factual knowledge in a form that transcends language barriers. However, the preliminary finding suffers from two limitations: 1) High Uncertainty in Localization Results. Existing study only uses a prompt-based probe to localize knowledge neurons for each fact, while LLMs cannot provide consistent answers for semantically equivalent queries. Thus, it leads to inaccurate localization results with high uncertainty. 2) Lack of Analysis in More Languages. The study only analyzes language-agnostic knowledge neurons on English and Chinese data, without exploring more language families and languages. Naturally, it limits the generalizability of the findings. To address aforementioned problems, we first construct a new benchmark called Rephrased Multilingual LAMA (RML-LAMA), which contains high-quality cloze-style multilingual parallel queries for each fact. Then, we propose a novel method named Multilingual Integrated Gradients with Uncertainty Estimation (MATRICE), which quantifies the uncertainty across queries and languages during knowledge localization. Extensive experiments show that our method can accurately localize language-agnostic knowledge neurons. We also further investigate the role of language-agnostic knowledge neurons in cross-lingual knowledge editing, knowledge enhancement and new knowledge injection.

Converging to a Lingua Franca: Evolution of Linguistic Regions and Semantics Alignment in Multilingual Large Language Models

Language-Specific Neurons: The Key to Multilingual Capabilities in Large Language Models

Unveiling Linguistic Regions in Large Language Models

How do Large Language Models Handle Multilingualism?

Unraveling Babel: Exploring Multilingual Activation Patterns of LLMs and Their Applications

Probing the Emergence of Cross-lingual Alignment during LLM Training

Revealing the Parallel Multilingual Learning within Large Language Models

Unveiling A Core Linguistic Region in Large Language Models

Contextual Feature Extraction Hierarchies Converge in Large Language Models and the Brain

Getting More from Less: Large Language Models are Good Spontaneous Multilingual Learners

One Mind, Many Tongues: A Deep Dive into Language-Agnostic Knowledge Neurons in Large Language Models

Sharing Matters: Analysing Neurons Across Languages and Tasks in LLMs

Do Large Language Models Mirror Cognitive Language Processing?

The Rise and Down of Babel Tower: Investigating the Evolution Process of Multilingual Code Large Language Model

Beyond English-Centric LLMs: What Language Do Multilingual Language Models Think in?

Are Structural Concepts Universal in Transformer Language Models? Towards Interpretable Cross-Lingual Generalization

Probing Multimodal Large Language Models for Global and Local Semantic Representations

Large Language Models as Neurolinguistic Subjects: Identifying Internal Representations for Form and Meaning

Exploring the LLM Journey from Cognition to Expression with Linear Representations

A Survey on Large Language Models with Multilingualism: Recent Advances and New Frontiers

Concept Space Alignment in Multilingual LLMs