Abstract: Linguistic analysis of language models is one way to explain and describe their reasoning, weaknesses, and limitations. Within the probing strand of model interpretability research, studies typically concern individual languages and individual linguistic structures. The question arises: are the detected regularities linguistically coherent, or do they conflict with one another at the typological scale? Moreover, most studies address a restricted set of languages and linguistic structures, leaving actual typological diversity out of scope. In this paper, we present and apply a GUI-assisted framework that allows us to easily probe a large number of languages for all the morphosyntactic features present in the Universal Dependencies data. We show that, reflecting the Anglocentric trend in NLP over the past years, most of the regularities revealed in the mBERT model are typical of Western European languages. Our framework can be integrated with existing probing toolboxes, model cards, and leaderboards, allowing practitioners to use and share their standard probing methods to interpret multilingual models. Thus we propose a toolkit to systematize the multilingual flaws in multilingual models, providing a reproducible experimental setup for 104 languages and 80 morphosyntactic features. https://github.com/AIRI-Institute/Probing_framework

Probing Classifiers: Promises, Shortcomings, and Advances

Probing the Probing Paradigm: Does Probing Accuracy Entail Task Relevance?

Does My Representation Capture X? Probe-Ably

Probing via Prompting

Probing Classifiers are Unreliable for Concept Removal and Detection

Predicting Fine-Tuning Performance with Probing

Amnesic Probing: Behavioral Explanation with Amnesic Counterfactuals

Understanding Probe Behaviors through Variational Bounds of Mutual Information

Can Prompt Probe Pretrained Language Models? Understanding the Invisible Risks from a Causal View

Knowledge Trees: Gradient Boosting Decision Trees on Knowledge Neurons as Probing Classifier

Low-Complexity Probing via Finding Subnetworks

Q-Probe: A Lightweight Approach to Reward Maximization for Language Models

Probing artificial neural networks: insights from neuroscience

A Little Confidence Goes a Long Way

Latent Causal Probing: A Formal Perspective on Probing with Causal Models of Data

Can We Use Probing to Better Understand Fine-tuning and Knowledge Distillation of the BERT NLU?

A Latent-Variable Model for Intrinsic Probing

Decoding Probing: Revealing Internal Linguistic Structures in Neural Language Models using Minimal Pairs

A Matter of Framing: The Impact of Linguistic Formalism on Probing Results

Universal and Independent: Multilingual Probing Framework for Exhaustive Model Interpretation and Evaluation

Enhancing Robustness in Biomedical NLI Models: A Probing Approach for Clinical Trials