Abstract:This study addresses the question of whether there is anything special about learning a third language, as compared to learning a second language, just by virtue of the third language being the third language acquired, and independently of the specific properties of the third language. We used computational modeling to explore this question for the learning of a small vocabulary of some 400 words, with English as L1, German or Mandarin as L2, and Mandarin and alternatively Dutch, as L3. For computational modeling, we made use of the mathematical framework of linear discriminative learning, which we extended with the learning rule of Widrow-Hoff to enable the modeling of incremental learning of the mappings between form and meaning when words' meanings are represented by vectors of real numbers (embeddings) rather than by abstract symbolic units. A series of simulation experiments covering single-language learning, bilingual learning, and finally trilingual learning, clarified that within the framework of discrimination learning, within-language homophones give rise to frailty in comprehension that in turn for production gives rise to semantic errors in L1, and language intrusions in L2 and L3. Our model correctly predicts production to lag behind comprehension in learning, and it clarified that, within the boundaries of discrimination learning, the properties of the L3 crucially determine whether L3 learning appears to involve a language that is `dormant' with respect to L1 and L2. Qualitatively surprisingly different patterns of acquisition of the L3, and its interactions with L1 and L2, can arise in our simulations without any changes in the mathematics driving learning. Our simulations also show that when words' forms incorporate not only segmental but also suprasegmental information, the nature of errors that arise in production changes. In the general discussion, we reflect on the implications of our findings for the question of what is special about multilingualism.

Subspace Chronicles: How Linguistic Information Emerges, Shifts and Interacts during Language Model Training

Interpretability of Language Models via Task Spaces

Probing Linguistic Information For Logical Inference In Pre-trained Language Models

Investigating semantic subspaces of Transformer sentence embeddings through linear structural probing

Representations as Language: An Information-Theoretic Framework for Interpretability

Information-Restricted Neural Language Models Reveal Different Brain Regions' Sensitivity to Semantics, Syntax and Context

The Semantic Hub Hypothesis: Language Models Share Semantic Representations Across Languages and Modalities

A Latent-Variable Model for Intrinsic Probing

Probing Pretrained Language Models for Lexical Semantics

Spectral Probing

Integrating Linguistic Theory and Neural Language Models

Finding Structure in Language Models

Understanding language-elicited EEG data by predicting it from a fine-tuned language model

Pixology: Probing the Linguistic and Visual Capabilities of Pixel-based Language Models

On the Linguistic Representational Power of Neural Machine Translation Models

Can Linguistic Knowledge Improve Multimodal Alignment in Vision-Language Pretraining?

Linguistic Frameworks Go Toe-to-Toe at Neuro-Symbolic Language Modeling

One Model to Rule them all: Multitask and Multilingual Modelling for Lexical Analysis

Bilingual and multilingual mental lexicon: a modeling study with Linear Discriminative Learning

The Grammar-Learning Trajectories of Neural Language Models

Natural Language Multitasking: Analyzing and Improving Syntactic Saliency of Hidden Representations