Abstract: Whether people's experience in the world shapes conceptual representation and lexical semantics is a longstanding question. Word-association, feature-listing, and similarity-rating tasks aim to address it but require subjective interpretation of the latent dimensions they identify. In this study, we introduce a supervised representational-alignment method that (i) determines whether two groups of individuals share the same basis for a given category, and (ii) explains in what respects they differ. Applying this method, we show that congenital blindness induces conceptual reorganization in both amodal and sensory-related verbal domains, and we identify the associated semantic shifts. We first apply supervised feature pruning to the word embeddings of a language model (GloVe) to optimize the prediction of human similarity judgments. Pruning identifies one subset of retained GloVe features that best predicts judgments made by sighted individuals and another subset that best predicts judgments made by blind individuals. A linear probing analysis then interprets the latent semantics of these feature subsets by learning a mapping from the retained GloVe features to 65 interpretable semantic dimensions. We apply this approach to seven semantic domains, including verbs related to motion, sight, and touch, and amodal verbs related to knowledge acquisition. We find that blind individuals more strongly associate social and cognitive meanings with verbs related to motion or with verbs communicating non-speech vocal utterances (e.g., whimper, moan). Conversely, for amodal verbs, their representations carry much sparser information. Finally, for some verbs, the representations of blind and sighted individuals are highly similar. The study presents a formal approach for studying interindividual differences in word meaning and the first demonstration of how blindness affects the conceptual representation of everyday verbs.
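The two-stage pipeline described in the abstract (supervised pruning of embedding features against similarity judgments, followed by a linear probe onto interpretable dimensions) can be illustrated with a minimal sketch. The abstract does not specify the pruning algorithm, the similarity measure, or the probe's regularization, so everything below is an assumption made for illustration: greedy backward elimination, cosine similarity scored by Pearson correlation, a ridge-regularized least-squares probe, and synthetic random data standing in for GloVe vectors, human judgments, and the 65 semantic dimensions. The function names are hypothetical.

```python
import numpy as np

def cosine_sim_matrix(X):
    """Pairwise cosine similarity between the rows of X."""
    norms = np.linalg.norm(X, axis=1, keepdims=True)
    Xn = X / np.clip(norms, 1e-12, None)
    return Xn @ Xn.T

def fit_score(X, human_sim, keep):
    """Pearson correlation between model and human similarities,
    computed over the upper triangle (unique word pairs)."""
    iu = np.triu_indices(X.shape[0], k=1)
    model = cosine_sim_matrix(X[:, keep])[iu]
    return np.corrcoef(model, human_sim[iu])[0, 1]

def prune_features(X, human_sim, min_features=2):
    """Greedy backward elimination (an assumed stand-in for the paper's
    pruning step): repeatedly drop the single feature whose removal most
    improves the fit, and stop when no removal helps."""
    keep = list(range(X.shape[1]))
    best = fit_score(X, human_sim, keep)
    while len(keep) > min_features:
        scores = [fit_score(X, human_sim, keep[:i] + keep[i + 1:])
                  for i in range(len(keep))]
        i = int(np.argmax(scores))
        if scores[i] <= best:
            break
        best = scores[i]
        del keep[i]
    return keep, best

def linear_probe(X_retained, Y, ridge=1e-3):
    """Ridge-regularized least-squares map from retained features
    to interpretable semantic dimensions."""
    d = X_retained.shape[1]
    gram = X_retained.T @ X_retained + ridge * np.eye(d)
    return np.linalg.solve(gram, X_retained.T @ Y)

# Synthetic placeholder data, NOT real embeddings or judgments.
rng = np.random.default_rng(0)
n_words, n_features, n_dims = 20, 12, 65
X = rng.normal(size=(n_words, n_features))        # stand-in for word vectors
human_sim = cosine_sim_matrix(X[:, :4])           # "judgments" driven by 4 features
human_sim = human_sim + rng.normal(scale=0.05, size=human_sim.shape)
Y = rng.normal(size=(n_words, n_dims))            # stand-in for semantic-dimension ratings

full_score = fit_score(X, human_sim, list(range(n_features)))
keep, best = prune_features(X, human_sim)
W = linear_probe(X[:, keep], Y)

print("retained features:", keep)
print("fit before / after pruning:", round(full_score, 3), round(best, 3))
print("probe weight matrix shape:", W.shape)
```

Running the pruning once per participant group (for example, once against judgments from sighted participants and once against judgments from blind participants) would yield two retained feature subsets, and comparing the probe weights for each subset is one way to read off how the groups differ. A realistic analysis would also score the fit on held-out word pairs rather than on the pairs used for pruning, which this sketch omits for brevity.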
