Investigating the Contextualised Word Embedding Dimensions Responsible for Contextual and Temporal Semantic Changes

Taichi Aida,Danushka Bollegala
2024-07-03
Abstract:Words change their meaning over time as well as in different contexts. The sense-aware contextualised word embeddings (SCWEs) such as the ones produced by XL-LEXEME by fine-tuning masked langauge models (MLMs) on Word-in-Context (WiC) data attempt to encode such semantic changes of words within the contextualised word embedding (CWE) spaces. Despite the superior performance of SCWEs in contextual/temporal semantic change detection (SCD) benchmarks, it remains unclear as to how the meaning changes are encoded in the embedding space. To study this, we compare pre-trained CWEs and their fine-tuned versions on contextual and temporal semantic change benchmarks under Principal Component Analysis (PCA) and Independent Component Analysis (ICA) transformations. Our experimental results reveal several novel insights such as (a) although there exist a smaller number of axes that are responsible for semantic changes of words in the pre-trained CWE space, this information gets distributed across all dimensions when fine-tuned, and (b) in contrast to prior work studying the geometry of CWEs, we find that PCA to better represent semantic changes than ICA. Source code is available at <a class="link-external link-https" href="https://github.com/LivNLP/svp-dims" rel="external noopener nofollow">this https URL</a> .
Computation and Language
What problem does this paper attempt to address?
This paper aims to explore and analyze which dimensions are responsible for encoding the contextual and temporal semantic changes of words in the Contextualised Word Embeddings (CWEs) space. Specifically, the researchers focus on: 1. **Contextual semantic change**: By comparing pre - trained CWEs and fine - tuned SCWEs (Sense - aware Contextualised Word Embeddings), the researchers hope to understand which dimensions in these embedding vectors can capture the changes in the meaning of words in different contexts. 2. **Temporal semantic change**: In addition to contextual changes, the study also explores the semantic changes of words over time. This involves whether the meaning of words has changed at different points in time. To achieve the above goals, the researchers use Principal Component Analysis (PCA) and Independent Component Analysis (ICA) to transform CWEs, and observe and analyze how the information of semantic changes is distributed through these transformed axes. The experimental results reveal several new insights, such as: - Although there are a small number of axes responsible for the semantic changes of words in the pre - trained CWE space, after fine - tuning, this information will be scattered across all dimensions. - Compared with ICA, PCA performs better in representing semantic changes. Through these analyses, the researchers hope to develop more accurate and efficient low - dimensional semantic change detection methods.