Simple Flow-Based Contrastive Learning for BERT Sentence Representations

Ziyi Tian,Qun Liu,Maotao Liu,Wei Deng
DOI: https://doi.org/10.1007/978-3-031-09726-3_24
2022-01-01
Abstract:Natural language processing is a significant branch of machine learning, and pre-trained models such as BERT have been widely used in it. Previous research has shown that sentence embeddings from pre-trained language models without fine-tune have difficulty in capturing their exact semantics. The ambiguous semantics leads to poor performance on semantic text similarity (STS) tasks. However, fine-tune tends to skew the model toward high-frequency distributions due to the heterogeneous nature of word frequency and word sense distributions. Therefore, fine-tune is not a optimal choice. To address this issue, we propose an unsupervised flow-based contrastive learning model. The model maps sentence embedding distributions to smooth and isotropic Gaussian distributions, thus mitigating the impact caused by irregular word frequency distributions. To evaluate the performance of our model, we use an industry-recognized method that outperforms competing baselines in different sentence-related tasks.
What problem does this paper attempt to address?