CMV-BERT: Contrastive Multi-Vocab Pretraining of BERT

Wei Zhu,Daniel Cheung
2020-01-01
Abstract:In this work, we represent CMV-BERT, which improves the pretraining of a language model via two ingredients: (a) contrastive learning, which is well studied in the area of computer vision; (b) multiple vocabularies, one of which is finegrained and the other is coarse-grained. The two methods both provide different views of an original sentence, and both are shown to be beneficial. Downstream tasks demonstrate our proposed CMV-BERT are effective in improving the pretrained language models.
What problem does this paper attempt to address?