MCHPT: A Weakly Supervise Based Merchant Pre-trained Model.

Zehua Zeng, Xiaohan She, Xuetao Qiu, Hongfeng Chai, Yanming Yang
DOI: https://doi.org/10.1007/978-981-99-1639-9_37
2022-01-01
Abstract: In the last few years, pre-trained models (PTMs) have become the foundation of downstream natural language processing tasks. The large-scale corpora used in pre-training carry abundant latent semantic knowledge, which lets the model learn the semantics of language. However, the general masked language model is not suitable for corpora that contain a lot of irrelevant and noisy semantics, such as merchant information. In our merchant system, we have collected information on millions of merchants, including merchant names and addresses. To deal with this kind of short and noisy corpus and to incorporate multi-source external information into the model, we propose a weakly supervised merchant pre-trained model, called MCHPT, that learns representations of merchant-language. The model is pre-trained with our designed pre-training tasks on a large-scale, weakly supervised, real-world merchant dataset. Experimental results show that our model outperforms state-of-the-art pre-trained language models on four downstream merchant-related tasks.