TourBERT: A pretrained language model for the tourism industry

Veronika Arefieva,Roman Egger
DOI: https://doi.org/10.48550/arXiv.2201.07449
2022-01-19
Computation and Language
Abstract:The Bidirectional Encoder Representations from Transformers (BERT) is currently one of the most important and state-of-the-art models for natural language. However, it has also been shown that for domain-specific tasks it is helpful to pretrain BERT on a domain-specific corpus. In this paper, we present TourBERT, a pretrained language model for tourism. We describe how TourBERT was developed and evaluated. The evaluations show that TourBERT is outperforming BERT in all tourism-specific tasks.
What problem does this paper attempt to address?