Efficient slot labelling

Vladimir Vlasov
2024-01-19
Abstract:Slot labelling is an essential component of any dialogue system, aiming to find important arguments in every user turn. Common approaches involve large pre-trained language models (PLMs) like BERT or RoBERTa, but they face challenges such as high computational requirements and dependence on pre-training data. In this work, we propose a lightweight method which performs on par or better than the state-of-the-art PLM-based methods, while having almost 10x less trainable parameters. This makes it especially applicable for real-life industry scenarios.
Computation and Language
What problem does this paper attempt to address?
The main problems that this paper attempts to solve are the efficiency and performance issues in the slot labelling task in dialogue systems. Specifically, current slot labelling methods usually rely on large pre - trained language models (PLMs), such as BERT or RoBERTa. Although these models are effective, they have the following two main problems: 1. **High computational requirements**: When these models are deployed in an industrial environment, especially when used on the device side, their high computational requirements may not be met. 2. **Dependence on pre - trained data**: The success of these models depends largely on the data used in the pre - training stage. When the type of data input by the user has not appeared in the pre - training data, the model may have difficulty in correctly matching the slots. To solve the above problems, the paper proposes a lightweight method. This method can not only achieve performance comparable to or better than existing PLM - based methods, but also has nearly 10 times fewer parameters, which makes it particularly suitable for actual industrial scenarios. Through this method, the researchers aim to provide a more efficient and adaptable solution to overcome the limitations of existing technologies.