Distant-Supervised Slot-Filling for E-Commerce Queries

Saurav Manchanda,Mohit Sharma,George Karypis
DOI: https://doi.org/10.1109/bigdata52589.2021.9671825
2021-12-15
Abstract:Slot-filling refers to the task of annotating individual terms in a query with the corresponding intended product characteristics (product type, brand, gender, size, color, etc.). These characteristics can then be used by a search engine to return results that better match the query’s product intent. Traditional methods for slot-filling require the availability of training data with ground truth slot-annotation information. However, generating representative labeled data, especially in big-data driven platforms like e-commerce is expensive and time consuming, given the volume and velocity of the data. In this paper, we present distant-supervised probabilistic generative models, that require no manual annotation. The proposed approaches leverage the readily available historical queries and their subsequent transaction logs, and also exploit co-occurrence information among the slots in order to identify intended product characteristics. We evaluate our approaches by considering both how they affect retrieval performance, as well as how well they classify the slots. In terms of retrieval, our approaches achieve better ranking performance (up to 156%) over Okapi BM25. Moreover, our approach that leverages co-occurrence information leads to better performance than the one that does not on both the retrieval and slot classification tasks.
What problem does this paper attempt to address?