Crowdsourcing Diverse Paraphrases for Training Task-oriented Bots

Jorge Ramírez,Auday Berro,Marcos Baez,Boualem Benatallah,Fabio Casati
DOI: https://doi.org/10.48550/arXiv.2109.09420
2021-09-20
Abstract:A prominent approach to build datasets for training task-oriented bots is crowd-based paraphrasing. Current approaches, however, assume the crowd would naturally provide diverse paraphrases or focus only on lexical diversity. In this WiP we addressed an overlooked aspect of diversity, introducing an approach for guiding the crowdsourcing process towards paraphrases that are syntactically diverse.
Computation and Language
What problem does this paper attempt to address?