Text-to-hashtag Generation using Seq2Seq Learning

Augusto Camargo, Wesley Carvalho, Felipe Peressim, Alan Barzilay, Marcelo Finger
DOI: https://doi.org/10.5753/bresci.2021.15797
2021-07-18
Abstract: In this paper, we studied whether models based on BiLSTM and BERT can predict hashtags in Brazilian Portuguese for e-commerce websites. Hashtags have a sizable financial impact on e-commerce. We processed a corpus of e-commerce reviews as inputs and predicted hashtags as outputs. We evaluated the results using four quantitative metrics: NIST, BLEU, METEOR, and a crowdsourced score. A word cloud was used as a qualitative metric. While all automatically computed metrics (NIST, BLEU, and METEOR) indicated poor results, the crowdsourced evaluation produced very positive scores. We concluded that the texts predicted by the neural networks are very promising for use as hashtags for products on e-commerce websites. The code for this work is available at https://github.com/augustocamargo/text-to-hashtag.