Transferring from Formal Newswire Domain with Hypernet for Twitter POS Tagging.

Tao Gui, Qi Zhang, Jingjing Gong, Minlong Peng, Di Liang, Keyu Ding, Xuanjing Huang
DOI: https://doi.org/10.18653/v1/d18-1275
2018-01-01
Abstract: Part-of-Speech (POS) tagging for Twitter has received considerable attention in recent years. Because most POS tagging methods are based on supervised models, they usually require a large amount of labeled data for training. However, the existing labeled datasets for Twitter are much smaller than those for newswire text. Hence, to help POS tagging for Twitter, most domain adaptation methods try to leverage newswire datasets by learning the shared features between the two domains. However, from a linguistic perspective, Twitter users not only tend to mimic the formal expressions of traditional media, like news, but they also appear to be developing linguistically informal styles. Therefore, POS tagging for the formal Twitter context can be learned together with the newswire dataset, while POS tagging for the informal Twitter context should be learned separately. To achieve this task, in this work, we propose a hypernetwork-based method to generate different parameters to separately model contexts with different expression styles. Experimental results on three different datasets show that our approach achieves better performance than state-of-the-art methods in most cases.