Twitter part-of-speech tagging using pre-classification Hidden Markov model

Shichang Sun, Hongbo Liu, Hongfei Lin, Ajith Abraham
DOI: https://doi.org/10.1109/ICSMC.2012.6377881
2012-01-01
Abstract: Hidden Markov models (HMMs) have been widely used in natural language processing (NLP), especially in syntactic-level applications, which appear naturally as short-range-dependent sequence recognition problems. However, the structure of an HMM limits the use of global knowledge, such as the sentiment analysis of the text, which has become an increasingly popular research topic in NLP. In this paper, we propose a novel treatment of the HMM that uses the result of sentiment subjectivity analysis in a syntactic-level task, namely part-of-speech (POS) tagging. The subjectivity information is introduced as a pre-classification procedure into the interval-type HMM. The subjectivity degree of the test sentence is used as a combination factor to choose an appropriate value from the interval. Experimental results on public tagging data sets show that the proposed approach improves the performance of POS tagging.
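The abstract does not give the formula used to choose a value from the interval, so the following is only a minimal sketch of one plausible reading. It assumes that each HMM parameter is stored as a (lower, upper) pair, that the subjectivity degree in [0, 1] linearly interpolates between the two bounds, and that each row is renormalized afterwards. The function names, the two-tag model, and all numbers are hypothetical toy values, not taken from the paper.

```python
from typing import Dict, List, Tuple

Interval = Tuple[float, float]  # (lower, upper) bound on a probability


def pick_from_intervals(row: Dict[str, Interval], degree: float) -> Dict[str, float]:
    """Assumed rule: interpolate inside each interval, then renormalize the row."""
    raw = {k: lo + degree * (hi - lo) for k, (lo, hi) in row.items()}
    total = sum(raw.values())
    return {k: v / total for k, v in raw.items()}


def viterbi(words: List[str], tags: List[str], start, trans, emit) -> List[str]:
    """Standard Viterbi decoding over point-valued HMM parameters."""
    # Each column maps tag -> (best path probability, previous tag).
    best = [{t: (start[t] * emit[t].get(words[0], 1e-6), None) for t in tags}]
    for w in words[1:]:
        col = {}
        for t in tags:
            prev, score = max(
                ((p, best[-1][p][0] * trans[p][t]) for p in tags),
                key=lambda x: x[1],
            )
            col[t] = (score * emit[t].get(w, 1e-6), prev)
        best.append(col)
    # Follow the backpointers from the best final tag.
    tag = max(tags, key=lambda t: best[-1][t][0])
    path = [tag]
    for col in reversed(best[1:]):
        tag = col[tag][1]
        path.append(tag)
    return path[::-1]


# Hypothetical interval-type model with two tags (noun, verb).
TAGS = ["N", "V"]
START = {"N": (0.5, 0.7), "V": (0.3, 0.5)}
TRANS = {
    "N": {"N": (0.2, 0.4), "V": (0.6, 0.8)},
    "V": {"N": (0.6, 0.8), "V": (0.2, 0.4)},
}
EMIT = {
    "N": {"love": (0.1, 0.5), "dogs": (0.5, 0.9)},
    "V": {"love": (0.5, 0.9), "dogs": (0.1, 0.5)},
}


def tag_sentence(words: List[str], degree: float) -> List[str]:
    """Pre-classification step: the subjectivity degree fixes the parameters."""
    start = pick_from_intervals(START, degree)
    trans = {t: pick_from_intervals(TRANS[t], degree) for t in TAGS}
    emit = {t: pick_from_intervals(EMIT[t], degree) for t in TAGS}
    return viterbi(words, TAGS, start, trans, emit)
```

In this sketch, `degree` would come from a separate subjectivity classifier run on the sentence before tagging; a call such as `tag_sentence(["dogs", "love", "dogs"], 0.5)` then decodes with the interpolated parameters. Other combination rules are equally compatible with the abstract's wording.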