Corpus Based Part-of-speech Tagging.

Chengyao Lv, Huihua Liu, Yuanxing Dong, Yunliang Chen
DOI: https://doi.org/10.1007/s10772-016-9356-2
2016-01-01
International Journal of Speech Technology
Abstract: Text corpora tagged with part-of-speech (POS) information are useful in many areas of linguistic research. This paper proposes a Genetic Expression Programming (GEP) model for POS tagging. GEP is used to search for appropriate structures in function space. After evolving sequences of tags, GEP can find the best individual as the solution. Before simulation, a set of appropriate algorithm parameters is fitted. Experiments on the Brown Corpus show that the proposed model can achieve a higher accuracy rate than the Genetic Algorithm model and the HMM model.