Optimization of Chinese Word Segmentation in Named Entity Recognition and Word AIignment

Cun-yan YIN,Shu-jian HUANG,Xin-yu DAI,Jia-jun CHEN
DOI: https://doi.org/10.3969/j.issn.0372-2112.2015.08.003
2015-01-01
Abstract:Bilingual named entity recognition and alignment are important for many natural language processing.Named enti-ty translation can improve a lot the performance of the system like statistical machine translation or cross-language information re-trieval.Quality of Chinese word segmentation does have a big impact over named entity (NE)recognition and bilingual NE extrac-tion.Bilingual alignment information provides indications for NE recognition and word segmentation.Accordingly,based on the characteristics of NE recognition,NE alignment,and word segmentation,this paper proposes an optimization algorithm of Chinese word segmentation.By correcting word segmentation error and adjusting word segmentation granularity,the optimization algorithm can enhance extraction effect of Chinese-English NE translation and performance of statistical machine translation.The experimental result on Chinese-English news corpus shows the efficiency of our algorithm.
What problem does this paper attempt to address?