A Kalman Filter Based Human-Computer Interactive Word Segmentation System For Ancient Chinese Texts

Tongfei Chen,Weimeng Zhu,Xueqiang Lv,Junfeng Hu
DOI: https://doi.org/10.1007/978-3-642-41491-6_3
2013-01-01
Abstract:Previous research showed that Kalman filter based human-computer interaction Chinese word segmentation algorithm achieves an encouraging effect in reducing user interventions. This paper designs an improved statistical model for ancient Chinese texts, and integrates it with the Kalman filter based framework. An online interactive system is presented to segment ancient Chinese corpora. Experiments showed that this approach has advantage in processing domain-specific text without the support of dictionaries or annotated corpora. Our improved statistical model outperformed the baseline model by 30% in segmentation precision.
What problem does this paper attempt to address?