Studies of Traditional Chinese Poet Identification Based on Machine Learning

ZHENG Yan, HE Zhong-shi, LI Liang-yan
2007-01-01
Abstract: Using the Naïve Bayes machine learning method, this paper proposes an author identification algorithm for traditional Chinese poetry that distinguishes poems by Li Bai from poems by Du Fu. The classifier is learned from classical Chinese poems of the Tang dynasty. Poetic text is represented with the vector space model, and feature subset selection is performed using information gain and a hill-climbing strategy. The method achieves a satisfactory accuracy of 98.3%. We also propose a style identification model for traditional Chinese poetry built with the same methods, which achieves an accuracy of 88.5%. Together, these results constitute a text mining method for traditional Chinese poetry. This research project is supported by the Chinese National Science Fund.
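The pipeline named in the abstract (vector space representation, information gain ranking, hill-climbing feature selection, Naïve Bayes classification) can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract does not say which Naïve Bayes variant, tokenization, or search procedure was used, so the sketch assumes a Bernoulli model with Laplace smoothing, binary presence features, and a greedy forward search scored on a held-out set. The function names and the toy data (single letters standing in for characters, classes "A" and "B" standing in for two poets) are hypothetical.

```python
import math
from collections import Counter


def info_gain(docs, labels, feature):
    """Information gain (in bits) of a binary presence feature for the label."""
    def entropy(ls):
        n = len(ls)
        return -sum(c / n * math.log2(c / n) for c in Counter(ls).values()) if n else 0.0

    present = [l for d, l in zip(docs, labels) if feature in d]
    absent = [l for d, l in zip(docs, labels) if feature not in d]
    n = len(labels)
    return (entropy(labels)
            - len(present) / n * entropy(present)
            - len(absent) / n * entropy(absent))


def train_nb(docs, labels, features):
    """Assumed variant: Bernoulli Naive Bayes with Laplace smoothing."""
    model = {}
    for c in sorted(set(labels)):
        class_docs = [d for d, l in zip(docs, labels) if l == c]
        prior = len(class_docs) / len(docs)
        probs = {f: (sum(f in d for d in class_docs) + 1) / (len(class_docs) + 2)
                 for f in features}
        model[c] = (prior, probs)
    return model


def predict(model, doc):
    """Return the class with the highest log posterior score."""
    def score(c):
        prior, probs = model[c]
        return math.log(prior) + sum(
            math.log(p if f in doc else 1 - p) for f, p in probs.items())
    return max(model, key=score)


def accuracy(model, docs, labels):
    return sum(predict(model, d) == l for d, l in zip(docs, labels)) / len(docs)


def hill_climb(train, train_y, val, val_y):
    """Greedy forward search: try features in order of decreasing information
    gain and keep one only if held-out accuracy strictly improves."""
    vocab = sorted(set().union(*train),
                   key=lambda f: (-info_gain(train, train_y, f), f))
    selected = []
    best = accuracy(train_nb(train, train_y, selected), val, val_y)
    for f in vocab:
        acc = accuracy(train_nb(train, train_y, selected + [f]), val, val_y)
        if acc > best:
            selected, best = selected + [f], acc
    return selected, best


# Synthetic toy corpus: each document is the set of tokens it contains.
train = [set("abx"), set("acx"), set("ab"), set("dex"), set("df"), set("efx")]
train_y = ["A", "A", "A", "B", "B", "B"]
val = [set("ac"), set("ab"), set("de"), set("fx")]
val_y = ["A", "A", "B", "B"]

selected, best = hill_climb(train, train_y, val, val_y)
print(selected, best)
```

Ranking by information gain first keeps the search cheap, since each candidate feature is tried once in a fixed order. A real corpus would need a separate test set, because the held-out set used to guide the search gives an optimistic accuracy estimate.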