Study of Word-Based Chinese Document Experimental System and Chinese Free-Text Information Extraction Experiment Based on It

Qian Liu, Hui Jiao, Hui-bo Jia
DOI: https://doi.org/10.1109/icnc.2007.688
2007-01-01
Abstract: This paper presents a word-based Chinese document experimental system intended to put Chinese information processing technology on a more reliable and more efficient basis. The system implements a document storage format and a document processing format, both based on the smallest information carrier: the Chinese word. An information extraction (IE) algorithm with a two-step strategy for Chinese free text is then introduced. Using the document system as the experimental platform and the abstracts of Chinese Sci_Tech journals as the free text, an IE experiment was conducted and gave good results: precision P is 95.03%, recall R is 91.40%, and the F-value is 93.18%. The experimental results suggest that the word-based Chinese document system can help move Chinese information processing technology toward more advanced application stages.
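The abstract does not say which F-measure was used. As a minimal sketch, assuming the F-value is the balanced F-measure (the harmonic mean of P and R), the reported figures can be checked as follows. The function name `f_value` is hypothetical and chosen only for illustration.

```python
def f_value(precision: float, recall: float) -> float:
    """Balanced F-measure: harmonic mean of precision and recall.

    Assumption: the abstract's "F-value" is this balanced form;
    the source does not state the weighting explicitly.
    """
    if precision + recall == 0:
        return 0.0  # avoid division by zero when both inputs are zero
    return 2 * precision * recall / (precision + recall)


# Figures reported in the abstract, expressed as fractions.
p = 0.9503  # precision P
r = 0.9140  # recall R

f = f_value(p, r)
print(f"F-value: {f:.2%}")  # prints "F-value: 93.18%"
```

Under that assumption the computed value matches the reported 93.18%, which is consistent with a balanced F-measure.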