Method of Recognizing Unknown Words by Building Single-Word Dictionary

Tong YU, Shufen LIU
DOI: https://doi.org/10.13413/j.cnki.jdxblxb.2015.02.29
2015-01-01
Abstract: Chinese word segmentation is an important task in information processing. Current Chinese word segmentation technology relies mainly on a common-word dictionary, but such a dictionary cannot recognize unknown words. The authors propose a double-dictionary method for recognizing unknown words: a common-word dictionary and a single-word dictionary are built and then combined for segmentation, which addresses the inefficiency in recognizing unknown words. As a result, the accuracy rate can exceed 90%.
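The abstract does not spell out how the two dictionaries are combined, so the following is only a minimal sketch of one plausible reading, not the authors' algorithm. It assumes forward maximum matching against the common-word dictionary, and it assumes the single-word dictionary lists characters that can stand alone as words, so that any run of leftover characters outside both dictionaries is merged into one unknown-word candidate. The function name `segment`, the `max_len` parameter, the tag names, and the toy dictionaries are all hypothetical.

```python
def segment(text, common_words, single_words, max_len=4):
    """Segment text with two dictionaries (illustrative sketch only).

    common_words: set of multi-character words (assumed common-word dictionary).
    single_words: set of characters that may stand alone as words
                  (assumed role of the single-word dictionary).
    Returns a list of (token, tag) pairs, tag in {"known", "single", "unknown"}.
    """
    tokens = []
    i = 0
    while i < len(text):
        # Forward maximum matching: try the longest candidate first.
        for size in range(min(max_len, len(text) - i), 1, -1):
            candidate = text[i:i + size]
            if candidate in common_words:
                tokens.append((candidate, "known"))
                i += size
                break
        else:
            # No multi-character match: classify the single character.
            ch = text[i]
            tag = "single" if ch in single_words else "unknown"
            tokens.append((ch, tag))
            i += 1

    # Merge adjacent leftover characters into one unknown-word candidate.
    merged = []
    for token, tag in tokens:
        if tag == "unknown" and merged and merged[-1][1] == "unknown":
            merged[-1] = (merged[-1][0] + token, "unknown")
        else:
            merged.append((token, tag))
    return merged


# Toy dictionaries, invented for illustration.
common = {"大学", "学习"}
single = {"我", "在"}
print(segment("我在吉林大学学习", common, single))
```

In this toy run, "吉" and "林" appear in neither dictionary, so they are joined into the candidate "吉林", while "我" and "在" stay as standalone words. A real system would need a far larger dictionary and some way to validate the candidates.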