Character-familization and the Extraction of Unlisted Words

宋作艳
2007-01-01
Abstract:A character-family is a family consisting of all the character combinations which have one same character and are parallel one another. For example,“X 热”(旅游热、化妆热、考研热……). As one of the productive ways to the formation of new words in modern Chinese, Character-familization affects the coding mechanism of Chinese deeply. This paper has made an analysis of the characteristics, and effect of the character-family, and further more provide some method to predict and extract new words (unlisted words) automatically on the base of parallel rules.
What problem does this paper attempt to address?