Internet-oriented Chinese New Words Detection

邹纲,刘洋,刘群,孟遥,于浩,西野文人,亢世勇
DOI: https://doi.org/10.3969/j.issn.1003-0077.2004.06.001
2004-01-01
Abstract:With the fast development of the society,more and more new words come out in our life. It is one of the important topics in Chinese natural language processing to collect those new words. A method is presented for detecting these new words automaitcally in this paper. Through analysing webpages grabbed from the Internet, a large word and string set is built, which new words are detected from and filtered by rules. At last new words which exist in the webpages grabbed are extracted. The system built in this way can find new words in any length and in any field.Now it is applying to the compilation of Modern Chinese New Word Information Dictionary. It reduced human labor a lot in practise.
What problem does this paper attempt to address?