Structure Characteristic Analysis and Automatic Extraction for Korean Noun Phrase

Shuaifei AN, Yude BI
DOI: https://doi.org/10.3969/j.issn.1003-0077.2013.05.030
2013-01-01
Abstract: In recent years, the noun phrase, as a common grammatical phenomenon, has attracted the attention of many scholars in the field of language processing. At present, most research on noun phrases focuses on boundary identification, grammatical analysis, semantic analysis, categorization, and related aspects. This paper extracts noun phrases from a large-scale tagged corpus by studying and analyzing the rules governing the left and right boundaries of noun phrases in written Korean. The experimental results show that high-frequency noun phrases fall mainly into 8 categories. Different kinds of noun phrase corpora can be built from the extraction results, which lays the foundation for building a parallel corpus. The results should also support machine translation, information retrieval, and other future work in language information processing.