A Hybrid Model for Computational Morphology Application

Xu Yang, Wang Hou-Feng
DOI: https://doi.org/10.1109/snpd.2007.34
2007-01-01
Abstract: Computational morphology is a core component of many natural language processing tasks, such as alignment techniques. This paper describes a method for morphological processing. A lemmatizer based on both rules and statistical models is constructed to analyze English inflectional morphology and automatically derive the lemmas of words. The rule model incorporates data from various corpora, machine-readable dictionaries, and an empirical metamorphose rule set; the statistical model mainly applies the maximum entropy principle to handle unknown words and ambiguous cases effectively. The knowledge used in the lemmatizer is easy to update, which supports the ongoing development of natural language processing. Experiments show that the lemmatizer has wide coverage and high accuracy.
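The hybrid design described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the word lists, suffix rules, and feature weights below are hypothetical toy values chosen for illustration, and the log-linear scorer only stands in for a trained maximum entropy model. Dictionary lookup and suffix rules handle known words, and the scorer ranks candidate lemmas for unknown or ambiguous words.

```python
import math

# Hypothetical toy resources; the paper's dictionaries and rule set are not given.
IRREGULAR = {"went": "go", "children": "child", "better": "good"}
LEXICON = {"go", "child", "good", "study", "make", "stop", "walk", "box"}

# (suffix, replacement) pairs: strip the suffix, then append the replacement.
SUFFIX_RULES = [
    ("ies", "y"), ("ied", "y"),
    ("es", ""), ("s", ""),
    ("ing", ""), ("ing", "e"),
    ("ed", ""), ("ed", "e"),
]

# Hand-set weights standing in for trained maximum entropy parameters (assumption).
WEIGHTS = {
    ("ies", "y"): 2.0, ("ied", "y"): 2.0,
    ("s", ""): 1.0, ("es", ""): 0.8,
    ("ing", ""): 1.0, ("ing", "e"): 0.5,
    ("ed", ""): 1.0, ("ed", "e"): 0.5,
    ("ing", "undouble"): 1.2, ("ed", "undouble"): 1.2,
}


def candidates(word):
    """Generate (lemma, suffix, operation) candidates from the suffix rules."""
    out = []
    for suffix, repl in SUFFIX_RULES:
        if word.endswith(suffix) and len(word) > len(suffix) + 1:
            stem = word[: -len(suffix)] + repl
            out.append((stem, suffix, repl))
            # Undo consonant doubling, e.g. "stopped" -> "stopp" -> "stop".
            if repl == "" and len(stem) >= 3 and stem[-1] == stem[-2]:
                out.append((stem[:-1], suffix, "undouble"))
    return out


def distribution(pool):
    """Softmax over rule-feature weights: a log-linear model over candidates."""
    scores = [WEIGHTS.get((suffix, op), 0.0) for _, suffix, op in pool]
    z = sum(math.exp(s) for s in scores)
    return [math.exp(s) / z for s in scores]


def lemmatize(word):
    w = word.lower()
    if w in IRREGULAR:          # dictionary of exceptions comes first
        return IRREGULAR[w]
    if w in LEXICON:            # already a known base form
        return w
    cands = candidates(w)
    if not cands:               # no rule applies: return the word unchanged
        return w
    # Prefer candidates attested in the lexicon; otherwise score all of them.
    pool = [c for c in cands if c[0] in LEXICON] or cands
    probs = distribution(pool)
    best = max(range(len(pool)), key=lambda i: probs[i])
    return pool[best][0]


if __name__ == "__main__":
    for token in ["went", "studies", "stopped", "making", "blorping"]:
        print(token, "->", lemmatize(token))
```

In this sketch the lexicon filter resolves most cases, so the scorer only decides among candidates when a word is unknown or when several candidates are attested. Because the resources are plain data tables, they can be extended without touching the control flow, which mirrors the abstract's point that the lemmatizer's knowledge is easy to update.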