Fuzzy Matching for Tree-Based Machine Translation

XIONG Hao,LIU Yang,LIU Qun
DOI: https://doi.org/10.3969/j.issn.1003-0077.2011.02.012
2011-01-01
Abstract:Previous related work of tree-based models treat rules as strings and then match rules using string matching algorithm.However,the performance of tree-based models is largely dependent on the parsing results,and for some languages,the precision of current parser is still far from state-of-the-art.So two rules with one different tag causing by parsing errors seems to be unmatchable.Under exact matching strategy,the size of available rules is implicitly scarce especially in tree-to-tree models,in which the performance is still unacceptable.In this paper,we present a tree kernel based fuzzy matching algorithm which computes the similarity between different rules.Experimental results on NIST 2005 Chinese-to-English test set show that our system achieve an absolute improvement of 1.3% in term of BLEU score over string matching system.Furthermore,when using the packed forest,our method still gets a relative improvement of 0.7 BLEU score.
What problem does this paper attempt to address?