A New Multiword Expression Metric and Its Applications

Fan Bu, Xiao-Yan Zhu, Ming Li
DOI: https://doi.org/10.1007/s11390-011-9410-0
2011-01-01
Abstract: Multiword Expressions (MWEs) appear frequently and ungrammatically in natural languages. Identifying MWEs in free texts is a very challenging problem. This paper proposes a knowledge-free, unsupervised, and language-independent Multiword Expression Distance (MED). The new metric is derived from an accepted physical principle, measures the distance from an n-gram to its semantics, and outperforms other state-of-the-art methods on MWEs in two applications: question answering and named entity extraction.