Chinese Entity Detection and Tracking: the Experience in ACE

Wenjie Li,Donglei Qian,Qin Lu,Chunfa Yuan
DOI: https://doi.org/10.1142/s0219427907001718
2007-01-01
International Journal of Computer Processing Of Languages
Abstract:The work presented in this paper is motivated by the practical need for content extraction, and the available data source and evaluation benchmark from the ACE program. The Chinese entity detection and tracking task is of particular interest to us. A novel solution is proposed to alleviate the language-independent and language-dependent problems special in this task. Mention detection takes advantages of machine learning approaches and character-based models. It manipulates different types of entities being mentioned and different constitution units (i.e., extents and heads) separately. Mentions referring to the same entity are linked together by integrating most-specific-first and closest-first rule based pairwise clustering algorithms. Types of mentions and entities are determined by head-driven classification approaches. The implemented system achieves 66.1 of ACE value, which has been one of the top-tier results.
What problem does this paper attempt to address?