Semantic Analysis of Entity Contexts towards Open Named Entity Classification on the Web

Xuan-Hieu Phan,Susumu Horiguchi,Le-Minh Nguyen,Cam-Tu Nguyen
2007-01-01
Abstract:This paper introduces the use of Latent Semantic Analysis (LSA) to uncover se- mantic structures/concepts hidden in en- tity contexts towards improving named en- tity recognition (NER) on the Web. The underlying idea of the paper is that words surrounding entities of the same category are potentially related to each other in one way or another. Analyzing such relations helps build implicit concepts around entity types, making entity contexts more dis- criminative, and avoiding data sparsity for a better classification. Our experiments on a Web data collection of entity contexts show that semantic analysis can give a sig- nificant error reduction.
What problem does this paper attempt to address?