Named Entity Location Prediction Combining Twitter and Web

Yinan Liu,Wei Shen,Zonghai Yao,Jianyong Wang,Zhenglu Yang,Xiaojie Yuan
DOI: https://doi.org/10.1109/tkde.2020.2973261
IF: 9.235
2020-01-01
IEEE Transactions on Knowledge and Data Engineering
Abstract:Knowledge bases are critical to many applications. However, they are greatly incomplete. Enriching knowledge bases with new entities and new location attributes becomes increasingly important. Given a named entity with tweets and Web documents where the entity appears, we aim to predict the entity city-level location combining the geographical location knowledge embedded in both Twitter and Web. This task is helpful for knowledge base enrichment and tweet location prediction. In this paper we propose NELPTW, the first unsupervised framework for Named Entity Location Prediction by leveraging the knowledge from Twitter and Web. Based on each data source, NELPTW utilizes a linear function ranking model to generate several rankings to the candidate location set for each entity. To combine the knowledge from two sources which have different reliability and importance for the location prediction, an unsupervised rank aggregation algorithm is developed to aggregate multiple rankings for each entity to obtain a better ranking. A learning algorithm based on the EM method is proposed to automatically learn the parameters of the ranking model without requiring any training labels. The experimental results over a real world Twitter and Web data set show that our framework significantly outperforms the baselines in terms of accuracy.
What problem does this paper attempt to address?