Detecting Hot Events From Web Search Logs

Yingqin Gu, Jianwei Cui, Hongyan Liu, Xuan Jiang, Jun He, Xiaoyong Du, Zhixu Li
DOI: https://doi.org/10.1007/978-3-642-14246-8_41
2010-01-01
Abstract: Detecting events from web resources is a challenging task that has attracted much attention in recent years. Web search logs are an important data source for event detection because the information they contain reflects users' activities and their interest in various real-world events. There are three major issues for event detection from web search logs: effectiveness, efficiency, and the organization of detected events. In this paper, we develop a novel Topic and Event Detection method, TED, to address these issues. We first divide the whole data set into topics for efficiency, and then incorporate link information, temporal information, and query content to ensure the quality of the detected events. Finally, the detected events are organized by the proposed interestingness measure as well as by the topics they belong to. Experiments are conducted on a commercial search engine log. The results demonstrate that our method can detect hot events effectively and efficiently and organize them meaningfully.