Toward integration travel information data using information extraction and instance matching

Feng Shi,Juanzi Li,Jianqiang Hu
2009-01-01
Abstract:In this paper, we introduce the method on how to integrate travel information data embedded in web pages using approaches of information extraction and instance matching. Furthermore we extend the concept of instance matching to find the connotative relationship between instances extracted from different sources in order to improve the result of integration. We extracted more than 145,000 pieces of travel data terms of sight, route, agent, hotel, restaurant and ticket from several different sources, and integrated them into a piece of travel data with comprehensive information. 1 Overview
What problem does this paper attempt to address?