A semantics-based method for extracting geographic scopes of texts

Yi Zhang, Xingguang Wang, Min Chen, Yu Liu
DOI: https://doi.org/10.3772/j.issn.1002-0470.2012.02.009
2012-01-01
Abstract: To process geographic information in Web pages, this paper presents a novel method for extracting the geographic scopes of documents. It assigns a multi-scale geographic scope to a document through a three-stage process that handles geographic semantics. First, the toponyms in a document are recognized with the support of a geographic knowledge base. Second, ambiguous toponyms are disambiguated using geographic and non-geographic semantics, and the evidence for disambiguation is combined using evidence theory. Finally, a geo-referenced tree is constructed based on a cognitive theory, and the geographic focuses are obtained according to semantic relationships. The geographic location of the document is thereby determined. The method was implemented in GeoSearcher, a prototype system for geographic information retrieval. The evaluation results show that the proposed method can achieve high accuracy.
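The evidence-combination step in the second stage can be illustrated with a minimal sketch. It assumes that "evidence theory" refers to Dempster-Shafer theory and that the sources are fused with Dempster's rule of combination; the abstract does not confirm either point. The toponym, the candidate referents, the variable names, and the mass values are all hypothetical and are not taken from the paper or from GeoSearcher.

```python
from itertools import product


def combine(m1, m2):
    """Fuse two mass functions with Dempster's rule of combination.

    Each mass function maps a frozenset of candidate referents to a
    mass in [0, 1]; the masses of one function sum to 1.
    """
    fused = {}
    conflict = 0.0
    for (a, mass_a), (b, mass_b) in product(m1.items(), m2.items()):
        common = a & b
        if common:
            # Agreeing evidence supports the intersection of both sets.
            fused[common] = fused.get(common, 0.0) + mass_a * mass_b
        else:
            # Disjoint sets contradict each other; track the lost mass.
            conflict += mass_a * mass_b
    if conflict >= 1.0 - 1e-12:
        raise ValueError("sources are in total conflict")
    # Renormalize so the fused masses sum to 1 again.
    return {k: v / (1.0 - conflict) for k, v in fused.items()}


# Hypothetical example: two candidate referents for the toponym "Springfield".
IL = frozenset({"Springfield, IL"})
MA = frozenset({"Springfield, MA"})
EITHER = IL | MA  # the whole frame: "cannot tell"

# Assumed geographic evidence (e.g. other toponyms nearby in the text).
m_geographic = {IL: 0.6, EITHER: 0.4}
# Assumed non-geographic evidence (e.g. surrounding context words).
m_context = {IL: 0.5, MA: 0.3, EITHER: 0.2}

fused = combine(m_geographic, m_context)
best = max((k for k in fused if len(k) == 1), key=fused.get)
print(sorted(best)[0], round(fused[best], 3))
```

In this sketch the mass left on the whole frame expresses ignorance, which lets a weak source defer to a stronger one instead of forcing a choice between candidates.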