Linguistic Web: Bridging between text information sources and Semantic Web

Hao Jingmin,Liao Lejian
DOI: https://doi.org/10.1109/WCICA.2008.4593035
2008-01-01
Abstract:The goal of semantic Web is to make the computer can understand and process data which can only be shown by the current Web. But it is impossible to annotate all the huge amount of data of current Web with semantic labels during a short time. This paper proposes a concept of Linguistic Web, which is to provide a bridging between text information sources of HTML Web pages and Semantic Web. The core of the Semantic Web is ontologies. But then it is rather difficult to automatically acquire world knowledge or domain special knowledge to build ontologies at present. As compared to the difficulties in acquiring semantic knowledge based on domain special ontology, grammatical knowledge of text could be acquired easily, and the latter is more determinate than the former. In the area of Information Retrieval, it is not enough to search information only based on keywords. Under this situation should we consider some web application can employ grammatical knowledge to improve performance. Linguistic Web focuses on building a linguistic ontology, providing grammatical knowledge for web applications. A linguistic ontology based on HPSG (Head driven-Phrase Structure Grammar) was accomplished.
What problem does this paper attempt to address?