Ontology-Driven mashup auto-completion on a data api network

Chunying Zhou,Huajun Chen,Zhipeng Peng,Yuan Ni,Guotong Xie,周春英,陈华钧,彭志鹏,倪渊,谢国彤
DOI: https://doi.org/10.1016/S1007-0214(10)70113-9
2010-01-01
Abstract:The building of data mashups is complicated and error-prone, because this process requires not only finding suitable APls but also combining them in an appropriate way to get the desired result. This paper describes an ontology-driven mashup auto-completion approach for a data API network to facilitate this task. First, a microformats-based ontology was defined to describe the attributes and activities of the data APls. A semantic Bayesian network (sBN) and a semantic graph template were used for the link prediction on the Semantic Web and to construct a data API network denoted as N <inf xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">p</inf> . The performance is improved by a semi-supervised learning method which uses both labeled and unlabeled data. Then, this network is used to build an ontology-driven mashup auto-completion system to help users build mashups by providing three kinds of recommendations. Tests demonstrate that the approach has a precision <inf xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">p</inf> of about 80%, recall <inf xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">p</inf> of about 60%, and F <inf xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">O.5</inf> of about 70% for predicting links between APls. Compared with the API network N <inf xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">e</inf> composed of existing links on the current Web, N <inf xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">p</inf> contains more links including those that should but do not exist. The ontology-driven mashup auto-completion system gives a much better recall, and discounted cumulative gain (DCG) on N <inf xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">p</inf> than on N <inf xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">e</inf> . The tests suggest that this approach gives users more creativity by constructing the API network through predicting mashup APls rather than using only existing links on the Web.
What problem does this paper attempt to address?