A Design of Query Answering Processor for Data Integration Based on Logic

XIE Xing-sheng,LI Bin,FANG Xiang,ZHUANG Zhen-quan
DOI: https://doi.org/10.3969/j.issn.0253-2778.2006.11.015
2006-01-01
JUSTC
Abstract:A logic-based scheme for data integration was proposed,and a query answering processor(QAP),which is a core component of the system, was developed.Data integration is a problem of combining the data residing at different,heterogeneous sources,and providing the user with a unified data view,called mediated schema.It is the task of the system to free the user from the knowledge of where data are,and how data are structured at the sources.In this scheme architecture,the data sources were defined as views over the mediated schema in a paradigm of local-as-view(LAV),the data storage was managed following a quasi-virtual approach,i.e.,while the data still residing at the sources during query processing,a data warehouse,treated as norm-data source,was used seamlessly as an optional enhanced component for data storage buffering as well.Moreover,a logic of the description logics(DL) family was used to model the mediated schema,to formulate queries posed to the system,and to perform several types of automated reasoning supporting both the modeling and the query answering process.By employing a hybrid-reasoning method,which is a hybrid of the Datalog inference in first order predicate logic and automatic reasoning services of description logic,an algorithm,used by QAP to rewrite user queries using views,was presented and illustrated.The study shows that,when the query and views are conjunctive,the algorithm is always able to produce a maximally-contained rewriting,and is scales up well in the presence of a large number of views.
What problem does this paper attempt to address?