A General Multi-Source Data Fusion Framework

Weiming Liu,Chen Zhang,Bin Yu,Yitong Li
DOI: https://doi.org/10.1145/3318299.3318394
2019-01-01
Abstract:With the development of the Internet, the increase of information sources and speed of information release and transmission have led to a sharp increase in the amount of information. To enable users finding more accurate and reliable information in the large heterogeneous multi-source data, data fusion technology becomes more and more important. Data fusion technology structuralizes and integrates heterogeneous data from different sources which greatly improves the comprehensiveness, availability and extensibility of data. This paper proposes a general multi-source data fusion framework. The framework transforms multi-source structured data, semi-structured data and unstructured data into unified data format described by RDF (Resource Description Framework) standard, and then realizes information fusion through data fusion algorithm, to solve the heterogeneity and semantic conflict in multi-source data fusion under the big data environment.
What problem does this paper attempt to address?