Querying Big Data from a Database Perspective

Wenfeng Zhao, Guohua Liu, Zhao Chen, Douglas Nyabuga, Huichun Yang, Heng Zhang, Mengfei Ni
DOI: https://doi.org/10.1109/icsai.2017.8248511
2017-01-01
Abstract: Querying big data is the cornerstone of big data applications. From a database perspective, a query is a function defined by a domain, a range, and a specified semantics. When querying big data, the domain is the big data itself, which is complicated by its large volume, heterogeneous types, strong timeliness, weak authenticity, and other characteristics. To describe and analyze big data querying theoretically, we propose definitions of big data and of a big data system, which includes querying big data. The aforementioned characteristics break the normal-form requirements and the Closed World Assumption (CWA) associated with traditional databases. This paper therefore also identifies several challenges arising from these characteristics of the domain and analyzes them in detail using first-order language. Because the complexity of the domain makes traditionally tractable queries infeasible, the paper also analyzes and summarizes a classification of queries on relational big data according to their structure and computational complexity.