Rethinking big data: A review on the data quality and usage issues

Liu Jianzheng, Li Jie, Li Weifeng, Wu Jiansheng
DOI: https://doi.org/10.1016/j.isprsjprs.2015.11.006
IF: 12.7
2016-01-01
ISPRS Journal of Photogrammetry and Remote Sensing
Abstract: The recent explosion of publications on big data has well documented the rise of big data and its ongoing prevalence. Different types of “big data” have emerged and have greatly enriched spatial information sciences and related fields in terms of breadth and granularity. Studies that were difficult to conduct in the past because of limited data availability can now be carried out. However, big data brings many “big errors” in data quality and data usage, and it cannot be used as a substitute for sound research design and solid theories. We identify and summarize the problems faced by current big data studies with regard to data collection, processing and analysis: inauthentic data collection, information incompleteness and noise of big data, unrepresentativeness, consistency and reliability, and ethical issues. Cases from empirical studies are provided as evidence for each problem. We propose that big data research should closely follow good scientific practice to provide reliable and scientific “stories”, and should explore and develop techniques and methods to mitigate or rectify the “big errors” brought by big data.