Definition and Evaluation of Data Quality: User-Oriented Data Object-Driven Approach to Data Quality Assessment

Anastasija Nikiforova
DOI: https://doi.org/10.22364/BJMC.2020.8.3.02
2020-09-28
Baltic Journal of Modern Computing
Abstract:. Data quality issue has emerged since the end of the 60’s, however, more than 50 years later, it remains unresolved and is still current, mainly due the popularity of data and open data. The paper proposes a data object-driven approach to data quality evaluation. This user-oriented solution is based on 3 main components: data object, data quality specification and the process of data quality measuring. These components are defined by 3 graphical DSLs, that are easy enough even for non-IT experts. The approach ensures data quality analysis depending on the use-case. Developed approach allows analysing quality of “third-party” data. The proposed solution is applied to open data sets. The result of approbation of the proposed approach demonstrated that open data have numerous data quality issues. There are also underlined common data quality problems detected not only in Latvian open data but also in open data of 3 European countries.
Computer Science
What problem does this paper attempt to address?