QI2 -- an Interactive Tool for Data Quality Assurance

Simon Geerkens, Christian Sieberichs, Alexander Braun, Thomas Waschulzik
2023-07-10
Abstract: The importance of high data quality is increasing with the growing impact and distribution of ML systems and big data. In addition, the planned AI Act from the European Commission defines challenging legal requirements for data quality, especially for the market introduction of safety-relevant ML systems. In this paper we introduce a novel approach that supports the data quality assurance process for multiple data quality aspects. This approach enables the verification of quantitative data quality requirements. The concept and its benefits are introduced and explained using small example data sets. The application of the method is demonstrated on the well-known MNIST data set of handwritten digits.
Computers and Society, Artificial Intelligence, Data Structures and Algorithms, Machine Learning