Cross platform (RDBMS to NoSQL) database validation tool using bloom filter

Akansha Goyal,Arun Swaminathan,Rasika Pande,Vahida Attar
DOI: https://doi.org/10.1109/icrtit.2016.7569537
2016-04-01
Abstract:Data-structured, semi-structured and unstructured is growing at an exponential rate. Most organizations are dependent on this huge volume of data to make real time decisions. To provide accurate information to these decision making models, it is crucial that the data is of superior quality. Since most organizations are migrating from relational to NoSQL databases, it is vital to validate data after the migration process. Considering the different structures of the two databases, the process of validation is a formidable task. This paper talks about an approach to validate data between cross platform databases using denormalized schema structures and bloom filters. Based on our experimental results, we have been able to validate huge datasets and pinpoint the exact corrupted records in constant space and linear time complexity up to a desired error probability.
What problem does this paper attempt to address?