Reasoning About Micro Dependencies
Ji-Zhou SUN,Jian-Zhong LI,Hong GAO,Xian-Min LIU
DOI: https://doi.org/10.11897/SP.J.1016.2016.02134
2016-01-01
Chinese Journal of Computers
Abstract:Originally,functional dependency theory got a lot of attentions as a schema designing tool,which is not so effective in data repairing.Recent years,more constrains have been proposed to detect and repair inconsistent data,including conditional functional dependencies (CFDs),fixing rules and editing rules,etc.However,to the best of our knowledge,all of the proposals focus on dependencies between entire attributes,while there are ubiquitous dependencies between partial information of the attributes in the real world.For example,the id’s 2-length prefix of an employee may determine her department,while this kind of dependencies have been ignored by previous proposals. In this paper, we firstly propose a class of more general constrains,referred to as micro-dependencies(MDs).Extracting functions(EFs)are involved into MDs to extract partial information from attributes. With dependencies between EFs,more inconsistent data in a dataset can be detected.For static analysis of MDs,we then investigate the satisfiability problem and the implication problem analogous to those for CFDs.And then a sound and complete inference system for implication analysis is developed.Finally,we experimentally show that MDs can detect much more errors in data with an acceptable time cost.