Efficient Discovery of Functional Dependencies with Degrees of Satisfaction.

Q Wei,GQ Chen
DOI: https://doi.org/10.1002/int.20038
IF: 8.993
2004-01-01
International Journal of Intelligent Systems
Abstract:Functional dependency (FD) is an important type of semantic knowledge reflecting integrity constraints in databases, and has nowadays attracted an increasing amount of research attention in data mining. Traditionally, FD is defined in the light of precise or complete data, and can hardly tolerate partial truth due to imprecise or incomplete data (such as noises, nulls, etc.) that may often exist in massive databases, or due to a very tiny insignificance of tuple differences in a huge volume of data. Based on the notion of functional dependencies with degrees of satisfaction (FDs)d, this article presents an efficient approach to discovering all satisfied (FDs)d using some important results obtained from exploration of (FDs)d properties such as extended Armstrong-like axioms and their derivatives. In this way, many dependencies can be inferred from previously discovered ones without scanning databases, and those unsatisfied ones could be filtered out inside (rather than after) the mining process. Fuzzy relation matrix operation is used to infer transitive dependencies in the mining algorithm. Finally, the efficiency is demonstrated with data experiments. © 2004 Wiley Periodicals, Inc. Int J Int Syst 19: 1089–1110, 2004.
What problem does this paper attempt to address?