Decomposing Data Mining By A Process-Oriented Execution Plan

Yan Zhang,Honghui Li,Alexander Woehrer,Peter Brezany,Gang Dai
DOI: https://doi.org/10.1007/978-3-642-16530-6_13
2010-01-01
Abstract:Data mining deals with the extraction of hidden knowledge from large amounts of data. Nowadays, coarse-grained data mining modules are used. This traditional black box approach focuses on specific algorithm improvements and is not flexible enough to be used for more general optimization and beneficial component reuse. The work presented in this paper elaborates on decomposing data mining tasks as data mining execution process plans which are composed of finer-grained data mining operators. The cost of an operator can be analyzed and provides means for more holistic optimizations. This process-based data mining concept is evaluated via an OGSA-DAI based implementations for association rule mining which show the feasibility of our approach as well as the re-usability of some of the data mining operators.
What problem does this paper attempt to address?