Knowledge Discovery Processing Model Based on Data Extractor

ZHENG Hong-Zhen,LIU Yang,ZHAN De-Chen
DOI: https://doi.org/10.3969/j.issn.1002-137X.2006.09.037
2006-01-01
Computer Science
Abstract:In this paper,a knowledge discovery model based on data extractor is proposed.According to this model,the process of knowledge discovery is divided into four stages:data preprocessing,data extraction,data mining and result analysis.This model,standard SQL is used to construct data extractor to prepare data for different learning algo- rithms,to reduce the number of times to invoke the database for the data mining algorithm,to avoid direct access to the data in large database and to make it possible to do rapid data mining to large database.By using this model,data dis- covery process has been sped up,efficiency of data mining has been promoted,and knowledge discovery for large data- base has been realized.Finally,we design the SQL-CA.5 algorithm,which realizes extracting necessary statistical data for decision-tree algorithm CA.5 and realizes construction of decision tree of algorithm CA.5.
What problem does this paper attempt to address?