Analysis of Data Collection and Preparation Issues In Archival Data Mining

Wang Jing,Xin Yuming,Gao Hongyan
DOI: https://doi.org/10.3969/j.issn.1008-0821.2012.06.018
2012-01-01
Abstract:Data mining technology can help people extract implicit,potential and valuable information from massive information resources,so it has been introduced to deal with the explosive growth of archival information resources.Whether the source data for mining is complete and standardized is directly related to the quality of mining.According to the situations,based on the concept description of all aspects of data collection and preparation,combining the status of archival information resources,as well as the characteristics of the data,this paper illustrates the points for attention and the specific method of each aspect before mining,aiming at ameliorating the quality of the information and the mining.This makes a good foundation for consequent research.
What problem does this paper attempt to address?