An Original Data Understanding Process.

Wenjun Quan,Qing Zhou,Hai Nan,Ping Wang
DOI: https://doi.org/10.1145/3207677.3277974
2018-01-01
Abstract:The data mining1 standard process divides a data mining project into six phases, i.e. business understanding, data understanding, data preparation, modeling, evaluation and deployment. The goal of the data understanding phase is to understand the original data. At present, there are relatively few studies on this phase. In practical applications, some visualization methods are usually used to understand the original data. Therefore, we propose a systematic process for data understanding, and make full use of visualization technology to help users understand the data. In addition, we revise the DP (Density Peaks) algorithm to identify the high-density region, and integrate it into the data understanding process. The experimental results show that the data understanding process proposed in this paper is effective.
What problem does this paper attempt to address?