Three Perspectives of Data Mining

Zhi‐Hua Zhou
DOI: https://doi.org/10.1016/s0004-3702(02)00357-0
IF: 14.4
2003-01-01
Artificial Intelligence
Abstract:This paper reviews three recent books on data mining written from three different perspectives, i.e., databases, machine learning, and statistics. Although the exploration in this paper is suggestive instead of conclusive, it reveals that besides some common properties, different perspectives lay strong emphases on different aspects of data mining. The emphasis of the database perspective is on efficiency because this perspective strongly concerns the whole discovery process and huge data volume. The emphasis of the machine learning perspective is on effectiveness because this perspective is heavily attracted by substantive heuristics working well in data analysis although they may not always be useful. As for the statistics perspective, its emphasis is on validity because this perspective cares much for mathematical soundness behind mining methods.
What problem does this paper attempt to address?