Instance Selection by using polar grids

Sang Yongsheng,Zhang Yi
DOI: https://doi.org/10.1109/ICACTE.2010.5579549
2010-01-01
Abstract:Instance selection is about algorithms that search for a representative portion of data that can fulfill a data mining task as if the whole data is used. It is a very important data reduction technique, which can spare much memory and running time for data mining algorithms. This paper proposes a new method for Instance Selection by using Polar Grids (ISPG). The main idea is to search for a subset of instances located close to decision boundary by using a method based on polar grids. Original training instances are mapped into Polar reference frame, and the data space is partitioned as a set of polar grids. Then a special search algorithm is designed for determining which instances locate close to decision boundary. The method can also handle noisy instances and smooth data boundaries. The classical kNearest Neighbors (kNN) classification algorithm is employed to test the proposed method. Experiments show that the proposed method can reduce datasets effectively and achieve reasonable generalization accuracy. Moreover, the method achieves prominent learning speed, which can be used to process large spatial datasets. © 2010 IEEE.
What problem does this paper attempt to address?