Experimental Comparisons of Instances Set Reduction Algorithms

yuelin yu,yangguang liu,bin xu,xiaoqi he
DOI: https://doi.org/10.1007/978-3-642-37829-4_52
2014-01-01
Abstract:As techniques of data acquisition and data storage rapidly developed, more and larger datasets are very easily faced in machine learning. In order to avoid excessive storage and time consuming, and possibly to improve generalization accuracy by removing noise, several works presented as reduction techniques have been proposed. In this paper, firstly, we will review most traditional and typical reduction algorithms and find out their strengths and weaknesses, respectively. In addition, nine typical reduction algorithms are compared performing on 16 classification tasks. At last, some valuable directions for further research are proposed based on discussions and conclusion of traditional algorithms mentioned.
What problem does this paper attempt to address?