Overcoming Key Weaknesses of Distance-based Neighbourhood Methods Using a Data Dependent Dissimilarity Measure

Kai Ming Ting, Ye Zhu, Mark Carman, Yue Zhu, Zhi-Hua Zhou
DOI: https://doi.org/10.1145/2939672.2939779
2016-01-01
Abstract: This paper introduces the first generic version of data dependent dissimilarity and shows that it provides a better closest match than distance measures for three existing algorithms in clustering, anomaly detection and multi-label classification. For each algorithm, we show that simply replacing the distance measure with the data dependent dissimilarity measure overcomes a key weakness of the otherwise unchanged algorithm.
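The "replace the distance measure, leave the algorithm unchanged" idea can be illustrated with a small sketch. The abstract does not define the paper's dissimilarity measure, so everything below is an assumption made for illustration: `make_interval_mass` is a hypothetical one-dimensional stand-in that scores two points by the fraction of the data lying between them, and `k_nearest` is a generic neighbour search that accepts any dissimilarity function. It is not the authors' measure or any of the three algorithms they evaluate.

```python
from bisect import bisect_left, bisect_right


def absolute_distance(x, y):
    """Ordinary data-independent distance between two 1-D points."""
    return abs(x - y)


def make_interval_mass(data):
    """Build a toy data-dependent dissimilarity (hypothetical stand-in).

    The returned function scores a pair (x, y) by the fraction of `data`
    that falls inside the closed interval [min(x, y), max(x, y)].
    """
    sorted_data = sorted(data)
    n = len(sorted_data)

    def dissimilarity(x, y):
        lo, hi = min(x, y), max(x, y)
        inside = bisect_right(sorted_data, hi) - bisect_left(sorted_data, lo)
        return inside / n

    return dissimilarity


def k_nearest(query, data, dissimilarity, k):
    """Return the k points of `data` closest to `query`; the search itself
    does not depend on which dissimilarity function is plugged in."""
    candidates = [p for p in data if p != query]
    return sorted(candidates, key=lambda p: dissimilarity(query, p))[:k]


# A dense group on the left and two isolated points on the right.
data = [0.0, 0.1, 0.2, 0.3, 0.4, 0.5, 2.0, 2.5]
mass = make_interval_mass(data)

# Both pairs are 0.5 apart, but the toy measure scores them differently
# because far more data lies between the first pair.
dense_pair = (absolute_distance(0.0, 0.5), mass(0.0, 0.5))
sparse_pair = (absolute_distance(2.0, 2.5), mass(2.0, 2.5))

# Same search routine, two different measures.
neighbours_by_distance = k_nearest(0.5, data, absolute_distance, 2)
neighbours_by_mass = k_nearest(0.5, data, mass, 2)
```

In this toy setting the two measures pick different second neighbours for the query `0.5`: the distance-based search stays inside the dense group, while the mass-based stand-in treats the isolated point `2.0` as equally close because no other data lies between them.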