A Relative Granular Ratio-Based Outlier Detection Method in Heterogeneous Data

Lu Gao,Mingjie Cai,Qingguo Li
DOI: https://doi.org/10.1016/j.ins.2022.11.154
IF: 8.1
2023-01-01
Information Sciences
Abstract:Outlier detection is the discovery of some objects that are significantly different from many objects in data, and it is widely used in important fields. Most existing methods are based on prior knowledge, while few methods are suitable for heterogeneous data. In this paper, we detect outliers based on neighborhood rough set, which can process heterogeneous data and reduce some hyper-parameters. Considering the few characters of outliers, a relative granular ratio factor is consequently created to measure the size of a neighborhood in which an object belongs. Since outliers always differ from the majority of objects, a granule-based majority set is defined. Then, a valid outlier factor is determined by the feature of a negative region to measure the difference between outliers and the majority set. Finally, a ratio and negative region detection factor (RNRD) is constructed by combining the above factors under a wide range of relations. In addition, the RNRD-based outlier detection (RNROD) algorithm is designed. And experiments show the superiority of RNROD by comparing with seven existing detection algorithms on sixteen heterogeneous datasets.
What problem does this paper attempt to address?