M-DBSCAN: Modified DBSCAN Clustering Algorithm for Detecting and Controlling Outliers

Md Arabin Islam Talukder,Mehedi Hasan Shuvo,Momotaz Begum,Mst. Shapna Akter,Hossain Shahriar,A. K. I. Riad,M. Mostofa
DOI: https://doi.org/10.1145/3605098.3636188
2024-04-08
Abstract:Outlier reduction is crucial in computer science for improving data quality, analysis accuracy, and modeling robustness. Selection and modification of DBSCAN parameters are essential for optimal clustering accuracy and outlier detection. We developed an adaptive technique to minimize outliers in the DBSCAN algorithm using a linear congruential method (LCM) to determine values of Epsilon (Eps) and Min-Points (MinPts), known as modified DBSCAN (M-DBSCAN). To enhance the DBSCAN method, we create integer random numbers for MinPts (1--100) and floating numbers for Eps (0.1--1.5) using LCM. We adjusted parameter lists to reduce outliers based on MinPts and Eps values. We choose parameters based on dataset features and requirements, balancing clustering sensitivity and noise treatment. For experiment result analysis we use the Silhouette Score (SS) method. M-DBSCAN improved all cases and it has 50% poorer outlier accuracy than DBSCAN.
Computer Science
What problem does this paper attempt to address?