Intelligent mapping of geochemical anomalies: Adaptation of DBSCAN and mean-shift clustering approaches

Mahsa Hajihosseinlou,Abbas Maghsoudi,Reza Ghezelbash
DOI: https://doi.org/10.1016/j.gexplo.2024.107393
IF: 4.166
2024-01-08
Journal of Geochemical Exploration
Abstract:Cluster analysis can be used to organize samples and generate ideas regarding the multivariate geochemistry of given dataset. Traditional clustering techniques have the drawbacks of high computational complexity and poor adaptability to big data. Hence, there has recently been much focus on creating better clustering algorithms. Although many clustering algorithms have been applied, and some produce notable clustering results, the performance efficiency of algorithms is often highly dependent on the values the user chooses for the parameters. Currently, density-based spatial clustering of applications with noise clustering (DBSCAN) is widely utilized in image processing, bioinformatics, and social network analysis owing to its ability to detect clusters of various shapes. Even though partitional clustering techniques may be effective when the number of clusters K is known in advance, they cannot implement non-convex clustering and rapidly return to a local optimum. This study proposes the concept of DBSCAN clustering for stream sediment geochemical data. In this respect, the geochemical data collected from Varcheh district, SW Iran, were processed using the clr transformation before applying DBSCAN. Then, PCA was used to minimize the dimension of variables and specify the mineralization-related elements. In the following, one of the PCs connected with mineralization (PC2) was chosen for further analysis. DBSCAB, Mean-shift and Fuzzy K-means algorithms were used to monitor the multi-element geochemical anomalies linked to MVT PbZn deposits in the study area. According to Davies-Bouldin and Silhouette as two validation metrics, it can be deduced that the three SCB models are advantageous, however, the model generated by DBSCAN is preferable to the model generated by Mean-shift and Fuzzy K-means.
geochemistry & geophysics
What problem does this paper attempt to address?