A Fast Multi-Level Co-Location Pattern Mining Approach Based on Extended Maximal Cliques
Jinjia Dai,Lijin Tang,Lizhen Wang,Lihua Zhou,Hongmei Chen
DOI: https://doi.org/10.1109/ijcnn60899.2024.10651236
2024-01-01
Abstract:Spatial multi-level co-location pattern mining (MLCPM) is the improvement and further development of traditional spatial co-location pattern mining, which can simultaneously mine both global and local prevalent co-location patterns. Existing MLCPM algorithms are quite time-consuming and are very sensitive to prevalence thresholds. Especially when the prevalence threshold is changed, the entire mining process needs to be redone. Considering the above problems, this paper proposes a novel MLCPM algorithm based on extended maximal cliques (MLCPM-EMC). Firstly, a new materialization model of neighbor relationships between instances, the extended maximal clique (EMC), is proposed to reduce the number of the maximal cliques. Secondly, a novel hash structure is designed to store the EMCs. With the hash structure, the algorithm can recognize global prevalent colocations quickly. Moreover, a DBSCAN clustering method based on participation instances of local patterns is used to efficiently identify their prevalent regions. Lastly, extensive experiments on both synthetic and real-world datasets show that the EMC materialization model reduces the number of maximal cliques by about 80%. And the MLCPM-EMC algorithm has better performance and scalability than two state-of-the-art baseline algorithms, in particular, the efficiency of re-mining increases several times when changing the prevalence threshold.