Mineral Prospectivity Mapping Using Semi-supervised Machine Learning

Quanke Li,Guoxiong Chen,Detao Wang
DOI: https://doi.org/10.1007/s11004-024-10161-6
2024-10-26
Mathematical Geosciences
Abstract:Recent years have seen great research enthusiasm for the application of artificial intelligence techniques to support mineral exploration. Various supervised machine learning (ML) algorithms have been introduced to the field of mineral prospectivity mapping (MPM), as ML models can automatically construct the complex relationships between known mineral deposits and exploration data. However, the limited number of ore deposits in mineral exploration practice poses a great challenge when using supervised ML models. In this study, we focused on applying semi-supervised ML models, including semi-supervised random forest (SSRF) and support vector machine (S3VM), to address the scarcity problems of ore deposits in ML-based MPM, in order to develop a novel scheme based on generative adversarial networks (SemiGAN) for more accurate MPM. Notably, in order to exclude high-risk unlabeled samples in semi-supervised models, a new scheme was proposed to select representative unlabeled samples by calculating similarity based on principal component analysis and cosine distance. A case study of W–Sn mineral prospectivity modeling in the Nanling metallogenic belt in South China was used to validate the proposed semi-supervised machine learning (SSML) methods. The results show that SemiGAN achieves the highest prediction accuracy (88.3%) and area under the receiver operating characteristic curve (AUC = 0.954). Moreover, the MPMs obtained by SSML predictive models including SemiGAN, SSRF, and S3VM contain more known deposits (105, 105, and 82, respectively) than supervised machine learning predictive models (e.g., RF and SVM, with 66 and 52, respectively) in the top 5% high-potential area. The SSML methods outperform the supervised learning methods in terms of the proportion of mineral deposits and predictive rate curves in the high-potential area. These improvements demonstrate that SSML methods such as SemiGAN and SSRF can effectively improve the predictive performance and generalization of data-driven MPM.
geosciences, multidisciplinary,mathematics, interdisciplinary applications
What problem does this paper attempt to address?