Lithium exploration targeting through robust variable selection and deep anomaly detection: An integrated application of sparse principal component analysis and stacked autoencoders

Saeid Esmaeiloghli,Alexandre Lima,Behnam Sadeghi
DOI: https://doi.org/10.1016/j.chemer.2024.126111
2024-03-16
Chemie der Erde
Abstract:Lithium is a strategic metal for high-technology industries that plays a vital role in realizing electromobility and effective energy storage for smartphones and electric/hybrid vehicles, in addition to being important in accumulating energy from renewable sources. Motivated by this, the development of state-of-the-art workflows is imperative to make Li exploration more efficient and guarantee a sustainable supply to the demanding markets in the future. In this paper, a computational protocol was described to address two critical challenges raised in the Li exploration: (i) selection of mineralization-related pathfinder variables and (ii) recognition of multi-element geochemical anomalies related to ore-forming processes. Robust sparse principal component analysis (SPCA) was employed to reduce active (non-zero) loading coefficients and straightforward selection of Li pathfinder variables. The selected variables were then fed into the training network of stacked autoencoders (SAE) to learn deep representations of multivariate input signals and quantify reconstruction errors linked to Li-vectoring geochemical anomalies. As a representative example, the proposed workflow, i.e., SPCA + SAE, was implemented in the R programming language and applied to a real-case experiment with stream sediment geochemical data pertaining to the Moalleman district, NE Iran. Moreover, compositional robust principal component analysis (RPCA) and Mahalanobis distance (MD) technique were adopted to constitute two comparative models, RPCA + SAE and SPCA + MD, as benchmarks to judge the competence of the SPCA + SAE model in variable selection and anomaly detection, respectively. A performance appraisal by success-rate curves and relevant area under the curves (AUCs) indicated that the SPCA + SAE model brings, by calculating the highest AUC, spatial patterns that are more promising for vectoring towards Li-mineralized grounds. Moreover, AUCSPCA+MD was measured to be greater than AUCRPCA+SAE , suggesting robust variable selection is a more important paradigm than deep anomaly detection for Li exploration targeting. Student's t –statistic method was eventually applied to the anomaly map from the SPCA + SAE model to define relevant thresholds for narrowing down prospective areas for the next round of Li exploration within the study area.
geosciences, multidisciplinary,geography, physical
What problem does this paper attempt to address?