Ultrahigh Dimensional Variable Selection for Mapping Soil Carbon

Benjamin R. Fitzpatrick,David W. Lamb,Kerrie Mengersen
DOI: https://doi.org/10.48550/arXiv.1608.04253
2016-08-15
Applications
Abstract:Modern soil mapping is characterised by the need to interpolate samples of geostatistical response observations and the availability of relatively large numbers of environmental characteristics for consideration as covariates to aid this interpolation. We demonstrate the efficiency of the Least Angle Regression algorithm for Least Absolute Shrinkage and Selection Operator (LASSO) penalized multiple linear regression at selecting covariates to aid the spatial interpolation of geostatistical soil carbon observations under an ultrahigh dimensional scenario. Where an exhaustive search of the models that could be constructed from 800 potential covariate terms and 60 observations would be prohibitively demanding, LASSO variable selection is accomplished with trivial computational investment.
What problem does this paper attempt to address?