Kernel-based retrieval models for hyperspectral image data optimized with Kernel Flows

Zina-Sabrina Duma,Tuomas Sihvonen,Jouni Susiluoto,Otto Lamminpää,Heikki Haario,Satu-Pia Reinikainen
2024-11-12
Abstract:Kernel-based statistical methods are efficient, but their performance depends heavily on the selection of kernel parameters. In literature, the optimization studies on kernel-based chemometric methods is limited and often reduced to grid searching. Previously, the authors introduced Kernel Flows (KF) to learn kernel parameters for Kernel Partial Least-Squares (K-PLS) regression. KF is easy to implement and helps minimize overfitting. In cases of high collinearity between spectra and biogeophysical quantities in spectroscopy, simpler methods like Principal Component Regression (PCR) may be more suitable. In this study, we propose a new KF-type approach to optimize Kernel Principal Component Regression (K-PCR) and test it alongside KF-PLS. Both methods are benchmarked against non-linear regression techniques using two hyperspectral remote sensing datasets.
Machine Learning,Computational Engineering, Finance, and Science,Methodology
What problem does this paper attempt to address?
The problem that this paper attempts to solve is to optimize the parameter selection of kernel - based methods in hyperspectral image data. Specifically, the paper proposes a new Kernel Flows (KF) - based method to optimize the kernel parameters in Kernel Principal Component Regression (K - PCR) and compares it with the optimized Kernel Partial Least Squares (K - PLS). The paper benchmarks these two methods using two hyperspectral remote sensing datasets to evaluate their performance in nonlinear regression techniques. ### Background Hyperspectral imaging technology shows great potential in environmental monitoring, such as in water quality monitoring, agriculture, vegetation health, and climate observation. However, although traditional deep - learning methods have been successful in creating retrieval models from hyperspectral data, they also have problems such as over - fitting, poor interpretability, and the need for large amounts of data. In contrast, traditional chemometric models (such as Partial Least Squares regression PLS) require a smaller amount of training data, but have a weaker ability to model nonlinear relationships. Therefore, kernelized chemometric methods can better handle nonlinear dependencies, but the learning of kernel functions and their parameters is a complex problem. ### Research Objectives 1. **Optimize Kernel Parameters**: Use the Kernel Flows (KF) method to optimize the kernel parameters in Kernel Principal Component Regression (K - PCR). 2. **Performance Evaluation**: Through two hyperspectral remote - sensing datasets, compare the performance of the optimized K - PCR and K - PLS with nonlinear regression techniques. 3. **Improve Loss Function**: Propose a new loss function to improve the performance of KF when optimizing K - PCR and K - PLS. ### Methods - **Kernel Flows (KF)**: KF is a method that minimizes the loss function through cross - validation and Stochastic Gradient Descent (SGD) and can effectively learn kernel parameters. - **K - PCR and K - PLS**: The paper extends previous work on K - PLS, applies it to K - PCR, and proposes a new loss function to adapt to the characteristics of K - PCR. - **Datasets**: Two hyperspectral remote - sensing datasets, one is a soil moisture soft sensor, and the other is a vegetation feature model. ### Results - **Soil Moisture Prediction**: The optimized K - PCR model shows higher accuracy in soil moisture prediction, especially on the test set. - **Plant Nitrogen Content Prediction**: In the vegetation feature model, the optimized K - PLS performs best, followed by Gaussian Process Regression (GPR). K - PLS and K - PCR are superior to linear methods and, due to their fewer parameters and cross - validation nature, reduce the risk of over - fitting. ### Discussion - **Performance Improvement**: The new loss function brings a 33% performance improvement in K - PCR and a 2% performance improvement in K - PLS. - **Applicability**: This method is applicable not only to soil moisture prediction but also to the prediction of other biogeophysical quantities, such as plant nitrogen content. - **Future Work**: Future research can explore more effective sample selection strategies to optimize the construction of the kernel matrix and training time for large - scale datasets. ### Conclusion The paper proposes an optimization method based on Kernel Flows to optimize the kernel parameters in K - PCR. The experimental results show that this method performs well in both soil moisture prediction and plant nitrogen content prediction and has broad application prospects.