Model Selection for Partial Least Squares Based Dimension Reduction

Guo-Zheng Li, Rui-Wei Zhao, Hai-Ni Qu, Mingyu You
DOI: https://doi.org/10.1016/j.patrec.2011.11.009
IF: 4.757
2012-01-01
Pattern Recognition Letters
Abstract: Partial least squares (PLS) has been widely applied to scientific data sets as an effective dimension reduction technique. The number of dimensions extracted by PLS is usually determined by cross validation, which carries a heavy computational load. Earlier work proposed fixing the number at three, but intuitively a single fixed value is not suitable for all data sets. Based on the intrinsic connection between PLS and the structure of data sets, two novel algorithms are proposed to determine the number of extracted principal components, keeping the valuable information while excluding the trivial. Both algorithms adapt to different data sets and are easy to implement, and both perform better than previous approaches on nine real-world data sets.