A Projection Pursuit Forest Algorithm for Supervised Classification

Natalia da Silva, Dianne Cook, Eun-Kyung Lee
DOI: https://doi.org/10.1080/10618600.2020.1870480
2018-07-26
Abstract: This paper presents a new ensemble learning method for classification problems called projection pursuit random forest (PPF). PPF uses the PPtree algorithm introduced in Lee et al. (2013). In PPF, trees are constructed by splitting on linear combinations of randomly chosen variables. Projection pursuit is used to choose a projection of the variables that best separates the classes. Using linear combinations of variables to separate classes takes the correlation between variables into account, which allows PPF to outperform a traditional random forest when separations between groups occur in combinations of variables. The method presented here can be used in multi-class problems and is implemented in an R (R Core Team, 2018) package, PPforest, which is available on CRAN, with development versions at https://github.com/natydasilva/PPforest.
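To make the idea of splitting on a linear combination of randomly chosen variables concrete, here is a minimal Python sketch of a single node split. It is not the PPforest or PPtree implementation: the function names `random_projection_split` and `predict_split` are hypothetical, Fisher's linear discriminant direction is assumed as a stand-in for the projection pursuit index, and only two classes are handled, whereas the method in the paper covers multi-class problems.

```python
import numpy as np

def random_projection_split(X, y, n_vars, rng):
    """One node split on a linear combination of randomly chosen variables.

    Assumptions (not from the paper): Fisher's linear discriminant direction
    stands in for the projection pursuit index, and there are two classes.
    """
    classes = np.unique(y)
    assert len(classes) == 2, "this sketch handles two classes only"
    # Randomly choose the subset of variables considered at this node.
    idx = rng.choice(X.shape[1], size=n_vars, replace=False)
    Xs = X[:, idx]
    A, B = Xs[y == classes[0]], Xs[y == classes[1]]
    mean_a, mean_b = A.mean(axis=0), B.mean(axis=0)
    # Pooled within-class scatter, with a small ridge for numerical stability.
    Sw = (A - mean_a).T @ (A - mean_a) + (B - mean_b).T @ (B - mean_b)
    Sw += 1e-8 * np.eye(n_vars)
    # Direction that best separates the two class means relative to the scatter.
    w = np.linalg.solve(Sw, mean_b - mean_a)
    w /= np.linalg.norm(w)
    # Split at the midpoint of the projected class means.
    cut = 0.5 * (mean_a @ w + mean_b @ w)
    return idx, w, cut, classes

def predict_split(X, idx, w, cut, classes):
    """Assign each row to a class by the side of the cut its projection falls on."""
    return np.where(X[:, idx] @ w > cut, classes[1], classes[0])

# Toy data: the classes separate along x2 - x1, not along either variable alone.
rng = np.random.default_rng(0)
n = 200
x1 = rng.normal(size=2 * n)
y = np.repeat([0, 1], n)
x2 = x1 + 0.3 * rng.normal(size=2 * n) + np.where(y == 1, 1.0, -1.0)
X = np.column_stack([x1, x2])

idx, w, cut, classes = random_projection_split(X, y, n_vars=2, rng=rng)
acc = np.mean(predict_split(X, idx, w, cut, classes) == y)
print(f"training accuracy of the single oblique split: {acc:.3f}")
```

On this toy data the two variables are strongly correlated and the class shift lies along their difference, so a single oblique split can separate the groups where an axis-aligned split on either variable would struggle. A forest in the spirit of the abstract would repeat such splits recursively on bootstrap samples and aggregate the resulting trees.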
Machine Learning