Extreme Learning Tree

Anton Akusok, Emil Eirola, Kaj-Mikael Björk, Amaury Lendasse
DOI: https://doi.org/10.48550/arXiv.1912.09087
2019-12-19
Abstract: The paper proposes a new variant of a decision tree, called an Extreme Learning Tree. It consists of an extremely random tree with non-linear data transformation, and a linear observer that provides predictions based on the leaf index where the data samples fall. The proposed method outperforms linear models on a benchmark dataset, and may be a building block for a future variant of Random Forest.
Machine Learning
What problem does this paper attempt to address?
This paper aims to improve the performance of decision trees in machine learning tasks, particularly for big data processing. The authors propose a new decision tree variant, the Extreme Learning Tree (ELT), which combines the advantages of randomized methods and decision trees to achieve better predictive performance.

### Main Issues:
1. **Improving the performance of decision trees**: Traditional decision trees have limitations in handling complex nonlinear data, especially in big data scenarios. ELT aims to improve the expressiveness and generalization of decision trees by introducing randomness and nonlinear transformations.
2. **Reducing computational cost**: Randomized methods can reduce computational cost without sacrificing performance. ELT builds its tree from random features and random split points, which avoids the cost of searching for optimal splits.
3. **Improving the basic unit of random forests**: ELT could serve as the base learner of a random forest, and may be used to build more powerful ensemble models in the future.

### Solution:
- **Extreme Learning Tree (ELT)**: ELT consists of two parts:
  - **Random Nonlinear Tree**: An extremely random tree built on extended data features (from the Extreme Learning Machine, ELM), in which each node splits on a random feature at a random split point.
  - **Linear Observer**: A linear model trained on the index of the leaf each sample falls into, which learns the mapping from leaves to the target output.

### Experimental Results:
- Experiments on the Iris dataset show that ELT is more accurate than linear models (such as Ridge Regression) but slightly less accurate than a traditional decision tree and ELM.
- Although a single ELT does not match a traditional decision tree, it clearly surpasses linear models, which indicates that the leaf-based representation of data samples carries useful information.
### Conclusion:
- ELT provides a new variant of decision trees that can improve model predictive performance while maintaining low computational costs.
- Future research will explore the application of ELT in ensemble methods similar to random forests to further enhance its performance.
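
### Illustrative sketch:
The two-part structure described under Solution (a random nonlinear tree followed by a linear observer on the leaf index) can be sketched roughly as below. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names, the `tanh` activation, the depth and leaf-size limits, the ridge penalty `alpha`, the choice to split only on the hidden-layer features, and the synthetic data are all hypothetical choices made for the example.

```python
import numpy as np

def elm_features(X, W, b):
    """Random nonlinear projection, in the style of an ELM hidden layer (assumed tanh)."""
    return np.tanh(X @ W + b)

def build_random_tree(H, rng, max_depth=4, min_samples=5):
    """Grow a tree where every split uses a random feature and a random threshold.

    Returns (tree, n_leaves); the tree is a nested dict and each leaf has a unique index.
    """
    counter = [0]  # running count of leaves created so far

    def make_leaf():
        leaf = {"leaf": counter[0]}
        counter[0] += 1
        return leaf

    def grow(rows, depth):
        if depth >= max_depth or len(rows) < min_samples:
            return make_leaf()
        j = int(rng.integers(H.shape[1]))           # random feature
        lo, hi = H[rows, j].min(), H[rows, j].max()
        if lo == hi:                                # nothing to split on
            return make_leaf()
        t = rng.uniform(lo, hi)                     # random split point
        go_left = H[rows, j] <= t
        return {"feature": j, "threshold": t,
                "left": grow(rows[go_left], depth + 1),
                "right": grow(rows[~go_left], depth + 1)}

    tree = grow(np.arange(H.shape[0]), 0)
    return tree, counter[0]

def leaf_index(tree, h):
    """Route one transformed sample down the tree and return its leaf index."""
    node = tree
    while "leaf" not in node:
        side = "left" if h[node["feature"]] <= node["threshold"] else "right"
        node = node[side]
    return node["leaf"]

def leaf_one_hot(tree, n_leaves, H):
    """Encode each sample as a one-hot vector over leaves."""
    Z = np.zeros((H.shape[0], n_leaves))
    idx = [leaf_index(tree, h) for h in H]
    Z[np.arange(H.shape[0]), idx] = 1.0
    return Z

def fit_observer(Z, Y, alpha=1e-3):
    """Linear observer: ridge-regularised least squares from leaf indicators to targets."""
    A = Z.T @ Z + alpha * np.eye(Z.shape[1])
    return np.linalg.solve(A, Z.T @ Y)

# Synthetic data (an assumption for the demo; the paper uses Iris).
rng = np.random.default_rng(0)
X = rng.normal(size=(150, 4))
y = (X[:, 0] * X[:, 1] > 0).astype(int)   # nonlinear two-class labels
Y = np.eye(2)[y]                           # one-hot targets

W = rng.normal(size=(4, 10))               # random, untrained hidden-layer weights
b = rng.normal(size=10)
H = elm_features(X, W, b)

tree, n_leaves = build_random_tree(H, rng)
Z = leaf_one_hot(tree, n_leaves, H)
beta = fit_observer(Z, Y)
pred = (Z @ beta).argmax(axis=1)           # predicted class per training sample
```

Only `fit_observer` involves any optimization; the projection and the tree are drawn at random and never tuned, which is where the reduced computational cost would come from in this sketch.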