Abstract:Random Forest is an ensemble of decision trees based on the bagging and random subspace concepts. As suggested by Breiman, the strength of unstable learners and the diversity among them are the ensemble models' core strength. In this paper, we propose two approaches known as oblique and rotation double random forests. In the first approach, we propose rotation based double random forest. In rotation based double random forests, transformation or rotation of the feature space is generated at each node. At each node different random feature subspace is chosen for evaluation, hence the transformation at each node is different. Different transformations result in better diversity among the base learners and hence, better generalization performance. With the double random forest as base learner, the data at each node is transformed via two different transformations namely, principal component analysis and linear discriminant analysis. In the second approach, we propose oblique double random forest. Decision trees in random forest and double random forest are univariate, and this results in the generation of axis parallel split which fails to capture the geometric structure of the data. Also, the standard random forest may not grow sufficiently large decision trees resulting in suboptimal performance. To capture the geometric properties and to grow the decision trees of sufficient depth, we propose oblique double random forest. The oblique double random forest models are multivariate decision trees. At each non-leaf node, multisurface proximal support vector machine generates the optimal plane for better generalization performance. Also, different regularization techniques (Tikhonov regularisation, axis-parallel split regularisation, Null space regularisation) are employed for tackling the small sample size problems in the decision trees of oblique double random forest. The proposed ensembles of decision trees produce trees with bigger size compared to the standard ensembles of decision trees as bagging is used at eah which results in improved performance. The evaluation of the baseline models and the proposed oblique and rotation double random forest models is performed on benchmark 121 UCI datasets and real-world fisheries datasets. Both statistical analysis and the experimental results demonstrate the efficacy of the proposed oblique and rotation double random forest models compared to the baseline models on the benchmark datasets.

Is rotation forest the best classifier for problems with continuous features?

A New Rotation Forest Ensemble Algorithm

A new ensemble classification approach based on Rotation Forest and LightGBM

Hyperspectral Remote Sensing Image Classification Based on Rotation Forest

Spectral–Spatial Classification for Hyperspectral Data Using Rotation Forests with Local Feature Extraction and Markov Random Fields

Oblique and rotation double random forest

Spectral-spatial Rotation Forest for Hyperspectral Image Classification

RotBoost: A technique for combining Rotation Forest and AdaBoost

Class-Separation-Based Rotation Forest for Hyperspectral Image Classification

Rotation-Based Ensemble Classifiers For High-Dimensional Data

Comparison of random forest, artificial neural networks and support vector machine for intelligent diagnosis of rotating machinery

Hyperspectral Image Classification with Rotation Random Forest Via KPCA

Hyperspectral Image Classification Based on Improved Rotation Forest Algorithm

Deep ensemble forests for industrial fault classification

Ensemble of optimal trees, random forest and random projection ensemble classification

Consistency of random forests

When are Deep Networks really better than Decision Forests at small sample sizes, and how?

LionForests: Local Interpretation of Random Forests

Towards the effectiveness of Deep Convolutional Neural Network based Fast Random Forest Classifier

A Mathematical Programming Approach to Optimal Classification Forests

Asymptotic Properties of High-Dimensional Random Forests