Optimizing the number of branches in a decision forest using association rule metrics

Youness Manzali,Mohamed Elfar
DOI: https://doi.org/10.1007/s10115-024-02069-8
IF: 2.7
2024-02-27
Knowledge and Information Systems
Abstract:Ensemble methods, such as random forest algorithms, typically outperform single classifiers. However, they often demand substantial storage memory and involve relatively time-consuming predictions. Numerous approaches have been introduced to reduce ensemble size while maintaining classification performance. This paper introduces a novel strategy for forest pruning based on association rule metrics called PRM. The PRM method extracts branches from the initial forest, calculates the score for each branch, and removes poor-performing branches. Subsequently, it utilizes the selected branches to predict unseen data by aggregating their predictions. This approach can be applied to various types of tree ensembles. We evaluated the proposed PRM method using twenty UCI machine learning and Kaggle repositories datasets. Compared to four ensemble pruning techniques, our findings demonstrate that PRM can significantly reduce the forest size while enhancing ensemble performance, surpassing state-of-the-art methods.
computer science, information systems, artificial intelligence
What problem does this paper attempt to address?