Hardware Implementation of Random Forest Algorithm Based on Classification and Regression Tree

Ziheng Teng,Lijian Chu,Kai Chen,Guoqiang He,Yuxiang Fu,Li
DOI: https://doi.org/10.1109/iciba50161.2020.9276928
2020-01-01
Abstract:Decision tree is a popular method for various machine learning tasks. Tree learning structure can best meet the requirements of data mining. The current applications of decision tree usually aims for a specific case, In addition, the number of features extracted from these decision trees is relatively low, usually less than 10. Although this reduces the use of resources and time, it is also easy to reduce the accuracy of the algorithm. This paper mainly studies the method of building decision tree based on CART algorithm and the hardware design and implementation of the random forest algorithm. In this paper, a random forest algorithm is introduced to solve the problem of over fitting, the SystemC language is used to model the random forest algorithm, including the training part and the inference part of the algorithm, when the input data exceeds the bank depth, it can be stored 2-4 times by cutting the bank address bit in half, and constructed a system model of the data storage method, the design completes the hardware implementation of the random forest algorithm, the training part includes the data storage module, the top-level control module, and the algorithm execution module, the inference part includes the data storage module and the top-level control module. Finally, the functional correctness of the hardware implementation is verified on FPGA. The decision tree data storage scheme proposed in this paper has high practical value, and improves storage efficiency through bank folding.
What problem does this paper attempt to address?