Improved Algorithm of C4.5 Decision Tree

Chen Ying,Ma Zhong-bing,Huang Min
DOI: https://doi.org/10.3969/j.issn.1003-6970.2013.02.020
2013-01-01
Software
Abstract:The traditional C4.5 algorithm has weak points of too much calculating quantity and failing to creat better decision tree under the circumstance of definite priori knowledge and thereby the time and structure of decision tree’s build are influenced. Computational efficiency can be improved by reducing information gain ratio according to mathematical equivalent infinitesimal property and by making up for errors from simplification by overall optimization strategy. In the optimization, balance degree constant is introduced in consideration of priori knowledge’s special features and consequently optimal branch strategy of C4.5 algorithm can be realized and the decision tree that conforms better to facts can also be founded. It is proved by experiment that compared with traditional C4.5 algorithm, the optimized project is provided with roughly the same classification accuracy and faster build speed.
What problem does this paper attempt to address?