Discretizing Continuous Variables of Bayesian Networks Based on Genetic Algorithms

王飞,刘大有,薛万欣
DOI: https://doi.org/10.3321/j.issn:0254-4164.2002.08.002
2002-01-01
Chinese Journal of Computers
Abstract: This paper presents DGA, a discretization algorithm based on genetic algorithms. Univariate discretization handles each variable in isolation; DGA is a multivariate method that discretizes each variable while taking into account its interactions with the other variables, and so yields a more accurate result. DGA searches for the best discretization policy using genetic operators. This avoids two drawbacks inherent in the deterministic search used by previous discretization algorithms: getting trapped in local maxima, and having to specify an ordering of the variables in advance. The paper (1) gives a fitness function that accounts for the accuracy and conciseness of both the discretization model and the learned structure; (2) describes the encoding, defining equivalence between discretization policies based on the essential nature of a discretization policy; and (3) designs genetic operators that transform individuals so that good discretization policies evolve. Experimental results show that the algorithm discretizes continuous variables effectively, so that the learned structure performs well.
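The abstract names three ingredients (a fitness function, an encoding of discretization policies, and genetic operators) without giving their definitions. The Python sketch below is only an illustration of how such a search might be organized, not the DGA algorithm itself. Every name in it is hypothetical. It assumes a policy is encoded as one sorted list of cut points per variable. It uses pairwise mutual information minus a penalty per cut point as a stand-in fitness, and it uses simple per-variable crossover and add/remove-cut mutation as stand-in operators. The paper's fitness function also scores the learned network structure, which this sketch does not attempt.

```python
import math
import random
from bisect import bisect_right
from collections import Counter


def discretize(values, cuts):
    """Map each continuous value to the index of its interval under sorted cuts."""
    return [bisect_right(cuts, v) for v in values]


def mutual_information(xs, ys):
    """Empirical mutual information (in nats) between two discrete sequences."""
    n = len(xs)
    px, py, pxy = Counter(xs), Counter(ys), Counter(zip(xs, ys))
    return sum((c / n) * math.log(c * n / (px[x] * py[y]))
               for (x, y), c in pxy.items())


def fitness(policy, data, penalty=0.02):
    """Stand-in fitness (assumption): reward dependence preserved between
    variables, penalize the total number of cut points (conciseness)."""
    cols = [discretize(col, cuts) for col, cuts in zip(data, policy)]
    score = sum(mutual_information(cols[i], cols[j])
                for i in range(len(cols)) for j in range(i + 1, len(cols)))
    return score - penalty * sum(len(cuts) for cuts in policy)


def crossover(a, b, rng):
    """For each variable, inherit the cut list from one of the two parents."""
    return [list(rng.choice((ca, cb))) for ca, cb in zip(a, b)]


def mutate(policy, data, rng):
    """Add or remove one cut point on a randomly chosen variable."""
    child = [list(cuts) for cuts in policy]
    i = rng.randrange(len(child))
    if child[i] and rng.random() < 0.5:
        child[i].pop(rng.randrange(len(child[i])))
    else:
        child[i].append(rng.choice(data[i]))
    child[i] = sorted(set(child[i]))  # keep cuts sorted and distinct
    return child


def evolve(data, generations=40, pop_size=20, seed=0):
    """Evolve policies for all variables jointly; data is one list per variable."""
    rng = random.Random(seed)
    pop = [[[rng.choice(col)] for col in data] for _ in range(pop_size)]
    for _ in range(generations):
        pop.sort(key=lambda p: fitness(p, data), reverse=True)
        elite = pop[: pop_size // 2]  # the fitter half survives unchanged
        children = [mutate(crossover(rng.choice(elite), rng.choice(elite), rng),
                           data, rng)
                    for _ in range(pop_size - len(elite))]
        pop = elite + children
    return max(pop, key=lambda p: fitness(p, data))
```

Calling `evolve([xs, ys])` on two equal-length lists of floats returns one cut-point list per variable. Because each candidate is scored on all variables at once, the cut points for one variable are chosen in light of the others, and no variable ordering has to be fixed beforehand. Those are the two properties the abstract attributes to the multivariate, genetic approach.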