A segregated genetic programming for bioprocess modelling with outliers
Yanling Wu,Jiangang Lu,Youxian Sun
DOI: https://doi.org/10.1002/apj.207
2008-01-01
Asia-Pacific Journal of Chemical Engineering
Abstract:Genetic programming (GP) is often used to model a complex nonlinear system. Nevertheless, if the training data obtained from an industrial process are corrupted by large noise or outliers, the simple GP and GP based on least squares estimator usually cannot come up with an acceptable solution. To overcome this problem, a novel robust GP based on M-estimator is proposed. Moreover, these cut-off parameters of the estimator play a crucial role in degrading the effects of outliers. Usually an optimal value of the cut-off parameter exists but without a priori knowledge of the training data, it is difficult to define it. So a segregated GP using two different cut-off parameters is proposed to solve this problem. The novel feature of this approach is that the algorithm can perform multi-directional search on the whole problem space for different cut-off parameters, so that it can get mixed information from different directional searches and has more chance to find an acceptable solution. In addition, the proposed approach is less sensitive to the values of the cut-off parameters and performs almost as good as a GP with an ideal cut-off parameter. (c) 2008 Curtin University of Technology and John Wiley & Sons, Ltd.