A novel artificial bee colony clustering algorithm with comprehensive improvement
Qiumei Pu,Chiquan Xu,Hui Wang,Lina Zhao
DOI: https://doi.org/10.1007/s00371-021-02367-0
IF: 2.835
2022-01-01
The Visual Computer
Abstract:This paper provides a novel clustering algorithm named CEABC (Comprehensively Enhanced Artificial Bee Colony Algorithm), enhanced by multiple operators including K-means, PSO and directed by Gbest mechanism is proposed, which can be used to solve clustering problem. In the initial stage, the algorithm uses K-means operator to generate the initial nectar source, which not only improves the quality of the initial nectar source, but also avoids the problem of low operating efficiency. In order to enhance the interaction between individuals, the concept of global optimum solution which can also be called Gbest is introduced, where the original one-dimensional information exchange is replaced by the full-dimensional information exchange among nectar sources, so that the information exchange volume of the whole swarm can be improved. In the phase of bee scouting, the above-mentioned global optimal solution is combined with PSO algorithm to generate a brand-new nectar source search method, which will enhance the ability of ABC algorithm to develop nectar sources. Meanwhile, the global optimal solution will continue to guide the bee scouting to generate new nectar sources so as to improve the overall quality of the nectar sources. Then, the experiment carried out on two sets of artificial data sets and four sets of UCI machine learning data sets which are the most representative verifies the clustering performance of the newly proposed CEABC algorithm. In addition, the experimental results are compared with standard ABC algorithm, several other newly proposed ABC-improved algorithms and classical clustering algorithms. Results clearly show that the novel ABC algorithm described in this paper has better accuracy and stability in solving clustering problems, which is a more effective clustering algorithm. We believe that clustering method for high-dimensional data can provide an effective method for data processing in materials, medical treatment, automatic driving and other application fields.