An Ensemble Learning Integration of Multiple CNN with Improved Vision Transformer Models for Pest Classification

Wanshang Xia, Dezhi Han, Dun Li, Zhongdai Wu, Bing Han, Junxiang Wang
DOI: https://doi.org/10.1111/aab.12804
2023-01-01
Annals of Applied Biology
Abstract: Pests are a major threat to crop growth, and precise pest classification helps in formulating effective prevention and control strategies. Existing pest classification methods suffer from low efficiency and poor adaptability to large-scale environments. To address these problems, this paper proposes a new pest classification method based on a convolutional neural network (CNN) and an improved Vision Transformer model. First, MMAlNet is designed to extract features of the target object at different scales and at a finer granularity. Then, a classification model called DenseNet Vision Transformer (DNVT), which combines a CNN with an improved Vision Transformer, is proposed. DNVT models both long-distance dependencies and local features, which effectively improves pest classification accuracy. Finally, an ensemble learning algorithm combines the classification predictions of MMAlNet and DNVT through soft voting, further improving classification accuracy. Simulation experiments on the D0 and IP102 datasets show that the proposed method attains maximum classification accuracies of 99.89% and 74.20%, respectively, outperforming other state-of-the-art methods and indicating high practical application value.
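The soft-voting step mentioned in the abstract can be illustrated with a minimal sketch. It assumes each model outputs a per-class probability vector for every image and that the ensemble takes a weighted average of those vectors before choosing the highest-scoring class. The function name `soft_vote`, the equal default weights, and the toy probabilities are illustrative assumptions; the abstract does not specify how the paper weights MMAlNet and DNVT.

```python
from typing import List, Optional, Sequence


def soft_vote(prob_sets: Sequence[Sequence[Sequence[float]]],
              weights: Optional[Sequence[float]] = None) -> List[int]:
    """Weighted soft voting over per-class probabilities.

    prob_sets[m][i][c] is the probability that model m assigns to
    class c for sample i. Returns one predicted class index per sample.
    Equal weights are an assumption, not the paper's stated setting.
    """
    n_models = len(prob_sets)
    if weights is None:
        weights = [1.0] * n_models
    total = float(sum(weights))
    n_samples = len(prob_sets[0])
    n_classes = len(prob_sets[0][0])

    predictions = []
    for i in range(n_samples):
        # Weighted average of the models' probabilities for each class.
        avg = [
            sum(w * probs[i][c] for w, probs in zip(weights, prob_sets)) / total
            for c in range(n_classes)
        ]
        # Pick the class with the highest averaged probability.
        predictions.append(max(range(n_classes), key=avg.__getitem__))
    return predictions


# Toy probabilities for two images and three pest classes (made-up values).
mmalnet_probs = [[0.6, 0.3, 0.1], [0.2, 0.5, 0.3]]
dnvt_probs = [[0.2, 0.7, 0.1], [0.1, 0.3, 0.6]]

ensemble_preds = soft_vote([mmalnet_probs, dnvt_probs])
print(ensemble_preds)
```

Unlike hard (majority) voting, soft voting lets a confident model outweigh an uncertain one: in the toy data the first image is assigned to class 1 because the second model's 0.7 outweighs the first model's 0.6 for class 0.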