An efficient transformer algorithm for image recognition based on ensemble learning methodology

Yihong Tang
DOI: https://doi.org/10.1117/12.2604565
2021-10-05
Abstract:Cassava is an important food security crop in Africa because it can withstand harsh environments. However, viral diseases are the main reason for crop failure. Based on photos of cassava provided by local farmers in Africa, it is possible to use deep learning technology to identify some common viral diseases so that they can be treated. This paper introduced an image classification algorithm based on an ensemble learning[4] model, which combined Vision Transformer[2] and EfficientNet [1]. The practice has proved that the model proposed in this paper has a certain improvement in performance compared with traditional image classification methods and can effectively help local farmers.
What problem does this paper attempt to address?