GA-Based Weighted Ensemble Learning for Multi-Label Aerial Image Classification Using Convolutional Neural Networks and Vision Transformers

Ming-Hseng Tseng
DOI: https://doi.org/10.1088/2632-2153/ad10cf
2023-11-29
Machine Learning: Science and Technology
Abstract: Multi-label classification of aerial images is a crucial task in remote sensing image analysis. Traditional image classification methods have limitations in image feature extraction, leading to an increasing use of deep learning models, such as Convolutional Neural Networks (CNNs) and Vision Transformers (ViTs). However, the standalone use of these models may have limitations when dealing with multi-label classification. To enhance the generalization performance of multi-label classification of aerial images, this paper combines two CNN and two ViT models, comparing four single deep learning models, a manually weighted ensemble learning method, and a GA-based weighted ensemble method. The experimental results using two public multi-label aerial image datasets show that the classification performance of ViT models is better than that of CNN models, the traditional weighted ensemble learning model performs better than a single deep learning model, and the GA-based weighted ensemble method performs better than the manually weighted ensemble learning method. The GA-based weighted ensemble method proposed in this study can achieve better multi-label classification performance of aerial images than previous results.
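The general idea of a GA-based weighted ensemble can be illustrated with a minimal sketch. The abstract does not specify the fitness function, the genetic operators, or the decision threshold, so everything below is an assumption made for illustration: the function names (`ensemble_predict`, `micro_f1`, `ga_search_weights`), the micro-averaged F1 fitness, the blend crossover with Gaussian mutation, the 0.5 threshold, and all hyperparameter defaults are hypothetical and are not taken from the paper. The input `probs` stands in for the per-label probabilities produced by the individual CNN and ViT models on a validation set.

```python
import numpy as np

def ensemble_predict(probs, weights, threshold=0.5):
    """Weighted average of model probabilities, then per-label thresholding.

    probs: array of shape (n_models, n_samples, n_labels)
    weights: array of shape (n_models,), normalised here to sum to 1
    """
    w = np.asarray(weights, dtype=float)
    w = w / w.sum()
    combined = np.tensordot(w, probs, axes=1)  # -> (n_samples, n_labels)
    return (combined >= threshold).astype(int)

def micro_f1(y_true, y_pred):
    """Micro-averaged F1 over all sample/label pairs (assumed fitness)."""
    tp = np.sum((y_true == 1) & (y_pred == 1))
    fp = np.sum((y_true == 0) & (y_pred == 1))
    fn = np.sum((y_true == 1) & (y_pred == 0))
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0

def ga_search_weights(probs, y_true, pop_size=30, generations=40,
                      mutation_scale=0.1, seed=0):
    """Search ensemble weights with a simple elitist genetic algorithm."""
    rng = np.random.default_rng(seed)
    n_models = probs.shape[0]
    pop = rng.random((pop_size, n_models)) + 1e-6
    pop[0] = 1.0  # include the uniform (equal-weight) ensemble as a baseline
    n_elite = max(1, pop_size // 2)

    def fitness(w):
        return micro_f1(y_true, ensemble_predict(probs, w))

    for _ in range(generations):
        scores = np.array([fitness(w) for w in pop])
        # Selection: keep the best-scoring individuals unchanged (elitism).
        elite = pop[np.argsort(scores)[::-1][:n_elite]]
        n_children = pop_size - n_elite
        # Crossover: random convex blend of two elite parents.
        pa = elite[rng.integers(n_elite, size=n_children)]
        pb = elite[rng.integers(n_elite, size=n_children)]
        alpha = rng.random((n_children, 1))
        children = alpha * pa + (1 - alpha) * pb
        # Mutation: Gaussian noise, clipped so weights stay positive.
        children = np.clip(
            children + rng.normal(0.0, mutation_scale, children.shape),
            1e-6, None)
        pop = np.vstack([elite, children])

    scores = np.array([fitness(w) for w in pop])
    best = pop[np.argmax(scores)]
    return best / best.sum(), float(scores.max())
```

Because the equal-weight vector is placed in the initial population and the elite are carried over unchanged, the weights returned by this sketch score at least as well as the uniform ensemble on the data used for the search. Any gain should still be confirmed on a held-out test set, since the weights are tuned against the same labels used to score them.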