Leveraging Large Vision Models for Terrain Classification in HiRISE Mars Orbital Images

Mrs. J. Mary,Chrisil David,Hanna Priyadharshini,M. Aswin
DOI: https://doi.org/10.1109/ICCCNT61001.2024.10725508
2024-06-24
Abstract:In this paper, we present a comprehensive benchmarking and comparative analysis of state-of-the-art Vision Transformer (ViT) architectures, including ViT, BEiT, DeiT, LeViT, and SwinV2, for the classification of HiRISE landmark images. Our study aims to evaluate the performance of these pre-trained models in categorizing images into seven predefined classes, with an additional “unknown” class to account for images that do not fit into these categories. By leveraging the advanced capabilities of transformer-based models, we seek to enhance the accuracy and efficiency of classifying high-resolution satellite imagery, a critical task in remote sensing and planetary exploration.
Environmental Science,Engineering,Computer Science
What problem does this paper attempt to address?