Abstract: Artificial Intelligence (AI) technologies have profoundly transformed the field of remote sensing, revolutionizing data collection, processing, and analysis. Traditionally reliant on manual interpretation and task-specific models, remote sensing has been significantly enhanced by the advent of foundation models: large-scale, pre-trained AI models capable of performing a wide array of tasks with unprecedented accuracy and efficiency. This paper provides a comprehensive survey of foundation models in the remote sensing domain, covering models released between June 2021 and June 2024. We categorize these models based on their applications in computer vision and domain-specific tasks, offering insights into their architectures, pre-training datasets, and methodologies. Through detailed performance comparisons, we highlight emerging trends and the significant advancements achieved by these foundation models. Additionally, we discuss the technical challenges, practical implications, and future research directions, addressing the need for high-quality data, computational resources, and improved model generalization. Our research also finds that pre-training methods, particularly self-supervised learning techniques like contrastive learning and masked autoencoders, significantly enhance the performance and robustness of foundation models in remote sensing tasks such as scene classification, object detection, and other applications. This survey aims to serve as a resource for researchers and practitioners by providing a panorama of advances and promising pathways for continued development and application of foundation models in remote sensing.

### What problem does this paper attempt to solve?

This paper aims to comprehensively investigate and evaluate large pre-trained foundation models used in the field of remote sensing. Specifically, the paper focuses on the following points:

1. **Background and Methods**:
   - The paper first introduces the background and development of foundation models, explaining how these models significantly enhance the performance and efficiency of remote sensing tasks through large-scale pre-training data and complex architectures.
   - It discusses in detail the application of foundation models in computer vision tasks and specific domain tasks, including scene classification, semantic segmentation, object detection, change detection, etc.
2. **Model Classification and Analysis**:
   - The foundation models are classified according to their application domains, covering 51 vision models released from June 2021 to June 2024.
   - A detailed analysis and comparison of each model's architecture, pre-training datasets, pre-training methods, and performance are provided.
3. **Technical Challenges and Future Directions**:
   - The paper discusses the challenges faced by foundation models in the field of remote sensing, such as the need for high-quality data, limitations of computational resources, and the improvement of model generalization capabilities.
   - Future research directions are proposed, emphasizing the importance of self-supervised learning techniques (such as contrastive learning and masked autoencoders) in improving model performance and robustness.
4. **Practical Applications and Impact**:
   - The paper explores the practical applications of foundation models in areas such as environmental monitoring, agriculture, urban planning, and disaster management, demonstrating how these models significantly improve the performance of various downstream tasks.
   - It highlights the advantages of foundation models in handling large-scale, multi-modal, and multi-temporal remote sensing data.

### Summary

By systematically reviewing and analyzing foundation models in the field of remote sensing, this paper aims to provide researchers and practitioners with a comprehensive resource to help them understand the latest developments and potential application directions of these models. The paper not only summarizes the strengths and weaknesses of existing models but also points out possible paths for future research, providing important references for advancing remote sensing technology.

AI Foundation Models in Remote Sensing: A Survey
