Abstract:Pixels covered by clouds in optical Earth Observation images are not usable for most applications. For this reason, only images delivered with reliable cloud masks are eligible for an automated or massive analysis. Current state of the art cloud detection algorithms, both physical models and machine learning models, are specific to a mission or a mission type, with limited transferability. A new model has to be developed every time a new mission is launched. Machine Learning may overcome this problem and, in turn obtain state of the art, or even better performances by training a same algorithm on datasets from different missions. However, simulating products for upcoming missions is not always possible and available actual products are not enough to create a training dataset until well after the launch. Furthermore, labelling data is time consuming. Therefore, even by the time when enough data is available, manually labelled data might not be available at all.&#160;To solve this bottleneck, we propose a transfer learning based method using the available products of the current generation of satellites. These existing products are gathered in a database that is used to train a deep convolutional neural network (CNN) solely on those products. The trained model is applied to images from other - unseen - sensors and the outputs are evaluated. We avoid labelling manually by automatically producing the ground data with existing algorithms. Only a few semi-manually labelled images are used for qualifying the model. Even those semi-manually labelled samples need very few user inputs. This drastic reduction of user input limits subjectivity and reduce the costs.&#160;We provide an example of such a process by training a model to detect clouds in Sentinel-2 images, using as ground-truth the masks of existing state-of-the-art processors. Then, we apply the trained network to detect clouds in previously unseen imagery of other sensors such as the SPOT family or the High-Resolution (HR) Pleiades imaging system, which provide a different feature space.The results demonstrate that the trained model is robust to variations within the individual bands resulting from different acquisition methods and spectral responses. Furthermore, the addition of geo-located auxiliary data that is independent from the platform, such as digital elevation models (DEMs), as well as simple synthetic bands such as the NDVI or NDSI, further improves the results.In the future, this approach opens up the possibility to be used on new CNES&#8217; missions, such as Microcarb or CO3D.

Fewshot learning on global multimodal embeddings for earth observation tasks

MMEarth: Exploring Multi-Modal Pretext Tasks For Geospatial Representation Learning

OmniSat: Self-Supervised Modality Fusion for Earth Observation

Learning transferable cross-modality representations for few-shot hyperspectral and LiDAR collaborative classification

SatCLIP: Global, General-Purpose Location Embeddings with Satellite Imagery

SenCLIP: Enhancing zero-shot land-use mapping for Sentinel-2 with ground-level prompting

Multi-Label Guided Soft Contrastive Learning for Efficient Earth Observation Pretraining

Boosting crop classification by hierarchically fusing satellite, rotational, and contextual data

Deep Occlusion Framework for Multimodal Earth Observation Data

Learning Shared Cross-modality Representation Using Multispectral-LiDAR and Hyperspectral Data

Multimodal Crop Type Classification Fusing Multi-Spectral Satellite Time Series with Farmers Crop Rotations and Local Crop Distribution

Enhancing Self-Supervised Learning for Remote Sensing with Elevation Data: A Case Study with Scarce And High Level Semantic Labels

Multilabel-Guided Soft Contrastive Learning for Efficient Earth Observation Pretraining

SegEarth-OV: Towards Training-Free Open-Vocabulary Segmentation for Remote Sensing Images

Improving future optical Earth Observation products using transfer learning

A Generalizable and Accessible Approach to Machine Learning with Global Satellite Imagery

Assisted learning for land use classification: The important role of semantic correlation between heterogeneous images

More Diverse Means Better: Multimodal Deep Learning Meets Remote Sensing Imagery Classification

SSL4EO-S12: A Large-Scale Multi-Modal, Multi-Temporal Dataset for Self-Supervised Learning in Earth Observation

Exploring Robust Features for Few-Shot Object Detection in Satellite Imagery