CloudSEN12+: The largest dataset of expert-labeled pixels for cloud and cloud shadow detection in Sentinel-2

Cesar Aybar,Lesly Bautista,David Montero,Julio Contreras,Daryl Ayala,Fernando Prudencio,Jhomira Loja,Luis Ysuhuaylas,Fernando Herrera,Karen Gonzales,Jeanett Valladares,Lucy A Flores,Evelin Mamani,Maria Quiñonez,Rai Fajardo,Wendy Espinoza,Antonio Limas,Roy Yali,Alejandro Alcántara,Martin Leyva,Raúl Loayza-Muro,Bram Willems,Gonzalo Mateo-García,Luis Gómez-Chova
DOI: https://doi.org/10.1016/j.dib.2024.110852
2024-08-19
Abstract:Detecting and screening clouds is the first step in most optical remote sensing analyses. Cloud formation is diverse, presenting many shapes, thicknesses, and altitudes. This variety poses a significant challenge to the development of effective cloud detection algorithms, as most datasets lack an unbiased representation. To address this issue, we have built CloudSEN12+, a significant expansion of the CloudSEN12 dataset. This new dataset doubles the expert-labeled annotations, making it the largest cloud and cloud shadow detection dataset for Sentinel-2 imagery up to date. We have carefully reviewed and refined our previous annotations to ensure maximum trustworthiness. We expect CloudSEN12+ will be a valuable resource for the cloud detection research community.
What problem does this paper attempt to address?