A Sentinel-2 multi-year, multi-country benchmark dataset for crop classification and segmentation with deep learning

Dimitrios Sykas,Maria Sdraka,Dimitrios Zografakis,Ioannis Papoutsis
DOI: https://doi.org/10.48550/arXiv.2204.00951
2022-04-28
Abstract:In this work we introduce Sen4AgriNet, a Sentinel-2 based time series multi country benchmark dataset, tailored for agricultural monitoring applications with Machine and Deep Learning. Sen4AgriNet dataset is annotated from farmer declarations collected via the Land Parcel Identification System (LPIS) for harmonizing country wide labels. These declarations have only recently been made available as open data, allowing for the first time the labeling of satellite imagery from ground truth data. We proceed to propose and standardise a new crop type taxonomy across Europe that address Common Agriculture Policy (CAP) needs, based on the Food and Agriculture Organization (FAO) Indicative Crop Classification scheme. Sen4AgriNet is the only multi-country, multi-year dataset that includes all spectral information. It is constructed to cover the period 2016-2020 for Catalonia and France, while it can be extended to include additional countries. Currently, it contains 42.5 million parcels, which makes it significantly larger than other available archives. We extract two sub-datasets to highlight its value for diverse Deep Learning applications; the Object Aggregated Dataset (OAD) and the Patches Assembled Dataset (PAD). OAD capitalizes zonal statistics of each parcel, thus creating a powerful label-to-features instance for classification algorithms. On the other hand, PAD structure generalizes the classification problem to parcel extraction and semantic segmentation and labeling. The PAD and OAD are examined under three different scenarios to showcase and model the effects of spatial and temporal variability across different years and different countries.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The main problem that this paper attempts to solve is the lack of standardized, cross - time - and - space consistently - labeled multi - country, multi - year satellite datasets in agricultural monitoring. Specifically, the author points out that within the European Union (EU), due to the non - standardization of national label systems and the limitations of publicly accessing farmers' declared data, it is difficult to obtain large - scale, high - quality satellite image datasets of agricultural land that can be used for training and evaluating deep - learning models. These problems impede the generalization ability of deep - learning models between different countries and regions. To solve the above problems, the author has developed and provided the Sen4AgriNet dataset, which is a multi - country, multi - year benchmark dataset based on Sentinel - 2 satellite time - series images, aiming to support research in machine - learning / deep - learning applications such as crop classification, parcel extraction, and semantic segmentation. The main features of Sen4AgriNet include: 1. **Multi - temporal**: Capturing changes in crop phenology. 2. **Multi - annual**: Modeling seasonal changes. 3. **Multi - national**: Modeling geospatial variability. 4. **Object aggregation**: Processing in combination with ground - truth data (parcel geometric information). 5. **Modular**: It can be extended to more countries or incorporate other sensors and non - Earth - observation data (such as meteorological data). In addition, Sen4AgriNet also introduces a unified crop classification method, which is based on the indicative crop classification scheme of the Food and Agriculture Organization of the United Nations (FAO) and customized according to the requirements of the Common Agricultural Policy (CAP) of the EU. This helps to solve the problem of inconsistent crop labels among different countries, thereby improving the generalization ability of the model. By creating such a large - scale, high - quality, and spatio - temporally consistent dataset, Sen4AgriNet provides researchers with a powerful tool to explore and develop more effective agricultural remote - sensing technologies, and support the development in fields such as smart agriculture, CAP implementation, and agricultural insurance.