Abstract:Background InformationAnnotated overhead imagery dataset for the paper "SolarMapper: estimating solar array location, size, and capacity using deep learning and overhead imagery".This dataset is a subset of the very high resolution aerial imagery provided through the Connecticut Department of Energy and Environmental Protection shared via the University of Connecticut (CT ECO 2016 Imagery & Elevation (uconn.edu)). The original resolution of the imagery is 3 inch (7.62 cm) and we downsampled all of the imagery to a resolution of 30 cm, which is the same resolution as most high resolution satellite imagery.87 image tiles in total are included in this dataset. We manually annotated all visible solar PV panels with polygons for semantic segmentation purposes. We split the dataset into a training and a validation set at a 2:1 ratio. Details of each subset are shown below:Training set: 57 image tiles, 33.12 km2 of ground area, 608 solar PV arrays, 0.058 km2 of solar PV panel area.Validation set: 30 image tiles, 17.43 km2 of ground area, 1,003 solar PV arrays, 0.091 km2 of solar PV panel area.Additionally, we compared our sampled imagery with Microsoft's US Building Footprints dataset (microsoft/USBuildingFootprints: Computer generated building footprints for the United States (github.com)) to split all image tiles into 3 brackets (high, medium, and low building density) by the number of buildings with in an image tile. More details about how we assigned these image tiles could be found in the paper. Such assignment was only applied to the 30 validation tiles.File Descriptionct_images_and_labels.zipThis .zip file contains all aerial image (.jpg) files and corresponding semantic segmentation annotation mask (.png) files. All .jpg imagery files are 8-bit RGB images and all .png files are binary arrays where 1 is for solar PV pixels and 0 is for non-solar PV pixels. An aerial imagery file and its corresponding mask file share the same filename but have different filename extensions. E.g., 000795_sw.jpg is the aerial imagery file for tile 000795_sw, and 000795_sw.png is the corresponding mask file.tile_assignments.csvThis .csv file records if an image tile is in the training or validation set, and if it's a high, medium or low building density tile. There are 3 columns in the .csv file:- tile_name (String): The tile name ID of a tile. This corresponds to filanmes of .jpg aerial imagery files and .png annotation mask files.- training_split (Categorical): Whether the image tile is in the training or the validation set. Values are either 'train' for the training set or "valid" for the validation set.- density_split (Categorical): Which building density bracket does the image tile belong to. Since the building density based assignment was only applied to the 30 validation tiles, all training tiles have "NA" in this column. For the validation tiles, values are either "high", "med", or "low" corresponding to the 3 building density brackets (high, medium and low).

Connecticut Solar PV Semantic Segmentation Dataset

Multi-resolution dataset for photovoltaic panel segmentation from satellite and aerial imagery

A crowdsourced dataset of aerial images with annotated solar photovoltaic arrays and installation metadata

SolarDK: A high-resolution urban solar panel image classification and localization dataset

HyperionSolarNet: Solar Panel Detection from Aerial Images

What You Get is Not Always What You See: Pitfalls in Solar Array Assessment Using Overhead Imagery

Deeplab V3+ Based Segmentation Method for PV Panels with Aerial Orthoimages

Application of a semantic segmentation convolutional neural network for accurate automatic detection and mapping of solar photovoltaic arrays in aerial imagery

CUS3D: A New Comprehensive Urban-Scale Semantic-Segmentation 3D Benchmark Dataset

SolarFormer: Multi-scale Transformer for Solar PV Profiling

Generalized deep learning model for photovoltaic module segmentation from satellite and aerial imagery

Mask R-CNN Based Segmentation Method for Satellite Imagery of Photovoltaics Generation Systems

Rooftop PV Segmenter: A Size-Aware Network for Segmenting Rooftop Photovoltaic Systems from High-Resolution Imagery

GloSoFarID: Global multispectral dataset for Solar Farm IDentification in satellite imagery

UAVid: A semantic segmentation dataset for UAV imagery

SensatUrban: Learning Semantics from Urban-Scale Photogrammetric Point Clouds

Extracting 10-M Resolution Photovoltaic Landcover Using A Slightly Modified U-Net and Sentinel-2 Images

SpaceNet 6: Multi-Sensor All Weather Mapping Dataset

Annotated Dataset for Training Cloud Segmentation Neural Networks Using High-Resolution Satellite Remote Sensing Imagery

A Novel Framework for Solar Panel Segmentation From Remote Sensing Images: Utilizing Chebyshev Transformer and Hyperspectral Decomposition

Satellite Sunroof: High-res Digital Surface Models and Roof Segmentation for Global Solar Mapping