Connecticut Solar PV Semantic Segmentation Dataset

Wei Hu,Kyle Bradbury
DOI: https://doi.org/10.6084/m9.figshare.18982199.v3
2022-01-01
Abstract:Background InformationAnnotated overhead imagery dataset for the paper "SolarMapper: estimating solar array location, size, and capacity using deep learning and overhead imagery".This dataset is a subset of the very high resolution aerial imagery provided through the Connecticut Department of Energy and Environmental Protection shared via the University of Connecticut (CT ECO 2016 Imagery & Elevation (uconn.edu)). The original resolution of the imagery is 3 inch (7.62 cm) and we downsampled all of the imagery to a resolution of 30 cm, which is the same resolution as most high resolution satellite imagery.87 image tiles in total are included in this dataset. We manually annotated all visible solar PV panels with polygons for semantic segmentation purposes. We split the dataset into a training and a validation set at a 2:1 ratio. Details of each subset are shown below:Training set: 57 image tiles, 33.12 km2 of ground area, 608 solar PV arrays, 0.058 km2 of solar PV panel area.Validation set: 30 image tiles, 17.43 km2 of ground area, 1,003 solar PV arrays, 0.091 km2 of solar PV panel area.Additionally, we compared our sampled imagery with Microsoft's US Building Footprints dataset (microsoft/USBuildingFootprints: Computer generated building footprints for the United States (github.com)) to split all image tiles into 3 brackets (high, medium, and low building density) by the number of buildings with in an image tile. More details about how we assigned these image tiles could be found in the paper. Such assignment was only applied to the 30 validation tiles.File Descriptionct_images_and_labels.zipThis .zip file contains all aerial image (.jpg) files and corresponding semantic segmentation annotation mask (.png) files. All .jpg imagery files are 8-bit RGB images and all .png files are binary arrays where 1 is for solar PV pixels and 0 is for non-solar PV pixels. An aerial imagery file and its corresponding mask file share the same filename but have different filename extensions. E.g., 000795_sw.jpg is the aerial imagery file for tile 000795_sw, and 000795_sw.png is the corresponding mask file.tile_assignments.csvThis .csv file records if an image tile is in the training or validation set, and if it's a high, medium or low building density tile. There are 3 columns in the .csv file:- tile_name (String): The tile name ID of a tile. This corresponds to filanmes of .jpg aerial imagery files and .png annotation mask files.- training_split (Categorical): Whether the image tile is in the training or the validation set. Values are either 'train' for the training set or "valid" for the validation set.- density_split (Categorical): Which building density bracket does the image tile belong to. Since the building density based assignment was only applied to the 30 validation tiles, all training tiles have "NA" in this column. For the validation tiles, values are either "high", "med", or "low" corresponding to the 3 building density brackets (high, medium and low).
What problem does this paper attempt to address?