GeoSynth: Contextually-Aware High-Resolution Satellite Image Synthesis

Srikumar Sastry,Subash Khanal,Aayush Dhakal,Nathan Jacobs
2024-04-10
Abstract:We present GeoSynth, a model for synthesizing satellite images with global style and image-driven layout control. The global style control is via textual prompts or geographic location. These enable the specification of scene semantics or regional appearance respectively, and can be used together. We train our model on a large dataset of paired satellite imagery, with automatically generated captions, and OpenStreetMap data. We evaluate various combinations of control inputs, including different types of layout controls. Results demonstrate that our model can generate diverse, high-quality images and exhibits excellent zero-shot generalization. The code and model checkpoints are available at
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The paper aims to address the problem of high-resolution satellite image synthesis, with specific objectives including: 1. **Controllability**: Achieving global style control through text prompts or geographic coordinates, allowing users to specify scene semantics or regional appearance. 2. **Layout Control**: Using reference images (such as OpenStreetMap images) to control the layout of the generated satellite images. 3. **Synthesis Capability under Geographic Conditions**: Combining geographic location information (features extracted through SatCLIP) to enable the model to generate satellite images based on the geographic characteristics of specific areas. 4. **Diversity and High Quality**: Generating diverse and high-quality satellite images, demonstrating good zero-shot generalization capability. Through these technical means, the proposed method in the paper not only synthesizes realistic satellite images but also allows for style and layout control of the images through different conditional inputs without altering the trained model. This research is expected to bring new solutions to application fields such as urban planning and data augmentation.