Multi-Scale Cropping Mechanism for Remote Sensing Image Captioning

Xueting Zhang,Qi Wang,Shangdong Chen,Xuelong Li
DOI: https://doi.org/10.1109/IGARSS.2019.8900503
2019-01-01
Abstract:With the rapid development of artificial satellite, a large number of high resolution remote sensing images can be easily obtained now. Recently, remote sensing image captioning, which aims to generate accurate and concise descriptive sentences for remote sensing images, has been promoted by template-based model and encoder-decoder model with several related datasets released. Based on an encoder-decoder model, we propose a training mechanism of multi-scale cropping for remote sensing image captioning in this paper, which can extract more fine-grained information from remote sensing images and enhance the generalization performance of the base model. The experimental results on two datasets UCM-captions and Sydney-captions demonstrate that the proposed approach availably improves the performances in describing high resolution remote sensing images.
What problem does this paper attempt to address?