Improved Visual Vocabularies for Scene Classification of High Resolution Remote Sensing Imagery in Urban Areas

Lijun Zhao,Ping Tang
DOI: https://doi.org/10.1109/jurse.2019.8808962
2019-01-01
Abstract:The improvement of spatial resolution of remote sensing images provides more and more ground details, which makes it possible to better interpret unban land use and analyze urban functional areas. The bag-of-visual-words (BOVW) model becomes a well-known method to deal with such land-use scene classification problems. Generally, the unsupervised k-means clustering method is used to construct visual dictionaries, in which there often exist large amounts of local features to cluster in the visual vocabulary construction stage, largely affecting the computational complexity and hindering the schedule of visual dictionary generation. To solve the above mentioned problems, this paper proposes a two-phase k-means clustering based visual vocabulary construction method. The proposed method can add information of predefined scene categories from different images, which is beneficial for the selection of initial cluster centers, and, to a certain extent, alleviates the problem that the amount of samples to be clustered in a single course of clustering can be too large. Experimental results on the public UCM-21 dataset show that compared with the traditional method, the proposed one can not only reduce the dictionary construction time but also help improve the classification accuracy.
What problem does this paper attempt to address?