Cloud Implementation of the K-means Algorithm for Hyperspectral Image Analysis

Juan Mario Haut,Mercedes Paoletti,Javier Plaza,Antonio Plaza
DOI: https://doi.org/10.1007/s11227-016-1896-3
IF: 3.3
2017-01-01
The Journal of Supercomputing
Abstract:Remotely sensed hyperspectral imaging offers the possibility to collect hundreds of images, at different wavelength channels, for the same area on the surface of the Earth. Hyperspectral images are characterized by their large volume and dimensionality, which makes their processing and storage difficult. As a result, several techniques have been developed in previous years to perform hyperspectral image analysis on high-performance computing architectures. However, the application of cloud computing techniques has not been as widespread. There are many potential advantages in exploiting cloud computing architectures for distributed hyperspectral image analysis. In this paper, we present a cloud implementation (developed using Apache Spark) of the popular K-means algorithm for unsupervised hyperspectral image clustering. The experimental results suggest that cloud architectures allow for the efficient distributed processing of large hyperspectral image data sets.
What problem does this paper attempt to address?