Unsupervised Deep Clustering for Fashion Images.

Cairong Yan,Umar Subhan Malhi,Yongfeng Huang,Ran Tao
DOI: https://doi.org/10.1007/978-3-030-21451-7_8
2019-01-01
Abstract:In many visual domains like fashion, building an effective unsupervised clustering model depends on visual feature representation instead of structured and semi-structured data. In this paper, we propose a fashion image deep clustering (FiDC) model which includes two parts, feature representation and clustering. The fashion images are used as the input and are processed by a deep stacked autoencoder to produce latent feature representation, and the output of this autoencoder will be used as the input of the clustering task. Since the output of the former has a great influence on the later, the strategy adopted in the model is to integrate the learning process of the autoencoder and the clustering together. The autoencoder is trained with the optimal number of neurons per hidden layers to avoid overfitting and we optimize the cluster centroid by using stochastic gradient descent and backpropagation algorithm. We evaluate FiDC model on a real-world fashion dataset downloaded from Amazon where images have been extracted into 4096-dimensional visual feature vectors by convolutional neural networks. The experimental results show that our model achieves state-of-the-art performance.
What problem does this paper attempt to address?