Using Line Segments to Train Multi-Stream Stacked Autoencoders for Image Classification

Xue-song Tang,Kuangrong Hao,Hui Wei,Yongsheng Ding
DOI: https://doi.org/10.1016/j.patrec.2017.05.025
IF: 4.757
2017-01-01
Pattern Recognition Letters
Abstract:Recently, deep learning paradigm and models derived from them have achieved outstanding success in many fields in computer vision such as object recognition, image classification and image segmentation. In this work, the authors preprocess images into segments and then extract their geometric information as inputs to stacked autoencoders. A multi-stream framework based on the different geometric feature spaces of the segments is implemented to learn deep geometric representations that have more discriminative powers and generative capabilities. In order to assess the robustness and smoothness of the proposed representation, four representative Geometric Feature Sets (GFSs) are investigated. To further verify the effectiveness of the proposed method, we apply those GFSs for the image classification experiments on four challenging datasets. Given a smaller size of depth, the proposed multi-stream method achieves comparable or better results compared to the best performers. (C) 2017 Elsevier B.V. All rights reserved.
What problem does this paper attempt to address?