Integrating Pixels and Segments: A Deep-Learning Method Inspired by the Informational Diversity of the Visual Pathways

Xue-song Tang,Hui Wei,Kuangrong Hao,Mingbo Zhao,Dawei Li
DOI: https://doi.org/10.1016/j.neucom.2018.10.096
IF: 6
2019-01-01
Neurocomputing
Abstract:Visual cortex is able to process information in multiple pathways and integrate various forms of representations. This paper proposed a bio-inspired method that utilizes the line-segment-based representation to perform a dedicated channel for the geometric feature learning process. The extracted geometric information can be integrated with the original pixel-based information and implemented on both the convolutional neural networks (SegCNN) and the stacked autoencoders (SegSAE). Segment-based operations such as segConvolve and segPooling are designed to further process the extracted geometric features. The proposed models are verified on the MNIST dataset, Caltech 101 dataset and QuickDraw dataset for image classification. According to the experimental results, the proposed models can facilitate the classification accuracies especially when the sizes of the training set are limited. Particularly, the method based on multiple representations is found to be effective for classifying the hand-drawn sketches.
What problem does this paper attempt to address?