A Hierarchical Coarse-to-fine Perception for Small-Target Categorization of Butterflies under Complex Backgrounds
Zhongqi Lin,Jingdun Jia,Wanlin Gao,Feng Huang
DOI: https://doi.org/10.3233/jifs-190747
2020-01-01
Journal of Intelligent & Fuzzy Systems
Abstract:Small-target categorization of butterflies suffers from large-scale search space of candidate target locations, subtler discriminations, camouflaged appearances, and complex backgrounds. Precise localization and domain-specific discrimination extraction are crucial for this issue. In this work, a novel hierarchical coarse-to-fine convolutional neural network (C-t-FCNN) was proposed. It consists of CoarseNet and FineNet, which incorporate object-level and part-level representations into framework. Specifically, the coarse-grained features containing the orientation description are generated by CoarseNet, while the fine-grained discriminations with semantic distinctiveness are captured by FineNet. Next, the correspondences are established to mark the target regions, background regions, and mismatched regions depending on the quantification of scale-invariant feature transform (SIFT) descriptors. Then, the features are subsampled via spatial pyramid pooling (SPP) for size uniformity and integration. Finally, the irrelevant background and mismatched regions are eliminated by the support vector machine (SVM) with a radial basis function (RBF) kernel, leaving only the target-specific patches for finer-scale extraction. Hence the numeracy can be economized from identifying irrelevant areas and can be rescheduled in feature extraction and final decision, which can suppress time complexity simultaneously. A total of 119,016 augmented butterfly images spanning 47 categories are utilized for model training, while 13,734 images are evaluated for effectiveness verification. The C-t-FCNN delivers impressive performance, i.e., it achieves a validation accuracy of 92.08% and a testing accuracy of 91.6%, which outperforms state-of-the-arts.