Object Detection Algorithm Based On Attention Mask Fusion

Dong Xiao-Xiao, He Xiao-Hai, Wu Xiao-Hong, Qing Lin-Bo, Teng Qi-Zhi
DOI: https://doi.org/10.3788/YJYXS20193408.0825
2019-01-01
Chinese Journal of Liquid Crystals and Displays
Abstract: In computer vision, balancing the accuracy and speed of object detection matters for downstream applications such as object tracking and recognition. This paper proposes an object detection algorithm based on attention mask fusion. First, a VGG network extracts features, and a preliminary regression and binary classification step yields a set of preselected boxes. Next, the preselected boxes are fed into a feature pyramid structure, where an attention mask module adaptively learns effective features; fusing the feature pyramid structure with the attention mask module produces more representative features. Finally, multi-class classification and regression produce the multiscale detection results. Experiments are conducted on the PASCAL VOC 2007 and PASCAL VOC 2012 data sets. On the test sets, at an Intersection over Union (IoU) threshold of 0.5 and with 320 × 320 input images, the mean average precision (mAP) is 81.0% and 79.0%, respectively, and the detection speed is 60.9 fps. By integrating attention information into object detection, the method balances accuracy and speed for generic object detection.
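The reported mAP depends on the IoU criterion: a predicted box counts as a correct detection only when its overlap with a ground-truth box reaches the 0.5 threshold. The following is a minimal sketch of that check, not the authors' evaluation code. The function names `iou` and `is_true_positive`, the `(x_min, y_min, x_max, y_max)` box layout, and the inclusive comparison at the threshold are assumptions made for illustration.

```python
def iou(box_a, box_b):
    """Intersection over Union of two axis-aligned boxes.

    Boxes are assumed to be (x_min, y_min, x_max, y_max) tuples;
    this layout is an illustrative choice, not taken from the paper.
    """
    # Corners of the intersection rectangle.
    ix_min = max(box_a[0], box_b[0])
    iy_min = max(box_a[1], box_b[1])
    ix_max = min(box_a[2], box_b[2])
    iy_max = min(box_a[3], box_b[3])

    # Clamp to zero so disjoint boxes give an empty intersection.
    inter_w = max(0.0, ix_max - ix_min)
    inter_h = max(0.0, iy_max - iy_min)
    inter = inter_w * inter_h

    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter

    # Guard against degenerate (zero-area) inputs.
    return inter / union if union > 0 else 0.0


def is_true_positive(pred_box, gt_box, threshold=0.5):
    """Hypothetical helper: does the prediction match at the given IoU threshold?"""
    return iou(pred_box, gt_box) >= threshold
```

For example, two 10 × 10 boxes offset horizontally by 5 share an area of 50 out of a union of 150, so their IoU is 1/3 and the prediction would not count as a match at the 0.5 threshold.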