An enhanced algorithm for object detection based on generative adversarial structure

Yun Zhang, Cheng Huang, Yuyao Zhang, Shujuan Yu, Liya Huang, Na Xie
DOI: https://doi.org/10.1016/j.engappai.2024.108427
IF: 8
2024-05-01
Engineering Applications of Artificial Intelligence
Abstract: The performance of object detection networks is often limited by the depth of the feature extraction network. Simply increasing the number of network parameters may yield only limited improvements in detection performance. Careful additional design of network details is necessary, but it can significantly increase training difficulty. This paper introduces a novel object detection method based on generative adversarial training. Our approach takes minimizing the EM (Earth Mover's, i.e. Wasserstein) distance of the feature distribution as the primary training objective. We enhance the image features so that GAN (Generative Adversarial Network) training yields a feature distribution that exceeds that of the original dataset, resulting in a better feature extraction network. A new loss function is also added to the adversarial training process to ensure stable improvement of the detector. A comparative experiment against the original CenterNet network on MS COCO (Microsoft Common Objects in COntext) 2017 shows that the generative adversarial training method significantly improves the average precision for most of the examined backbone networks. Across the four backbone networks used in the experiments, the mean improvement in AP (Average Precision) ranged from 0.3 to 0.9, achieved with minimal training effort. Moreover, none of the four backbone networks required additional network parameters during inference. The experimental results indicate that the proposed architecture effectively enhances the network's feature extraction capability without compromising inference speed.
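As a small illustration of the EM (Wasserstein-1) distance named in the abstract, and not the paper's implementation, the sketch below computes it for two equal-size one-dimensional samples, where it reduces to the mean absolute difference between the sorted values. The function name `em_distance_1d` and the toy "feature" values are assumptions made for this example; the features in the paper are high-dimensional and the distance is optimized through adversarial training rather than computed in closed form.

```python
def em_distance_1d(xs, ys):
    """Wasserstein-1 (EM) distance between two equal-size 1-D empirical samples."""
    if len(xs) != len(ys) or len(xs) == 0:
        raise ValueError("samples must be non-empty and of equal size")
    # In one dimension the optimal transport plan pairs the sorted values in order,
    # so the distance is the average gap between matched points.
    return sum(abs(a - b) for a, b in zip(sorted(xs), sorted(ys))) / len(xs)


# Hypothetical toy values standing in for two feature distributions.
real_features = [0.0, 1.0, 2.0, 3.0]
generated_features = [1.0, 2.0, 3.0, 4.0]  # same shape, shifted by 1

print(em_distance_1d(real_features, generated_features))  # prints 1.0
```

A distance of zero means the two samples are identical as distributions, which is the direction an objective that minimizes the EM distance pushes toward.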
automation & control systems; computer science, artificial intelligence; engineering, electrical & electronic; engineering, multidisciplinary