Infrared Image Captioning Based on Unsupervised Learning and Reinforcement Learning

Chenjun Gao, Ganghui Bian, Yanzhi Dong, Xiaohu Yuan, Huaping Liu
DOI: https://doi.org/10.1109/icarce55724.2022.10046598
2022-01-01
Abstract: When sufficient prior knowledge is lacking or manual annotation is difficult, solving the problem directly from training samples of unknown category can greatly reduce time cost. We therefore add unsupervised learning to the preparatory stage of image captioning, using it for efficient image domain conversion so that the required images can be generated in batches. At the same time, infrared images are increasingly used to assist decision making and environment perception. Generating more diverse and discriminative captions for similar scenes can enhance decision-making and perception capabilities. Our infrared image captioning model, trained with reinforcement learning, achieves satisfactory results both in quantitative scores and in real-scene tests.