A Method to Establish a Synthetic Image Dataset of Stored-Product Insects for Insect Detection

Jiangtao Li,Yuwei Su,Zhaojun Cui,Jida Tian,Huiling Zhou
DOI: https://doi.org/10.1109/access.2022.3188282
IF: 3.9
2022-01-01
IEEE Access
Abstract:In recent years, deep-learning models have resulted in significant progress in insect recognition. However, training deep neural networks requires a large amount of data, and data collection and labeling are time consuming and labor intensive. This study proposes a method for establishing a synthetic image dataset of stored-product insects to provide well-labelled image data for insect detection tasks. Proxy virtual worlds are leveraged to obtain synthetic data with annotations. A dynamic generation approach was presented to generate synthetic images with diverse insect targets, various backgrounds, and changing lighting conditions by using a camera module in the constructed virtual scene. The coordinates of the bounding boxes and the category labels of insect targets in each synthetic image were obtained by calculating the geometrical relationships between the insect targets and the camera module. A texture translation network was developed to conduct image-to-image translation and launch to enhance the verisimilitude of the synthetic images. A synthetic image dataset was established for three insect species, Cyptolestes ferrugineus (Stephens), Sitophilus oryzae (Linnaeus), and Tribolium castaneum (Herbst).A set of assessments was introduced to evaluate the synthetic image dataset, including the statistical characteristics and experimental verification. The experimental results demonstrated that the use of synthetic data reduces the demand for real data. The proposed method may provide a novel solution for providing training data with correct annotations for insect detection, without tedious image collection and manual labeling.
What problem does this paper attempt to address?