Enhancing Object Detection Performance for Small Objects through Synthetic Data Generation and Proportional Class-Balancing Technique: A Comparative Study in Industrial Scenarios

Jibinraj Antony, Vinit Hegiste, Ali Nazeri, Hooman Tavakoli, Snehal Walunj, Christiane Plociennik, Martin Ruskowski
2024-01-29
Abstract: Object Detection (OD) has proven to be a significant computer vision method for extracting localized class information and has multiple applications in industry. Although many state-of-the-art (SOTA) OD models perform well on medium and large objects, they tend to underperform on small objects. In most industrial use cases, it is difficult to collect and annotate data for small objects, as the process is time-consuming and prone to human error. Additionally, such datasets are likely to be unbalanced, which often results in inefficient model convergence. To tackle this challenge, this study presents a novel approach that injects additional data points to improve the performance of OD models. Synthetic data generation minimizes the difficulties of collecting and annotating small-object data points and makes it possible to create a dataset with a balanced distribution. This paper discusses the effects of a simple proportional class-balancing technique intended to enable better anchor matching in OD models. The performance of the SOTA OD models YOLOv5, YOLOv7 and SSD was compared for combinations of real and synthetic datasets within an industrial use case.
Computer Vision and Pattern Recognition, Machine Learning
What problem does this paper attempt to address?
The paper focuses on improving small object detection in industrial scenarios. State-of-the-art object detection models perform well on medium and large objects but are less effective on small ones. Collecting and annotating small-object data is time-consuming and error-prone, and the resulting datasets tend to be imbalanced, which leads to inefficient model convergence. To address this, the paper augments the dataset with synthetic data and applies a proportional class-balancing technique.

First, the paper uses synthetic data generation to reduce the effort of collecting and annotating real data while creating a balanced data distribution. Synthetic images rendered from CAD (Computer-Aided Design) models increase the number of small-object data points and thereby balance the dataset. The paper also studies how a simple proportional class-balancing technique affects anchor matching in object detection models (YOLOv5, YOLOv7, and SSD).

In the experimental section, the paper compares the models on combinations of real and synthetic data and finds that performance is best when the amount of synthetic data is approximately half that of the real data (DS-3). As the amount of synthetic data increases further (DS-4 and DS-5), performance in practical applications decreases, possibly because the model over-relies on unrealistic synthetic data. In conclusion, the proposed method improves detection performance for small objects, particularly when a suitable amount of synthetic data is combined with real data. Even simple synthetic data generated from CAD models can significantly improve detection accuracy for small objects.
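
The idea of topping up under-represented classes with synthetic images can be illustrated with a minimal sketch. It is not the paper's procedure: the function name `synthetic_quota`, the class names, the counts, and the rule of splitting the synthetic budget in proportion to each class's deficit relative to the largest class are all assumptions made for illustration. The default ratio of 0.5 mirrors the summary's observation that synthetic data at roughly half the real data volume (DS-3) worked best.

```python
def synthetic_quota(real_counts, synth_ratio=0.5):
    """Return how many synthetic instances to generate per class.

    Hypothetical helper: the total synthetic budget is synth_ratio times
    the number of real instances, split across classes in proportion to
    how far each class falls short of the most frequent class.
    """
    total_real = sum(real_counts.values())
    budget = round(synth_ratio * total_real)

    # Deficit of each class relative to the largest class.
    target = max(real_counts.values())
    deficits = {cls: target - n for cls, n in real_counts.items()}
    total_deficit = sum(deficits.values())

    if total_deficit == 0:
        # Already balanced: spread the budget evenly.
        share = budget // len(real_counts)
        return {cls: share for cls in real_counts}

    # Integer division keeps the quotas whole; a few instances of the
    # budget may remain unassigned because of rounding down.
    return {cls: budget * d // total_deficit for cls, d in deficits.items()}


# Illustrative (made-up) instance counts for three small-object classes.
real_counts = {"screw": 100, "nut": 40, "washer": 60}
quota = synthetic_quota(real_counts)
print(quota)  # {'screw': 0, 'nut': 60, 'washer': 40}
```

With these made-up counts, the 100-instance synthetic budget goes entirely to the two minority classes, so every class ends up with 100 instances. Real datasets rarely balance this cleanly, and the right ratio would have to be tuned empirically, as the DS-3 to DS-5 comparison suggests.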