Enhancing Object Detection Performance for Small Objects through Synthetic Data Generation and Proportional Class-Balancing Technique: A Comparative Study in Industrial Scenarios

Jibinraj Antony,Vinit Hegiste,Ali Nazeri,Hooman Tavakoli,Snehal Walunj,Christiane Plociennik,Martin Ruskowski

2024-01-29

Abstract:Object Detection (OD) has proven to be a significant computer vision method in extracting localized class information and has multiple applications in the industry. Although many of the state-of-the-art (SOTA) OD models perform well on medium and large sized objects, they seem to under perform on small objects. In most of the industrial use cases, it is difficult to collect and annotate data for small objects, as it is time-consuming and prone to human errors. Additionally, those datasets are likely to be unbalanced and often result in an inefficient model convergence. To tackle this challenge, this study presents a novel approach that injects additional data points to improve the performance of the OD models. Using synthetic data generation, the difficulties in data collection and annotations for small object data points can be minimized and to create a dataset with balanced distribution. This paper discusses the effects of a simple proportional class-balancing technique, to enable better anchor matching of the OD models. A comparison was carried out on the performances of the SOTA OD models: YOLOv5, YOLOv7 and SSD, for combinations of real and synthetic datasets within an industrial use case.

Computer Vision and Pattern Recognition,Machine Learning

What problem does this paper attempt to address?

The paper focuses on improving the performance of small object detection in industrial scenarios. The state-of-the-art object detection models perform well on medium to large objects but are not effective on small objects. Due to the time-consuming and error-prone process of collecting and annotating small object data, as well as the imbalanced nature of the datasets, the convergence efficiency of the models is low. To address this challenge, the paper proposes an innovative approach that enhances the dataset through the generation of synthetic data and proportional category balancing techniques. Firstly, the paper introduces the use of synthetic data generation to alleviate the challenges in collecting and annotating real data while creating a balanced data distribution. By converting CAD (Computer-Aided Design) models into synthetic images, the data points of small objects can be increased, thereby balancing the dataset. Additionally, the impact of simple proportional category balancing techniques on anchor matching for object detection models (such as YOLOv5, YOLOv7, and SSD) is studied. In the experimental section, the paper compares the performance of different models under combinations of real and synthetic data and finds that the best model performance is achieved when the amount of synthetic data is approximately half of the real data (DS-3). As the amount of synthetic data further increases (DS-4 and DS-5), the model's performance decreases in practical applications, possibly due to the over-reliance on unrealistic synthetic data. In conclusion, the proposed method effectively enhances the detection performance for small objects, particularly when a suitable amount of synthetic data is combined with real data. Through this approach, even with simple synthetic data generated from CAD models, the detection accuracy for small objects can be significantly improved.

Enhancing Object Detection Performance for Small Objects through Synthetic Data Generation and Proportional Class-Balancing Technique: A Comparative Study in Industrial Scenarios

Enhancing Object Detection Accuracy in Autonomous Vehicles Using Synthetic Data

Synthetica: Large Scale Synthetic Data for Robot Perception

POSEIDON: A Data Augmentation Tool for Small Object Detection Datasets in Maritime Environments

A Novel Pre-Processing Approach and Benchmarking Analysis for Faster, Robust, and Improved Small Object Detection Methods

Combining Synthetic Images and Deep Active Learning: Data-Efficient Training of an Industrial Object Detection Model

YOLO Adaptive Developments in Complex Natural Environments for Tiny Object Detection

Synthetic Data for Object Classification in Industrial Applications

SOD-YOLOv8—Enhancing YOLOv8 for Small Object Detection in Aerial Imagery and Traffic Scenes

SOD-YOLOv8 -- Enhancing YOLOv8 for Small Object Detection in Traffic Scenes

Towards Large-Scale Small Object Detection: Survey and Benchmarks.

Exploring the Effectiveness of Dataset Synthesis: An application of Apple Detection in Orchards

Accurate and real-time object detection in crowded indoor spaces based on the fusion of DBSCAN algorithm and improved YOLOv4-tiny network

Compatibility Review for Object Detection Enhancement through Super-Resolution

Sampling Techniques for Large-Scale Object Detection From Sparsely Annotated Objects

Concerning Imbalance and Bounding Box Loss to Detect Small Targets in Remote Sensing

Automatically Prepare Training Data for YOLO Using Robotic In-Hand Observation and Synthesis

A Small-Scale Object Detection Algorithm in Intelligent Transportation Scenarios

Class Imbalance in Object Detection: An Experimental Diagnosis and Study of Mitigation Strategies

Small Target-YOLOv5: Enhancing the Algorithm for Small Object Detection in Drone Aerial Imagery Based on YOLOv5

Augmentation for small object detection