Simple In-place Data Augmentation for Surveillance Object Detection

Munkh-Erdene Otgonbold,Ganzorig Batnasan,Munkhjargal Gochoo
2024-04-17
Abstract:Motivated by the need to improve model performance in traffic monitoring tasks with limited labeled samples, we propose a straightforward augmentation technique tailored for object detection datasets, specifically designed for stationary camera-based applications. Our approach focuses on placing objects in the same positions as the originals to ensure its effectiveness. By applying in-place augmentation on objects from the same camera input image, we address the challenge of overlapping with original and previously selected objects. Through extensive testing on two traffic monitoring datasets, we illustrate the efficacy of our augmentation strategy in improving model performance, particularly in scenarios with limited labeled samples and imbalanced class distributions. Notably, our method achieves comparable performance to models trained on the entire dataset while utilizing only 8.5 percent of the original data. Moreover, we report significant improvements, with mAP@.5 increasing from 0.4798 to 0.5025, and the mAP@.5:.95 rising from 0.29 to 0.3138 on the FishEye8K dataset. These results highlight the potential of our augmentation approach in enhancing object detection models for traffic monitoring applications.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The paper primarily aims to address the issue of poor model performance in traffic monitoring tasks due to limited annotated samples. Specifically, the research proposes a simple and highly targeted data augmentation technique specifically for object detection datasets based on fixed cameras. ### Problems the Paper Attempts to Solve 1. **Improve Model Performance**: Enhance the performance of models in traffic monitoring tasks, especially when the number of annotated samples is limited. 2. **Handle Class Imbalance**: Address the issue of imbalanced sample quantities of different classes in the training dataset, particularly in traffic monitoring scenarios where class distribution is uneven due to varying traffic patterns, infrastructure layouts, and behavioral norms in different geographic locations. 3. **Reduce Dependence on Large Annotated Data**: Alleviate the high dependency of object detection models on large amounts of accurately annotated data through effective data augmentation methods. 4. **Enhance Small Dataset Performance**: Achieve or approach the performance level of models trained on a full dataset using a small amount of data. ### Method Overview The researchers propose an "in-place" data augmentation method, which focuses on placing objects in the same position as the original to ensure the effectiveness of the augmentation and avoid overlapping with the original objects or previously selected objects. This method was extensively tested on two traffic monitoring datasets (FishEye8K and UA-DETRAC), and the results show that it can significantly improve model performance, especially in cases of limited sample size and class imbalance. ### Main Contributions - Proposed a simple data augmentation technique tailored for fixed-camera monitoring scenarios. - Achieved performance comparable to or better than models trained on the full dataset using only 8.5% of the original dataset. - Experimental results on the FishEye8K dataset showed that the mean Average Precision (mAP@.5) increased from 0.4798 to 0.5025, and mAP@.5:.95 increased from 0.29 to 0.3138. - For the UA-DETRAC dataset, significant performance improvements were also achieved by blurring non-interest areas and applying data augmentation. ### Conclusion The method proposed in this study is particularly suitable for traffic monitoring tasks based on fixed cameras, effectively improving model performance in cases of limited sample size and class imbalance. However, this method is mainly targeted at fixed-camera scenarios, and further research may be needed for broader applications.