BGM: Background Mixup for X-ray Prohibited Items Detection

Weizhe Liu, Renshuai Tao, Hongguang Zhu, Yunda Sun, Yao Zhao, Yunchao Wei
2024-11-30
Abstract: Prohibited item detection is crucial for ensuring public safety, yet current X-ray image-based detection methods often lack comprehensive data-driven exploration. This paper introduces a novel data augmentation approach tailored for prohibited item detection, leveraging unique characteristics inherent to X-ray imagery. Our method is motivated by observations of physical properties including: 1) X-ray Transmission Imagery: Unlike reflected light images, transmitted X-ray pixels represent composite information from multiple materials along the imaging path. 2) Material-based Pseudo-coloring: Pseudo-color rendering in X-ray images correlates directly with material properties, aiding in material distinction. Building on a novel perspective from physical properties, we propose a simple yet effective X-ray image augmentation technique, Background Mixup (BGM), for prohibited item detection in security screening contexts. The essence is the rich background simulation of X-ray images to induce the model to increase its attention to the foreground. The approach introduces 1) contour information of baggage and 2) variation of material information into the original image by Mixup at patch level. Background Mixup is plug-and-play, parameter-free, highly generalizable and provides an effective solution to the limitations of classical visual augmentations in non-reflected light imagery. When implemented with different high-performance detectors, our augmentation method consistently boosts performance across diverse X-ray datasets from various devices and environments. Extensive experimental results demonstrate that our approach surpasses strong baselines while maintaining similar training resources.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The paper addresses a gap in X-ray prohibited item detection: existing methods lack comprehensive data-driven exploration. In particular, the data augmentation they rely on does not exploit the physical characteristics of X-ray images, which limits the gains in model performance.

### Main problems:

1. **Small dataset scale**: Because of privacy policies and the limitations of X-ray imaging systems, publicly available X-ray image datasets are far smaller than natural image datasets.
2. **Difficult to label**: Labeling prohibited items in X-ray images requires trained professionals, because each pixel represents composite information from multiple materials along the imaging path rather than a single object.
3. **Traditional data augmentation methods are a poor fit**: Methods such as Mixup and Random Erasing do not work well on X-ray images because they ignore the characteristics specific to X-ray imagery, namely transmission imaging and pseudo-color rendering.

### Solutions proposed in the paper:

To address these challenges, the paper proposes a new data augmentation method, **Background Mixup (BGM)**, designed specifically for X-ray security inspection images. BGM simulates rich background information so that the model pays more attention to foreground targets, which improves detection performance.

### Specific methods:

1. **Self Patch Mixup (SPM)**: Operates at the contour level. It randomly selects background patches, places them anywhere in the image, and blends them in with transparency through a local Mixup operation to enrich the background information.
2. **Color Patch Mixup (CPM)**: Operates at the material level. It randomly selects patches with random colors and blends them in with transparency through a local Mixup operation to simulate material changes.

### Experimental results:

Experiments show that BGM consistently improves the performance of different detection models on multiple X-ray datasets, especially under complex occlusion and cluttered backgrounds. For example, on the PIDray dataset, the DINO detector with a ResNet-50 backbone improves from 68.4% to 70.1% mAP, a gain of 1.7 percentage points.

### Summary:

BGM addresses the lack of X-ray-specific data augmentation in existing detection methods and provides a simple, effective technique that improves the performance of detection models.
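
The two patch-level operations listed under "Specific methods" can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names, the fixed square patch size, the fixed transparency `alpha`, the uniform sampling of positions and colors, and the rule that a "background" patch is one that overlaps no ground-truth box are all assumptions made for illustration. The summary above does not say how the paper chooses these, nor whether pasted patches may cover foreground objects.

```python
import numpy as np


def blend_patch(image, patch, top, left, alpha):
    """Patch-level Mixup: (1 - alpha) * image + alpha * patch inside the patch area."""
    out = image.astype(np.float32)  # astype returns a copy, so the input is untouched
    h, w = patch.shape[:2]
    area = out[top:top + h, left:left + w]
    out[top:top + h, left:left + w] = (1.0 - alpha) * area + alpha * patch.astype(np.float32)
    return np.clip(np.rint(out), 0, 255).astype(np.uint8)


def intersects(top, left, size, box):
    """True if the square patch overlaps box = (x1, y1, x2, y2)."""
    x1, y1, x2, y2 = box
    return left < x2 and left + size > x1 and top < y2 and top + size > y1


def self_patch_mixup(image, boxes, rng, size=32, alpha=0.5, tries=50):
    """SPM-like step (assumed form): copy a box-free patch and blend it elsewhere."""
    h, w = image.shape[:2]
    for _ in range(tries):
        top = int(rng.integers(0, h - size + 1))
        left = int(rng.integers(0, w - size + 1))
        if not any(intersects(top, left, size, b) for b in boxes):
            patch = image[top:top + size, left:left + size].copy()
            # Assumption: the destination is sampled anywhere in the image.
            dst_top = int(rng.integers(0, h - size + 1))
            dst_left = int(rng.integers(0, w - size + 1))
            return blend_patch(image, patch, dst_top, dst_left, alpha)
    return image.copy()  # no box-free patch found within `tries` attempts


def color_patch_mixup(image, rng, size=32, alpha=0.5):
    """CPM-like step (assumed form): blend in a patch of one random RGB color."""
    h, w = image.shape[:2]
    color = rng.integers(0, 256, size=3)
    patch = np.broadcast_to(color, (size, size, 3))
    top = int(rng.integers(0, h - size + 1))
    left = int(rng.integers(0, w - size + 1))
    return blend_patch(image, patch, top, left, alpha)


# Usage on a synthetic image with one hypothetical ground-truth box.
rng = np.random.default_rng(0)
img = np.full((128, 128, 3), 200, dtype=np.uint8)
boxes = [(40, 40, 80, 80)]
augmented = color_patch_mixup(self_patch_mixup(img, boxes, rng), rng)
```

Both steps leave the bounding-box labels unchanged, since they only alter pixel values; that keeps the sketch usable as a drop-in transform ahead of any detector, in line with the plug-and-play claim in the abstract.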