Abstract:With the recent advance of deep learning, a large number of methods have been developed for prohibited item detection in X-ray security images. Generally, these methods train models on a single X-ray image dataset that may contain only limited categories of prohibited items. To detect more prohibited items, it is desirable to train a model on the multi-dataset that is constructed by combining multiple datasets. However, directly applying existing methods to the multi-dataset cannot guarantee good performance because of the large domain discrepancy between datasets and the occlusion in images. To address the above problems, we propose a novel Dual-Mode Learning Network (DML-Net) to effectively detect all the prohibited items in the multi-dataset. In particular, we develop an enhanced RetinaNet as the architecture of DML-Net, where we introduce a lattice appearance enhanced sub-net to enhance appearance representations. Such a way benefits the detection of occluded prohibited items. Based on the enhanced RetinaNet, the learning process of DML-Net involves both common mode learning (detecting the common prohibited items across datasets) and unique mode learning (detecting the unique prohibited items in each dataset). For common mode learning, we introduce an adversarial prototype alignment module to align the feature prototypes from different datasets in the domain-invariant feature space. For unique mode learning, we take advantage of feature distillation to enforce the student model to mimic the features extracted by multiple pre-trained teacher models. By tightly combining and jointly training the dual modes, our DML-Net method successfully eliminates the domain discrepancy and exhibits superior model capacity on the multi-dataset. Extensive experimental results on several combined X-ray image datasets demonstrate the effectiveness of our method against several state-of-the-art methods. Our code is available at https://github.com/vampirename/dmlnet.

Dual-Mode Learning for Multi-Dataset X-Ray Security Image Detection

Dual-Level Boost Network for Long-Tail Prohibited Items Detection in X-ray Security Inspection

Combination of Deep Learning with Representation Learning in X-ray Prohibited Item Detection

Dual-view X-ray Detection: Can AI Detect Prohibited Items from Dual-view X-ray Images Like Humans?

Multi-Target Prohibited Item Recognition Algorithm for X-Ray Security Scene

MMCL: Boosting Deformable DETR-Based Detectors with Multi-Class Min-Margin Contrastive Learning for Superior Prohibited Item Detection

FDTNet: Enhancing frequency-aware representation for prohibited object detection from X-ray images via dual-stream transformers

Over-sampling De-occlusion Attention Network for Prohibited Items Detection in Noisy X-ray Images

Multi-Class 3D Object Detection Within Volumetric 3D Computed Tomography Baggage Security Screening Imagery

X-YOLO: An Efficient Detection Network of Dangerous Objects in X-Ray Baggage Images

Multi-Object Detection in Security Screening Scene Based on Convolutional Neural Network

Towards Real-World Prohibited Item Detection: A Large-Scale X-ray Benchmark

I$^2$OL-Net: Intra-Inter Objectness Learning Network for Point-Supervised X-Ray Prohibited Item Detection

Prohibited item detection within X-ray security inspection images based on an improved cascade network

A GAN based method for multiple prohibited items synthesis of X-ray security image

Differential Feature Awareness Network within Antagonistic Learning for Infrared-Visible Object Detection

Towards Real-world X-ray Security Inspection: A High-Quality Benchmark and Lateral Inhibition Module for Prohibited Items Detection

SIXray : A Large-scale Security Inspection X-ray Benchmark for Prohibited Item Discovery in Overlapping Images

InDuDoNet: An Interpretable Dual Domain Network for CT Metal Artifact Reduction

Multispectral Object Detection Based on Multilevel Feature Fusion and Dual Feature Modulation

Multi-Label Local to Global Learning: A Novel Learning Paradigm for Chest X-Ray Abnormality Classification.