Text-Guided Image Generation for Railway Intrusion Anomaly Detection

Jinghang Chen,Chi Zhang,Xiaodan Feng,Yuehu Liu
DOI: https://doi.org/10.1109/ICUS58632.2023.10318477
2023-01-01
Abstract:Railway intrusion are characterized by strong suddenness, high unpredictability and many disturbing factors, which are key factors affecting railway safety. Currently, the deep neural network-based intrusion object detection algorithm relies on the diversity of training data. However, the data of railway intrusion image is scarce and difficult to obtain. Existing image generation methods face the problem of insufficient realism in the generated images and difficulty in generating large amounts of data. We aim to explore a text-guided image data generation framework for specific visual tasks by using an existing large language model combined with a text-to-image diffusion model. In this paper, we construct a framework for railway intrusion image generation guided by textual prompts and build a dataset of railway intrusion images based on this framework. The dataset generated in this paper contains a wide range of intrusion images and includes a variety of environmental factors such as weather and time, which provides better diversity than existing datasets. The evaluation results show that the intrusion images generated in this paper have high quality and realism.
What problem does this paper attempt to address?