FTA-DETR: An efficient and precise fire detection framework based on an end-to-end architecture applicable to embedded platforms

Hongtao Zheng, Gaoyang Wang, Duo Xiao, Hong Liu, Xiaoyin Hu
DOI: https://doi.org/10.1016/j.eswa.2024.123394
IF: 8.5
2024-02-18
Expert Systems with Applications
Abstract: Timely fire alarms are crucial because they can save lives and avoid major economic losses. However, because of their structural complexity, current mainstream DETR-based fire detection models are impractical: they require large amounts of memory and long inference times. Meanwhile, high-quality fire detection datasets are very scarce, which severely limits the performance of the algorithms. To address these challenges and improve accuracy in complex fire environments, we first introduce a dataset quality enhancement framework based on a denoising diffusion probabilistic model (DDPM) to improve low-quality fire alarm datasets. Second, we propose a novel Deformable-DETR-based fire detection framework (FTA-DETR). FTA-DETR contains several optimizations. First, we introduce a trainable matrix in the encoder to compute features, which reduces the computational burden of the encoder, highlights compelling features, and significantly reduces the training time. Second, we improve the encoding block by alternately updating high-level and low-level features, greatly reducing the amount of feature computation required for effective detection. This encoder structure is compatible with any state-of-the-art transformer decoder. Third, to accommodate the multi-scale nature of fires and different environmental complexities, we change the loss function to WIoU v3, which both speeds up the convergence of the model and improves its performance. Finally, we combine FTA-DETR with an acceleration engine such as TensorRT to improve inference speed with little loss of accuracy. The experiments show that the diffusion-based dataset quality enhancement framework generates high-quality datasets, and that the enhanced dataset greatly improves the detection performance of FTA-DETR (mAP increased by 2.42%). Meanwhile, FTA-DETR outperforms almost all current fire detection frameworks in detection accuracy and interference resistance, reaching accuracies of 98.32% and 99.21% on the Mivia and FireNet datasets, respectively, and a precision of 94% on the BoWFire dataset. In addition, when paired with TensorRT, FTA-DETR achieves an inference speed of 76 FPS on the Jetson Orin Nano, a small embedded device with very limited computational power. The code is available at https://github.com/wanggoat/FTA-detr.
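The abstract names the WIoU v3 loss without stating its form. As a rough illustration only, the sketch below follows the commonly cited Wise-IoU v3 formulation for a single box pair: the IoU loss is scaled by a centre-distance term and by a non-monotonic focusing factor driven by an "outlier degree". It is not taken from the authors' repository. The function names, the `(x1, y1, x2, y2)` box layout, and the defaults `alpha=1.9` and `delta=3.0` are assumptions, and the running mean of the IoU loss is assumed to be tracked by the caller.

```python
import math


def iou_xyxy(a, b):
    """IoU of two axis-aligned boxes given as (x1, y1, x2, y2)."""
    iw = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    ih = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = iw * ih
    union = ((a[2] - a[0]) * (a[3] - a[1])
             + (b[2] - b[0]) * (b[3] - b[1]) - inter)
    return inter / union if union > 0 else 0.0


def wiou_v3(pred, gt, mean_iou_loss, alpha=1.9, delta=3.0):
    """Hypothetical single-pair Wise-IoU v3 loss (illustrative sketch).

    mean_iou_loss is a running mean of the IoU loss over recent
    batches; it must be positive. Boxes are assumed non-degenerate.
    """
    l_iou = 1.0 - iou_xyxy(pred, gt)

    # Squared distance between box centres.
    dx = (pred[0] + pred[2]) / 2 - (gt[0] + gt[2]) / 2
    dy = (pred[1] + pred[3]) / 2 - (gt[1] + gt[3]) / 2

    # Width and height of the smallest box enclosing both boxes.
    # In a training framework this term would be detached from the graph.
    wg = max(pred[2], gt[2]) - min(pred[0], gt[0])
    hg = max(pred[3], gt[3]) - min(pred[1], gt[1])
    r_wiou = math.exp((dx * dx + dy * dy) / (wg * wg + hg * hg))

    # Outlier degree: how poor this box is relative to the running mean.
    beta = l_iou / mean_iou_loss
    # Non-monotonic focusing factor; equals 1 when beta == delta.
    focus = beta / (delta * alpha ** (beta - delta))

    return focus * r_wiou * l_iou
```

With these defaults the focusing factor is largest for boxes of moderate quality and shrinks for both very good and very poor boxes, which is the usual motivation given for this family of losses; whether FTA-DETR uses the same hyperparameters is not stated in the abstract.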
computer science, artificial intelligence; engineering, electrical & electronic; operations research & management science