Abstract: Rising global fire incidents necessitate effective solutions, and forest surveillance has emerged as a crucial strategy. This paper proposes a complete solution that integrates visible and infrared spectrum images captured by Unmanned Aerial Vehicles (UAVs) to improve the detection of people and vehicles in forest environments. Unlike existing computer vision models that rely on single-sensor imagery, this approach overcomes the limitations of restricted spectrum coverage, particularly in low-light conditions, fog, or smoke. The developed 4-channel model uses both image types simultaneously to take advantage of the strengths of each. This article presents the development and implementation of a forest monitoring solution that covers every step from the transmission of images captured by a UAV to their analysis with an object detection model, without human intervention. The model is a new version of the YOLOv5 (You Only Look Once) architecture. After the model analyzes the images, the results can be viewed on a web platform from any device, anywhere in the world. For model training, a dataset of thermal and visible images was captured from an aerial perspective with a UAV. The resulting 4-channel model shows a substantial increase in precision and mAP (mean Average Precision) compared to traditional state-of-the-art (SOTA) models that use only red, green, and blue (RGB) images. Alongside the increase in precision, we confirmed the hypothesis that our model performs better in conditions unfavorable to RGB images, identifying objects in low light and in reduced visibility with partial occlusions. Training the model on our dataset also yielded a significant increase in performance on aerial-perspective images.
This study introduces a modular system architecture featuring key modules: multisensor image capture, transmission, processing, analysis, and results presentation. Powered by an innovative object detection deep-learning model, these components collaborate to enable real-time, efficient, and distributed forest monitoring across diverse environments.
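
To make the idea of a 4-channel input concrete, the following is a minimal sketch of how a visible frame and a thermal frame might be stacked into a single array before being passed to a detector. It is not the authors' implementation. The function name `fuse_rgb_thermal` is hypothetical, and the sketch assumes that both frames are 8-bit, have the same resolution, and are already pixel-aligned (registered), none of which is stated in the abstract.

```python
import numpy as np


def fuse_rgb_thermal(rgb: np.ndarray, thermal: np.ndarray) -> np.ndarray:
    """Stack an RGB frame (H, W, 3) and a thermal frame (H, W) into (4, H, W).

    Assumption: both inputs are uint8 and already registered to each other.
    The output is float32 scaled to [0, 1], in channel-first order.
    """
    if rgb.ndim != 3 or rgb.shape[2] != 3:
        raise ValueError("rgb must have shape (H, W, 3)")
    if thermal.shape != rgb.shape[:2]:
        raise ValueError("thermal must have shape (H, W) matching rgb")

    # Scale both modalities to the same [0, 1] range.
    rgb_f = rgb.astype(np.float32) / 255.0
    thermal_f = thermal.astype(np.float32) / 255.0

    # Append the thermal frame as a fourth channel: (H, W, 3) -> (H, W, 4).
    fused = np.concatenate([rgb_f, thermal_f[..., None]], axis=2)

    # Reorder to channel-first, the layout most detection networks expect.
    return np.transpose(fused, (2, 0, 1))


# Synthetic frames stand in for real UAV captures.
rgb = np.zeros((480, 640, 3), dtype=np.uint8)
thermal = np.full((480, 640), 255, dtype=np.uint8)
x = fuse_rgb_thermal(rgb, thermal)
print(x.shape)  # (4, 480, 640)
```

With an input of this kind, a detector would need a first convolutional layer that accepts four input channels instead of three. How the paper's model handles this internally is not described in the abstract.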

Applying deep learning to real-time UAV-based forest monitoring: Leveraging multi-sensor imagery for improved results