Multispectral Object Detection with Deep Learning

Md Osman Gani,Somenath Kuiry,Alaka Das,Mita Nasipuri,Nibaran Das
DOI: https://doi.org/10.48550/arXiv.2102.03115
2021-02-05
Abstract:Object detection in natural scenes can be a challenging task. In many real-life situations, the visible spectrum is not suitable for traditional computer vision tasks. Moving outside the visible spectrum range, such as the thermal spectrum or the near-infrared (NIR) images, is much more beneficial in low visibility conditions, NIR images are very helpful for understanding the object's material quality. In this work, we have taken images with both the Thermal and NIR spectrum for the object detection task. As multi-spectral data with both Thermal and NIR is not available for the detection task, we needed to collect data ourselves. Data collection is a time-consuming process, and we faced many obstacles that we had to overcome. We train the YOLO v3 network from scratch to detect an object from multi-spectral images. Also, to avoid overfitting, we have done data augmentation and tune hyperparameters.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
This paper attempts to address the problem of object detection in natural scenes, particularly in situations where the visible spectrum is not applicable, such as foggy or low-light conditions. The authors propose using multispectral images (including thermal infrared and near-infrared images) to improve the effectiveness of object detection. Due to the lack of publicly available multispectral datasets, the authors collected their own image dataset containing RGB, NIR (near-infrared), and Thermal (thermal infrared) spectra, and annotated 10 categories of objects. Additionally, to reduce overfitting and increase the amount of data, the authors performed data augmentation and used the YOLO v3 network to train on the multispectral images to evaluate the performance of object detection under different spectral combinations.