Automatic object detection in aerial image using bent identity-convolutional neural network and fine tuning algorithm

Lalitha V.P.,Shanta Rangaswamy
DOI: https://doi.org/10.1007/s11042-022-11948-7
IF: 2.577
2022-02-12
Multimedia Tools and Applications
Abstract:In RS (Remote Sensing) imaging, one of the most important tasks is Object Detection (OD) which aims to extract the object location and class information. Object detection is termed the highly advanced approach, which is helpful in RS image analysis, scene and image content understanding. In recent years, the researchers have made extensive efforts to present an effective OD model with the limitation of detecting specific classes such as roads and buildings in the RS images. For capturing aerial images, satellite sensors are utilized, and these images are affected by various factors like background noise, variation of viewpoint and interferences etc. To provide accurate detection of objects, an effective method is needed in analysing aerial images. However, this work presents an automatic OD using BI-CNN_FTA (Bent Identity-Convolutional Neural Network Fine-Tuning Algorithm) to detect more object classes relating to urbanization applications. The overall framework of proposed method is divided into five phases-Pre-processing, Object Localization, Segmentation, Feature Extraction and Classification. The input image is pre-processed by Advanced Dual Domain (ADD) filtering. Next, the objects are localized with the help of Grid Guided Localization (GGL) which draw an imaginary box around each object in an image. The segmentation process is done with the help of an improved tree Markov (ITM) random field model. Further, various features like texture, geometric, color descriptors and Zernike moments are extracted. Finally, the object classification is performed by BI-CNN_FTA. During this stage, each object class are categorized based on urbanization applications and learning loss is optimized. The proposed method is executed in the PHYTON platform. It has obtained improved performance in terms of Mean Average Precision (mAP = 85.5%), F1-score (83.78%), Detection rate (97%) and Precision-Recall curve when compared with other deep learning networks on the DOTA dataset.
computer science, information systems, theory & methods,engineering, electrical & electronic, software engineering
What problem does this paper attempt to address?