Hyperparameter tuning of convolutional neural networks for building construction image classification

André Luiz Carvalho Ottoni,Marcela Silva Novo,Dayana Bastos Costa
DOI: https://doi.org/10.1007/s00371-021-02350-9
IF: 2.835
2022-01-12
The Visual Computer
Abstract:Deep Learning models have important applications in image processing. However, one of the challenges in this field is the definition of hyperparameters. Thus, the objective of this work is to propose a rigorous methodology for hyperparameter tuning of Convolutional Neural Network for building construction image classification, especially in roofs structure analysis. For this, the HyperTuningSK algorithm was developed, intended to create recommendation rankings for two hyperparameters: learning rate and optimizer. The approach uses concepts from the statistical design of experiments, such as Analysis of Variance and the Scott–Knott clustering algorithm. In addition, the adopted database includes images of inspections on buildings roofs made with unmanned aerial vehicles. The images are divided into two classes: (i) roofs with clean gutters and (ii) roofs with dirty gutters. The methods recommended by the HyperTuningSK algorithm achieved good results in comparison to the hyperparameters adopted in the literature. In this respect, adagrad015 achieved the highest average values of accuracy in the validation (100%\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$100\%$$\end{document}) and testing steps (90%\documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$$90\%$$\end{document}) for Convolutional Neural Network architecture with 12 layers. In addition, the hyperparameters recommended by the HyperTuningSK algorithm achieved the best test results for other two literature architectures: Densenet121 (85.7%) and VGG16 (84.4%).
computer science, software engineering
What problem does this paper attempt to address?