Influence of Different Activation Functions on Deep Learning Models in Indoor Scene Images Classification

Basavaraj S. Anami, Chetan V. Sagarnal
DOI: https://doi.org/10.1134/S1054661821040039
2022-03-19
Pattern Recognition and Image Analysis
Abstract: Deep learning has produced significant breakthroughs in computer vision and object recognition, especially in recognition accuracy. Scene recognition algorithms have evolved over the years with developments in machine learning and deep convolutional neural networks (DCNNs). This paper attempts the classification of indoor scenes using three deep learning models, namely ResNet, MobileNet, and EfficientNet, and explores the influence of activation functions on classification accuracy. Three activation functions, namely tanh, ReLU, and sigmoid, are used. The MIT-67 indoor dataset is split into scenes with and without people to test the effect of this split on classification accuracy. The novelty of the work lies in segregating the dataset, based on spatial layout, into two groups: with people and without people. Among the three pre-trained models, EfficientNet gave good results.
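For reference, the three activation functions named in the abstract have standard textbook definitions. The following is a minimal sketch of those definitions in plain Python, not the authors' code; the names `ACTIVATIONS` and `apply_activation` are hypothetical helpers introduced only for illustration, and the abstract does not say where in each network the functions are applied.

```python
import math


def relu(x: float) -> float:
    """Rectified linear unit: passes positive inputs, zeroes out the rest."""
    return max(0.0, x)


def sigmoid(x: float) -> float:
    """Logistic sigmoid, mapping any real input into (0, 1).

    Uses two algebraically equivalent branches so that math.exp never
    receives a large positive argument (which would overflow).
    """
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    z = math.exp(x)
    return z / (1.0 + z)


def tanh(x: float) -> float:
    """Hyperbolic tangent, mapping any real input into (-1, 1)."""
    return math.tanh(x)


# Hypothetical lookup table (an assumption, not from the paper) so that the
# activation can be swapped by name when comparing configurations.
ACTIVATIONS = {"tanh": tanh, "relu": relu, "sigmoid": sigmoid}


def apply_activation(name: str, values: list[float]) -> list[float]:
    """Apply the named activation element-wise to a list of values."""
    fn = ACTIVATIONS[name]
    return [fn(v) for v in values]


if __name__ == "__main__":
    sample = [-2.0, 0.0, 2.0]
    for name in ACTIVATIONS:
        print(name, apply_activation(name, sample))
```

The output ranges differ: ReLU is unbounded above, while sigmoid and tanh saturate for large inputs. That difference is one common reason the choice of activation can affect training and accuracy.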