Benchmark analysis of popular ImageNet classification deep CNN architectures

Mustafa Alghali Elsaid Muhammed, Ahmed Abdalazeem Ahmed, Tarig Ahmed Khalid
DOI: https://doi.org/10.1109/smarttechcon.2017.8358502
2017-08-01
Abstract: Deep Convolutional Neural Networks (CNNs) have demonstrated state-of-the-art classification performance on the ImageNet Large Scale Visual Recognition Challenge (ILSVRC) since 2012, yet there is still little clear understanding of the reasons behind their outstanding performance or of how they might be improved. In this paper we present a novel benchmark of multiple state-of-the-art deep CNN architectures, analyzing three important performance metrics: speed, memory consumption, and network parameter utilization. Key findings are: (1) fully connected layers are costly in speed and memory consumption compared with sparsely connected layers; (2) the relationship between an architecture's depth and its performance is nonlinear and constrained by the types of layer transformations; (3) 1×1 convolutions are an efficient way to reduce dimensionality and pool features; (4) addition units, as in residual and densely connected networks, speed up backpropagation by distributing the gradients through the graph. We believe our set of benchmarks is a step towards a better understanding of the best architectural design choices for deep CNNs.
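Findings (1) and (3) can be illustrated with simple parameter-count arithmetic. The sketch below is not taken from the paper: the helper names `conv_params` and `fc_params` and all layer sizes are assumptions chosen for illustration, and parameter count is only a rough proxy for the speed and memory costs the benchmark measures.

```python
def conv_params(in_ch, out_ch, k):
    """Weights plus biases of a k x k convolution layer."""
    return in_ch * out_ch * k * k + out_ch


def fc_params(in_features, out_features):
    """Weights plus biases of a fully connected layer."""
    return in_features * out_features + out_features


# Illustrative sizes only (assumed, not reported in the abstract).

# A 3x3 convolution applied directly to 256 input channels.
direct = conv_params(256, 256, 3)

# The same 3x3 convolution preceded by a 1x1 convolution that first
# reduces 256 channels to 64 (the dimensionality reduction in finding 3).
bottleneck = conv_params(256, 64, 1) + conv_params(64, 256, 3)

# A fully connected layer on a flattened 7x7x512 feature map with 4096 outputs.
fc = fc_params(7 * 7 * 512, 4096)

print("direct 3x3 conv:   ", direct)
print("1x1 then 3x3 conv: ", bottleneck)
print("fully connected:   ", fc)
```

Under these assumed sizes, the 1×1 reduction cuts the convolutional parameter count to roughly a quarter, while the single fully connected layer holds over a hundred times more parameters than either convolutional variant.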