Abstract:In recent years, the optimization of network architecture plays an increasingly important role in the performance improvement of neural networks. We introduce an interactive dual-branch attention mechanism and three different lightweight-oriented strategies to build an accurate and compact residual network model in this work. The channel attention and spatial attention are fused to construct a novel bottleneck to enhance the feature representation ability for accurate performance. Asymmetric convolutions with spatial factorization, channel splitting, depthwise separable convolution with width multiplier adjustment are further combined to compress the parameter size of the attention-driven model for a lightweight and compact residual network named ALResNet. The experimental results of 92.1% top-1 testing accuracy at the inference speed of 14.90 fps on Animals-10 and 89.4% top-1 testing accuracy at the inference speed of 16.21 fps on CIFAR-10, as well as 4.77M parameters and 736.82 MFLOPs, demonstrate that the proposed ALResNet achieves a decent tradeoff between accuracy and computing efficiency for fast inference on resource-limited mobile devices for vision-based tasks. ∗Corresponding author: Aiwen Luo (faith.awluo@gmail.com). Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from permissions@acm.org. MLMI’21, September 17–19, 2021, Hangzhou, China © 2021 Association for Computing Machinery. ACM ISBN 978-1-4503-8424-7/21/09. . . $15.00 https://doi.org/10.1145/3490725.3490729 CCS CONCEPTS • Computing methodologies; • Object recognition; • Neural networks; • Model verification and validation;

High-speed hyperparameter optimization for deep ResNet models in image recognition

Hyperparameter Optimization for Deep Residual Learning in Image Classification

An efficient approach to escalate the speed of training convolution neural networks

Improved Residual Networks for Image and Video Recognition

Research on Optimization of Image Recognition Algorithm Based on Deep Learning

A Framework for Designing the Architectures of Deep Convolutional Neural Networks

Random search as a neural network optimization strategy for Convolutional-Neural-Network (CNN)-based noise reduction in CT

Best Practices for Convolutional Neural Networks Applied to Object Recognition in Images

Efficient ResNets: Residual Network Design

ALResNet: Attention-Driven Lightweight Residual Network for Fast and Accurate Image Recognition

Convolution Neural Network Hyperparameter Optimization Using Simplified Swarm Optimization

RRR-Net: Reusing, Reducing, and Recycling a Deep Backbone Network

Multi-Residual Networks: Improving the Speed and Accuracy of Residual Networks

Optimizing Image Classification: Automated Deep Learning Architecture Crafting with Network and Learning Hyperparameter Tuning

TResNet: High Performance GPU-Dedicated Architecture

Overparametrization of HyperNetworks at Fixed FLOP-Count Enables Fast Neural Image Enhancement

Assessment of Optimizers impact on Image Recognition with Convolutional Neural Network to Adversarial Datasets

Optimization of Convolutional Neural Network using Microcanonical Annealing Algorithm

Optimizing Convolutional Neural Network Hyperparameters by Enhanced Swarm Intelligence Metaheuristics

Artificial Intelligence Image Recognition Method Based on Convolutional Neural Network Algorithm

Hyperparameter tuning of convolutional neural networks for building construction image classification