Training-free Neural Architecture Search on Hybrid Convolution-attention Networks

Yi Fan,Yu-Bin Yang
DOI: https://doi.org/10.1109/icme57554.2024.10687652
2024-01-01
Abstract:Hybrid convolution-attention networks have emerged as state-of-the-art deep network structures, garnering significant attention in recent years. Additionally, training-free Neural Architecture Search (NAS) has proven to be an effective method for further enhancing the performance of deep networks. However, existing training-free NAS methods are primarily designed for pure convolutional networks or pure attention networks, lacking effective adaptation to hybrid convolution-attention networks. To address this issue, we propose a novel training-free NAS method called CA-NAS. CA-NAS constructs two subnetworks based on a candidate network, obtains the proxy of each subnetwork, and then combines the two proxies using the importance of each subnetwork to calculate the proxy of the candidate network. Furthermore, we introduce staged CA-NAS as an extension of our method. Experimental results demonstrate that our search results achieve higher top-1 classification accuracy compared to existing models across three search spaces with different model sizes.
What problem does this paper attempt to address?