Discretization-aware architecture search

Yunjie Tian,Chang Liu,Lingxi Xie,Jianbin jiao,Qixiang Ye
DOI: https://doi.org/10.1016/j.patcog.2021.108186
IF: 8
2021-12-01
Pattern Recognition
Abstract:The search cost of neural architecture search (NAS) has been largely reduced by differentiable architecture search and weight-sharing methods. Such methods optimize a super-network with all possible edges and operations, and determine the optimal sub-network by discretization, i.e., pruning off operations/edges of small weights. However, the discretization process performed on either operations or edges incurs significant inaccuracy and thus the quality of the architecture is not guaranteed. In this paper, we propose discretization-aware architecture search (DA2S), and target at pushing the super-network towards the configuration of desired topology. DA2S is implemented with an entropy-based loss term, which can be regularized to differentiable architecture search in a plug-and-play fashion. The regularization is controlled by elaborated continuation functions, so that discretization is adaptive to the dynamic change of edges and operations. Experiments on standard image classification benchmarks demonstrate the effectiveness of our approach, in particular, under imbalanced network configurations that were not studied before. Code is available at github.com/sunsmarterjie/DAAS.
computer science, artificial intelligence,engineering, electrical & electronic
What problem does this paper attempt to address?