Powering One-Shot Topological NAS with Stabilized Share-Parameter Proxy.

Ronghao Guo, Chen Lin, Chuming Li, Keyu Tian, Ming Sun, Lu Sheng, Junjie Yan
DOI: https://doi.org/10.1007/978-3-030-58568-6_37
2020-01-01
Abstract: One-shot NAS methods have attracted much interest from the research community due to their remarkable training efficiency and capacity to discover high-performance models. However, the search spaces of previous one-shot works usually relied on hand-crafted design and lacked flexibility in network topology. In this work, we try to enhance one-shot NAS by exploring high-performing network architectures in our large-scale Topology Augmented Search Space (i.e., over 3.4*10^10 different topological structures). Specifically, the difficulties of architecture search in such a complex space are eliminated by the proposed stabilized share-parameter proxy, which employs Stochastic Gradient Langevin Dynamics to enable fast shared-parameter sampling, so as to achieve stabilized measurement of architecture performance even in search spaces with complex topological structures. The proposed method, namely Stabilized Topological Neural Architecture Search (ST-NAS), achieves state-of-the-art performance under a Multiply-Adds (MAdds) constraint on ImageNet. Our lite model ST-NAS-A achieves 76.4% top-1 accuracy with only 326M MAdds. Our moderate model ST-NAS-B achieves 77.9% top-1 accuracy with just 503M MAdds. Both of our models offer superior performance in comparison to other concurrent works on one-shot NAS.