Hierarchical Memory-Constrained Operator Scheduling of Neural Architecture Search Networks

Zihan Wang,Chengcheng Wan,Yuting Chen,Ziyi Lin,He Jiang,Lei Qiao
DOI: https://doi.org/10.1145/3489517.3530472
2022-01-01
Abstract:Neural Architecture Search (NAS) is widely used in industry, searching for neural networks meeting task requirements. Meanwhile, it faces a challenge in scheduling networks satisfying memory constraints. This paper proposes HMCOS that performs hierarchical memory-constrained operator scheduling of NAS networks: given a network, HMCOS constructs a hierarchical computation graph and employs an iterative scheduling algorithm to progressively reduce peak memory footprints. We evaluate HMCOS against RPO and Serenity (two popular scheduling techniques). The results show that HMCOS outperforms existing techniques in supporting more NAS networks, reducing 8.7~42.4% of peak memory footprints, and achieving 137--283x of speedups in scheduling.
What problem does this paper attempt to address?