Sifter: an Efficient Operator Auto-Tuner with Speculative Design Space Exploration for Deep Learning Compiler

Qianhe Zhao, Rui Wang, Yi Liu, Hailong Yang, Zhongzhi Luan, Depei Qian
DOI: https://doi.org/10.1109/tc.2024.3441820
IF: 3.183
2024-01-01
IEEE Transactions on Computers
Abstract: Deep learning compilers can automatically optimize operators and offer greater flexibility than vendor libraries. However, existing DNN operator tuning methods mostly rely on search-based approaches, which still face challenges such as large design spaces and long tuning times. To address these issues, we propose Sifter, an efficient DNN operator auto-tuner with speculative design space exploration. By training and analyzing decision trees, we extract shared characteristics of high-quality schedules and summarize them as pruning rules. Applying these rules during optimization allows us to speculatively explore the design space, minimize unnecessary hardware measurements, and shorten the optimization time without compromising the optimization result. We conducted experiments on three different platforms with various operators and models. The results demonstrate that Sifter reduces redundant schedules by 52% and shortens the optimization time by 41% while maintaining operator optimization performance at the state-of-the-art level.
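The idea of turning measured schedules into pruning rules can be illustrated with a minimal Python sketch. It is not Sifter's implementation: a single-threshold decision stump stands in for the decision trees the abstract mentions, the `tile` knob and the cost function are toy assumptions, and every function name here (`learn_stump`, `make_rule`, `speculative_search`) is hypothetical.

```python
# Hypothetical sketch; none of these names come from Sifter itself.

def learn_stump(samples, labels, feature):
    """Find the threshold on `feature` that best separates good (True)
    from bad (False) schedules. Returns (threshold, keep_above, accuracy),
    or None if the feature takes only one distinct value."""
    values = sorted({s[feature] for s in samples})
    best = None
    for lo, hi in zip(values, values[1:]):
        t = (lo + hi) / 2  # candidate split between adjacent values
        for keep_above in (True, False):
            correct = sum(
                ((s[feature] > t) == keep_above) == y
                for s, y in zip(samples, labels)
            )
            acc = correct / len(samples)
            if best is None or acc > best[2]:
                best = (t, keep_above, acc)
    return best


def make_rule(feature, threshold, keep_above):
    """Turn a learned split into a predicate: True means 'worth measuring'."""
    def rule(schedule):
        return (schedule[feature] > threshold) == keep_above
    return rule


def speculative_search(candidates, rules, measure):
    """Measure only candidates that pass every rule.
    Returns (best_candidate, number_of_measurements)."""
    survivors = [c for c in candidates if all(r(c) for r in rules)]
    return min(survivors, key=measure), len(survivors)


# Toy history: schedules already measured, labelled good/bad (assumed data).
history = [{"tile": t} for t in (1, 2, 4, 8, 16, 32)]
labels = [False, False, False, True, True, True]

threshold, keep_above, accuracy = learn_stump(history, labels, "tile")
rule = make_rule("tile", threshold, keep_above)

# New candidates; `measure` is a stand-in for a real hardware measurement.
candidates = [{"tile": t} for t in (1, 2, 4, 8, 16, 32, 64)]
measure = lambda c: abs(c["tile"] - 16)

best, n_measured = speculative_search(candidates, [rule], measure)
```

In this toy run the rule filters out the three smallest tile sizes, so only four of the seven candidates reach the (simulated) measurement step. A real tuner would learn rules over many schedule features and would need a fallback for the case where the rules reject every candidate.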