A Model-Specific End-to-End Design Methodology for Resource-Constrained TinyML Hardware.

Yanchi Dong, Tianyu Jia, Kaixuan Du, Yiqi Jing, Qijun Wang, Pixian Zhan, Yadong Zhang, Fengyun Yan, Yufei Ma, Yun Liang, Le Ye, Ru Huang
DOI: https://doi.org/10.1109/DAC56929.2023.10247791
2023-01-01
Abstract: Tiny machine learning (TinyML) has become appealing because it enables machine learning on resource-constrained devices with ultra-low energy and a small form factor. In this paper, a model-specific end-to-end design methodology is presented for TinyML hardware design. First, we introduce an end-to-end system evaluation method using Roofline models, which considers both AI and other general-purpose computing to guide architecture design choices. Second, to improve the efficiency of AI computation, we develop an enhanced design space exploration framework, TinyScale, to enable optimal low-voltage operation for energy-efficient TinyML. Finally, we present a use-case-driven design selection method to search for the optimal hardware design across a set of application use cases. Our model-specific design methodology is evaluated on both TSMC 22nm and 55nm technology for the MLPerf Tiny benchmark and a keyword spotting (KWS) SoC design. With the help of our end-to-end design methodology, optimal TinyML hardware can be automatically explored with significant energy and energy-delay product (EDP) improvements for a diverse set of TinyML use cases.
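The abstract names two standard quantities, the Roofline bound and EDP, without defining them. The following is a minimal Python sketch of their textbook definitions only; it is not the paper's evaluation method or the TinyScale framework, and the function names and all numeric values are hypothetical, chosen purely for illustration.

```python
def attainable_throughput(peak_ops_per_s, mem_bytes_per_s, ops_per_byte):
    """Textbook Roofline bound: the lower of the compute roof and the
    memory roof (bandwidth times operational intensity)."""
    return min(peak_ops_per_s, mem_bytes_per_s * ops_per_byte)


def energy_delay_product(energy_j, delay_s):
    """EDP is energy multiplied by delay; lower is better."""
    return energy_j * delay_s


# Hypothetical accelerator figures, not taken from the paper.
PEAK_OPS_PER_S = 1e9      # compute roof
MEM_BYTES_PER_S = 1e8     # memory bandwidth

# Low operational intensity: limited by memory bandwidth.
memory_bound = attainable_throughput(PEAK_OPS_PER_S, MEM_BYTES_PER_S, 2.0)

# High operational intensity: limited by peak compute.
compute_bound = attainable_throughput(PEAK_OPS_PER_S, MEM_BYTES_PER_S, 50.0)

# Hypothetical energy (joules) and latency (seconds) for one inference.
edp = energy_delay_product(2.0, 0.5)
```

Under these assumed numbers, a workload at 2 ops/byte sits on the memory roof while one at 50 ops/byte reaches the compute roof, which is the kind of distinction a Roofline analysis is meant to expose.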