Efficient Architectural Exploration of TAGE Branch Predictor for Embedded Processors.

Libo Huang,Qi Yu,Chaobing Zhou,Jianqiao Ma,Zhisheng Li,Qiang Dou
DOI: https://doi.org/10.1016/j.mejo.2019.04.019
2019-01-01
Abstract:Embedded processors are usually limited by silicon budget and power consumption, and utilizing limited resources to design an accurate branch predictor becomes an urgent issue. In this paper, we conduct design space explorations of TAGE predictor with ultra-small RAM for embedded processors. We first define the design space exploration problem, and then we propose PSO-based exploration framework to explore parameters of TAGE predictor. To conduct explorations with different purposes, we propose a composite metric which integrates prediction accuracy, area, and power consumption. For explorations only considering prediction accuracy, results show that compared with Bi-mode and GShare predictor, parameters explored by our method achieves better performance-area efficiency. For explorations considering the area and power consumption, results show that compared to accuracy-first exploration, area-first exploration achieves better performance-area and performance-power efficiency. In addition, area-first exploration and power-first exploration have similar results. We also study how training traces impact overall performance. Results show that the performance gap between different training traces is small and our method is insensitive to traces selected for exploration.
What problem does this paper attempt to address?