Simultaneous multithreading trace processors: Improving trace processors performance

Kai-feng Wang,Zhen-zhou Ji,Ming-zeng Hu
DOI: https://doi.org/10.1016/j.micpro.2005.07.004
IF: 3.503
2006-01-01
Microprocessors and Microsystems
Abstract:Trace Processors is a promising next-generation microarchitecture that exploits implicit thread-level parallelism (TLP) in conventional applications by employing aggressive control and data speculation techniques. Although high performance can be achieved by trace processors, but in fact, processing element (PE) resources are still underutilized due to frequent trace cache misses and next-trace mispeculations. When trace cache miss occurs, trace dispatch engine must stall and supply nothing to idle PE until the completion of trace construction. When next-trace mispeculation occurs, in addition to trace dispatch engine stall, all speculative execution results after the mispeculated trace must be discarded. All the operations on those squashed traces are useless. When trace processors scales up with more PEs, this problem will become more severe.
What problem does this paper attempt to address?