Parallel Programming and Its Architectures Based on Data Access Separated Algorithm Kernels

Dake Liu,Joar Sohl,Jian Wang
DOI: https://doi.org/10.4018/jertcs.2010103004
2012-01-01
Abstract:A novel master-multi-SIMD architecture and its kernel (template) based parallel programming flow is introduced as a parallel signal processing platform. The name of the platform is ePUMA (embedded Parallel DSP processor architecture with Unique Memory Access). The essential technology is to separate data accessing kernels from arithmetic computing kernels so that the run-time cost of data access can be minimized by running it in parallel with algorithm computing. The SIMD memory subsystem architecture based on the proposed flow dramatically improves the total computing performance. The hardware system and programming flow introduced in this article will primarily aim at low-power high-performance embedded parallel computing with low silicon cost for communications and similar real-time signal processing.
What problem does this paper attempt to address?