Time and power optimization for heterogeneous parallel embedded systems
Edwin Sha,Meikang Qiu
2007-01-01
Abstract:For heterogeneous parallel embedded systems, we exploit the time and power optimization in various aspects. In high-level architecture synthesis, we address high-level architecture synthesis for real-time Digital Signal Processing (DSP) using heterogeneous functional units (FUs). With more and more different types of FUs available, same type of operations can be processed by heterogeneous FUs with different costs, where the cost may relate to power, reliability, etc. Furthermore, some tasks may not have fixed execution time. Such tasks usually contain conditional instructions and/or operations that could have different execution times for different inputs. Therefore, for such special purpose architecture synthesis, an important problem is how to assign a proper function unit type to each operation of a DSP application and generate a schedule in such a way that we can minimize the total costs while satisfying timing constraints with guaranteed confidence probabilities. We propose several efficient algorithms to solve it. The experiments show that our algorithms can effectively reduce the total cost compared with the previous work.Low power is becoming a critical design issue and performance metric in embedded system design. DSP processor has multiple FUs and can process several instructions simultaneously. While this multiple-FU architecture can be exploited to increase instruction-level parallelism and improve time performance, it causes more power consumption. To solve this problem, several techniques have been proposed. We combine Dynamic Voltage Scaling (DVS) and soft real-time to solve the Voltage Assignment with Probability (VAP) Problem. VAP problem involves finding a voltage level to be used for each node of an Probabilistic Date Flow Graph (PDFG) in uniprocessor and multiprocessor DSP systems. This work tremendously improves the state-of-the-art techniques. Another application is heterogeneous sensor network. We apply our efficient algorithms to dynamic adjust the working mode of sensors and achieved significant energy saving. Also, we design new rotation scheduling algorithms for real-time applications that produce schedules consuming minimal energy. Furthermore, we combine data mining and prefetching to reduce energy consumption. All these three techniques significantly reduce energy consumption. Many high-performance DSP processors employ multi-bank on-chip memory to improve performance and energy consumption. This architectural feature supports higher memory bandwidth by allowing multiple data memory accesses to be executed in parallel. However, making effective use of multi-bank memory remains difficult, considering the combined effect of performance and energy requirement. In this project, our focus is to study the assignment and scheduling problem that minimizes the total energy while satisfying performance requirements. Our approach has several major contributions: First, we study the combined effects of energy-saving and performance of memory in a systematic approach. Second, we exploit the energy saving of memory with memory type assignment. Third, data locality has been improved by using variable partition.