Abstract:With the increasing application of IoT devices, the memory subsystem, as the performance and energy bottleneck of IoT systems, has received a lot of attention. One of the keys is on-chip memory which can bridge the performance gap between the CPU and main memory. While many off-the-shelf embedded processors utilize the hybrid on-chip memory architecture containing scratchpad memories (SPMs) and caches, most existing literature ignores the collaboration between caches and SPMs. This paper proposes static SPM allocation strategies for the architecture mentioned above in IoT systems, which try to minimize the overall instruction memory subsystem latency and/or energy consumption. We capture the intra- and inter-task cache conflict misses via a fine-grained temporal cache behavior model. Based on this cache conflict information, we propose an integer linear programming (ILP) algorithm to generate an optimal static function level SPM allocation for system performance. Furthermore, to improve the scalability of the proposed allocation scheme for an enormous task set, we offer the interference factor to calculate the interference impact quantitatively. Then, based on the interference factor, we present two approximate knapsack based heuristic algorithms to provide near optimal static allocation schemes at both function- and basic block-level granularities, which favors fast design space exploration. The experiment results demonstrate that the proposed solution achieves a 30.85% improvement in memory performance, and up to 31.39% reduction in energy consumption, compared to the existing SPM allocation scheme at the function level. In addition, the proposed basic block level allocation algorithm shows better performance than our function level allocation algorithm and other basic block level allocation algorithm.

Data Allocation for Embedded Systems with Hybrid On-Chip Scratchpad and Caches

Data Allocation Optimization for Hybrid Scratch Pad Memory with SRAM and Nonvolatile Memory

Fast and Accurate Code Placement of Embedded Software for Hybrid On-Chip Memory Architecture

Managing hybrid on-chip scratchpad and cache memories for multi-tasking embedded systems

Optimizing Data Allocation for Loops on Embedded Systems with Scratch-Pad Memory

Optimal Data Allocation for Scratch-Pad Memory on Embedded Multi-core Systems

Optimizing Code Allocation for Hybrid On-Chip Memory in Iot Systems

Temperature-Aware Data Allocation for Embedded Systems with Cache and Scratchpad Memory

DPA: Demand-Based Partition and Data Allocation for Hybrid On-Chip Memory

A Semi-automatic Scratchpad Memory Management Framework for CMP.

Optimizing Data Allocation and Memory Configuration for Non-Volatile Memory Based Hybrid SPM on Embedded CMPs.

Energy Optimization for Data Allocation with Hybrid SRAM+NVM SPM

Core Working Set Based Scratchpad Memory Management.

Energy-Oriented Dynamic SPM Allocation Based on Time-Slotted Cache Conflict Graph

Low-Power Low-Latency Data Allocation for Hybrid Scratch-Pad Memory

Compiler-Assisted Dynamic Scratch-Pad Memory Management With Space Overlapping For Embedded Systems

Management and Optimization for Nonvolatile Memory-Based Hybrid Scratchpad Memory on Multicore Embedded Processors

Data Allocation for Hybrid Memory with Genetic Algorithm

WCET-Aware Energy-Efficient Data Allocation on Scratchpad Memory for Real-Time Embedded Systems

Optimal data allocation algorithm for loop-centric applications on scratch-PAD memories

TTEC: Data Allocation Optimization for Morphable Scratchpad Memory in Embedded Systems