Abstract:Non-volatile memory technologies are among the most promising technologies for implementing the main memories and caches in future microprocessors and replacing the traditional DRAM and SRAM technologies. However, one of the most challenging design issues of the non-volatile memory technologies is the limited write. In this article, we first propose to exploit the narrow-width values to improve the lifetime of non-volatile last-level caches with word-level write variation reduction. Leading zeros masking scheme is proposed to reduce the write stress to the upper half of the narrow-width data. To balance the write variations between the upper half and the lower half of the narrow-width data, two swapping schemes, the swap on write (SW) and swap on replacement (SRepl), are proposed. Two existing optimization schemes, the multiple dirty bit (MDB) and read before write (RBW), are adopted with our word-level swapping design. To further reduce the write variation on the partition level, we propose to exploit the cache partitioning design to improve the lifetime. Based on the observation that different applications demonstrate different cache access (write) behaviors, we propose to partition the last-level cache for different applications and balance the write variations by partition swapping. Both software-based and hardware-based partitioning and swapping schemes are proposed and evaluated for different situations. Our experimental results show that both our word-and partition-level designs can improve the lifetime of the non-volatile caches effectively with low performance and energy overheads.

Optimal Loop Tiling for Minimizing Write Operations on NVMs with Complete Memory Latency Hiding

Loop Interchange and Tiling for Multi-Dimensional Loops to Minimize Write Operations on NVMs

Efficient Loop Scheduling for Chip Multiprocessors with Non-Volatile Main Memory

Leveraging emerging nonvolatile memory in high-level synthesis with loop transformations

Nonvolatile memory allocation and hierarchy optimization for high-level synthesis

Optimal task allocation on non-volatile memory based hybrid main memory

Minimizing Write Activities to Non-Volatile Memory Via Scheduling and Recomputation

Checkpointing-Aware Loop Tiling for Energy Harvesting Powered Nonvolatile Processors.

Quail: Using NVM Write Monitor to Enable Transparent Wear-Leveling.

Write Activity Reduction on Non-volatile Memories via Data Migration and Recomputation for Embedded CMPs

Task Allocation on Nonvolatile-Memory-Based Hybrid Main Memory

Write Activity Minimization for Nonvolatile Main Memory Via Scheduling and Recomputation

Scheduling to Optimize Cache Utilization for Non-Volatile Main Memories.

Efficient Subgraph Matching on Non-volatile Memory.

Redesign the Memory Allocator for Non-Volatile Main Memory.

Write Activity Reduction on Non-Volatile Main Memories for Embedded Chip Multiprocessors.

A Method for Hiding the Increased Non-Volatile Cache Read Latency

Demystifying the Performance of HPC Scientific Applications on NVM-based Memory Systems

Enhancing security ofNVM-based main memory with dynamicFeistel networkmapping

Word- and Partition-LevelWrite Variation Reduction for Improving Non-Volatile Cache Lifetime

Memory-Aware Loop Mapping on Coarse-Grained Reconfigurable Architectures