Abstract:As virtualization becomes a key technique for supporting cloud computing, much effort has been made to reduce virtualization overhead, so a virtualized system can match its native performance. One major overhead is due to memory or page table virtualization. Conventional virtual machines rely on a shadow mechanism to manage page tables, where a shadow page table maintained by the VMM (Virtual Machine Monitor) maps virtual addresses to machine addresses while a guest maintains its own virtual to physical page table. This shadow mechanism will result in expensive VM exits whenever there is a page fault that requires synchronization between the two page tables. To avoid this cost, both Intel and AMD provide hardware assists, EPT (extended page table) and NPT (nested page table), to facilitate address translation. With the hardware assists, the MMU (Memory Management Unit) maintains an ordinary guest page table that translates virtual addresses to guest physical addresses. In addition, the extended page table as provided by EPT translates from guest physical addresses to host physical or machine addresses. NPT works in a similar style. With EPT or NPT, a guest page fault can be handled by the guest itself without triggering VM exits. However, the hardware assists do have their disadvantage compared to the conventional shadow mechanism -- the page walk yields more memory accesses and thus longer latency. Our experimental results show that neither hardware-assisted paging (HAP) nor shadow paging (SP) can be a definite winner. Despite the fact that in over half of the cases, there is no noticeable gap between the two mechanisms, an up to 34% performance gap exists for a few benchmarks. We propose a dynamic switching mechanism that monitors TLB misses and guest page faults on the fly, and dynam-ically switches between the two paging modes. Our experiments show that this new mechanism can match and, sometimes, even beat the better performance of HAP and SP.

Skip TLB flushes for reused pages within mmap's

FlexPointer: Fast Address Translation Based on Range TLB and Tagged Pointers

Nomad: Non-Exclusive Memory Tiering via Transactional Page Migration

RPFF: A Remote Page-Fault Filter for Post-copy Live Migration

UMap: Enabling Application-driven Optimizations for Page Management

numaPTE: Managing Page-Tables and TLBs on NUMA Systems

Adaptive Page Migration Policy With Huge Pages in Tiered Memory Systems

Page Tables: Keeping them Flat and Hot (Cached)

Selective Hardware/software Memory Virtualization

Page Table Management for Heterogeneous Memory Systems

User Mode Memory Page Management: An old idea applied anew to the memory wall problem

LearnedFTL: A Learning-Based Page-Level FTL for Reducing Double Reads in Flash-Based SSDs

Translation look-aside buffer with consecutive page merging and recycling

Optimizing Performance of Persistent Memory File Systems Using Virtual Superpages.

Swift shadow paging (SSP): no write-protection but following TLB flushing

Exploiting Superpages in a Nonvolatile Memory File System.

On-chip memory dynamic optimization for embedded Linux

Optimizing File Systems on Heterogeneous Memory by Integrating DRAM Cache with Virtual Memory Management.

(No)Compromis: paging virtualization is not a fatality

TPP: Transparent Page Placement for CXL-Enabled Tiered-Memory

Redesign the Memory Allocator for Non-Volatile Main Memory.